Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

HuMoT: Human Motion Representation using Topology-Agnostic Transformers for Character Animation Retargeting (2305.18897v3)

Published 30 May 2023 in cs.GR

Abstract: Motion retargeting is the long-standing problem in character animation that consists in transferring and adapting the motion of a source character to another target character. A typical application is the creation of motion sequences from off-the-shelf motions by transferring them onto new characters. Motion retargeting is also promising to increase interoperability of existing animation systems and motion databases, as they often differ in the structure of the skeleton(s) considered. Moreover, since the goal of motion retargeting is to abstract and transfer motion dynamics, effective solutions might provide expressive and powerful human motion models in which operations such as cleaning or editing are easier. In this article, we present a novel neural network architecture for retargeting that extracts an abstract representation of human motion agnostic to skeleton topology and morphology. Based on transformers, our model is able to encode and decode motion sequences with variable morphology and topology -- extending the current scope of retargeting -- while supporting skeleton topologies not seen during the training phase. More specifically, our model is structured as an autoencoder, and encoding and decoding are separately conditioned on skeleton templates to extract and control morphology and topology. Beyond motion retargeting, our model has many applications since our abstract representation is a convenient space to embed motion data from different sources. It may potentially be benefical to a number of data-driven methods, allowing them to combine scarce specialised motion datasets (e.g. with style or contact annotations) and larger general motion datasets, for improved performance and generalisation ability. Moreover, we show that our model can be useful for other applications beyond retargeting, including motion denoising and joint upsampling.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. Skeleton-Aware Networks for Deep Motion Retargeting. ACM Transactions on Graphics 39, 4, Article 62 (July 2020), 14 pages. https://doi.org/10.1145/3386569.3392462
  2. Learning Character-Agnostic Motion for Motion Retargeting in 2D. ACM Transactions on Graphics 38, 4, Article 75 (July 2019), 14 pages. https://doi.org/10.1145/3306346.3322999
  3. Adobe. 2021. Mixamo. https://www.mixamo.com Accessed: 2021-09-16.
  4. A Spatio-temporal Transformer for 3D Human Motion Prediction. (April 2020), 17 pages. arXiv:2004.08692
  5. Facial Animation with Disentangled Identity and Motion using Transformers. Computer Graphics Forum 41, 8 (Sept. 2022), 11 pages. https://doi.org/10.1111/cgf.14641
  6. Shape Transformers: Topology-Independent 3D Shape Models Using Transformers. Computer Graphics Forum 41, 2 (May 2022), 195–207. https://doi.org/10.1111/cgf.14468
  7. Kwang-Jin Choi and Hyeong-Seok Ko. 2000. Online Motion Retargetting. IEEE Transactions on Visualization and Computer Graphics 11, 5 (Jan. 2000), 223–235. https://doi.org/10.1002/1099-1778(200012)11:5<223::AID-VIS236>3.0.CO;2-5
  8. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In 2019 Annual Conference of the North American Chapter of he Association for Computational Linguistics. Association for Computational Linguistics, 4171–4186. https://doi.org/10.18653/v1/n19-1423
  9. Stylistic Locomotion Modeling and Synthesis using Variational Generative Models. In International Conference on Motion, Interaction and Games. Association for Computing Machinery (ACM), Article 32, 10 pages. https://doi.org/10.1145/3359566.3360083
  10. Single-Shot Motion Completion with Transformer. (March 2021), 10 pages. arXiv:2103.00776
  11. XCiT: Cross-Covariance Image Transformers. In 35th International Conference on Neural Information Processing Systems, Vol. 34. Curran Associates Inc., 14 pages. https://proceedings.neurips.cc/paper/2021/file/a655fbe4b8d7439994aa37ddad80de56-Paper.pdf
  12. Michael Gleicher. 1998. Retargetting Motion to New Characters. In 25th International Conference on Computer Graphics and Interactive Techniques. Association for Computing Machinery (ACM), 33–42. https://doi.org/10.1145/280814.280820
  13. Robust Motion In-betweening. ACM Transactions on Graphics 39, 4, Article 60 (July 2020), 12 pages. https://doi.org/10.1145/3386569.3392480
  14. Real-Time Motion Retargeting to Highly Varied User-Created Morphologies. ACM Transactions on Graphics 27, 3 (Aug. 2008), 1–11. https://doi.org/10.1145/1360612.1360626
  15. A Deep Learning Framework for Character Motion Synthesis and Editing. ACM Transactions on Graphics 35, 4, Article 138 (July 2016), 11 pages. https://doi.org/10.1145/2897824.2925975
  16. Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments. IEEE Transactions on Pattern Analysis and Machine Intelligence 36, 7 (July 2014), 1325–1339. https://doi.org/10.1109/TPAMI.2013.248
  17. Motion Puzzle: Arbitrary Motion Style Transfer by Body Part. ACM Transactions on Graphics 41, 3, Article 33 (June 2022), 16 pages. https://doi.org/10.1145/3516429
  18. Alias-Free Generative Adversarial Networks. In 35th International Conference on Neural Information Processing Systems. Curran Associates Inc., 852–863. https://proceedings.neurips.cc/paper/2021/file/076ccd93ad68be51f23707988e934906-Paper.pdf
  19. A Style-Based Generator Architecture for Generative Adversarial Networks. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society/Computer Vision Foundation (CVF), 4396–4405. https://doi.org/10.1109/CVPR.2019.00453
  20. Analyzing and Improving the Image Quality of StyleGAN. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society/Computer Vision Foundation (CVF), 8107–8116. https://doi.org/10.1109/CVPR42600.2020.00813
  21. Motion Retargetting based on Dilated Convolutions and Skeleton-specific Loss Functions. Computer Graphics Forum 39, 2 (July 2020), 497–507. https://doi.org/10.1111/cgf.13947
  22. Diederik P. Kingma and Jimmy Lei Ba. 2014. Adam: A method for stochastic optimization. (Dec. 2014), 15 pages. arXiv:1412.6980
  23. Morphology-independent representation of motions for interactive human-like animation. Computer Graphics Forum 24, 3 (Oct. 2005), 343–351. https://doi.org/10.1111/j.1467-8659.2005.00859.x
  24. Context-based Style Transfer of Tokenized Gestures. Computer Graphics Forum 41, 8 (Sept. 2022), 11 pages. https://doi.org/10.1111/cgf.14645
  25. Jehee Lee and Sung Yong Shin. 1999. A hierarchical approach to interactive motion editing for human-like figures. In 26th International Conference on Computer Graphics and Interactive Techniques. Association for Computing Machinery (ACM), 39–48. https://doi.org/10.1145/311535.311539
  26. An Iterative Solution for Improving the Generalization Ability of Unsupervised Skeleton Motion Retargeting. Computers & Graphics 104 (April 2022), 129–139. https://doi.org/10.1016/j.cag.2022.04.001
  27. Bidirectional recurrent autoencoder for 3D skeleton motion data refinement. Computers & Graphics 81 (April 2019), 92–103. https://doi.org/10.1016/j.cag.2019.03.010
  28. A Perceptual-Based Noise-Agnostic 3D Skeleton Motion Data Refinement Network. IEEE Access 8 (March 2020), 52927–52940. https://doi.org/10.1109/ACCESS.2020.2980316
  29. PMnet: Learning of Disentangled Pose and Movement for Unsupervised Motion Retargeting. In 30th British Machine Vision Conference 2019. BMVA Press, Article 196, 13 pages. https://bmvc2019.org/wp-content/uploads/papers/0997-paper.pdf
  30. End-to-End Human Pose and Mesh Reconstruction with Transformers. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society/Computer Vision Foundation (CVF), 1954–1963. https://doi.org/10.1109/CVPR46437.2021.00199
  31. AMASS: Archive of Motion Capture As Surface Shapes. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE Computer Society/Computer Vision Foundation (CVF), 5441–5450. https://doi.org/10.1109/ICCV.2019.00554
  32. Correspondence-free online human motion retargeting. (Feb. 2023), 11 pages. arXiv:2302.00556
  33. Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision. In 2017 International Conference on 3D Vision (3DV). IEEE Computer Society, 506–516. https://doi.org/10.1109/3dv.2017.00064
  34. UnderPressure: Deep Learning for Foot Contact Detection, Ground Reaction Force Estimation and Footskate Cleanup. Computer Graphics Forum 41, 8 (Dec. 2022), 195–206. https://doi.org/10.1111/cgf.14635
  35. A Survey on Deep Learning for Skeleton-Based Human Animation. Computer Graphics Forum 41, 1 (Nov. 2021), 32 pages. https://doi.org/10.1111/cgf.14426
  36. Motion In-betweening via Deep Delta-Interpolator. (Jan. 2022), 11 pages. arXiv:2201.06701
  37. Motion In-Betweening via Two-Stage Transformers. ACM Transactions on Graphics 41, 6, Article 184 (Dec. 2022), 16 pages. https://doi.org/10.1145/3550454.3555454
  38. From Image to Stability: Learning Dynamics from Human Pose. In 16th European Conference on Computer Vision (ECCV). Springer International Publishing, 536–554. https://doi.org/10.1007/978-3-030-58592-1_32
  39. Efficient Neural Networks for Real-time Motion Style Transfer. Proceedings of the ACM on Computer Graphics and Interactive Techniques 2, 2, Article 13 (July 2019), 17 pages. https://doi.org/10.1145/3340254
  40. Leslie N. Smith. 2017. Cyclical Learning Rates for Training Neural Networks. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE Computer Society, 10. https://doi.org/10.1109/WACV.2017.58
  41. Seyoon Tak and Hyeong-Seok Ko. 2005. A Physically-Based Motion Retargeting Filter. ACM Transactions on Graphics 24, 1 (Jan. 2005), 98–117. https://doi.org/10.1145/1037957.1037963
  42. Attention is All You Need. In 31st International Conference on Neural Information Processing Systems. Curran Associates Inc., 6000–6010. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
  43. Learning-based pose edition for efficient and interactive design. Computer Animation and Virtual Worlds 32, 3-4 (June 2021), e2013. https://doi.org/10.1002/cav.2013
  44. Contact-Aware Retargeting of Skinned Motion. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE Computer Society/Computer Vision Foundation (CVF), 9700–9709. https://openaccess.thecvf.com/content/ICCV2021/papers/Villegas_Contact-Aware_Retargeting_of_Skinned_Motion_ICCV_2021_paper.pdf
  45. Neural Kinematic Networks for Unsupervised Motion Retargetting. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE Computer Society/Computer Vision Foundation (CVF), 8639–8648. https://doi.org/10.1109/CVPR.2018.00901
Citations (1)

Summary

We haven't generated a summary for this paper yet.