HuMoT: Human Motion Representation using Topology-Agnostic Transformers for Character Animation Retargeting (2305.18897v3)
Abstract: Motion retargeting is the long-standing problem in character animation that consists in transferring and adapting the motion of a source character to another target character. A typical application is the creation of motion sequences from off-the-shelf motions by transferring them onto new characters. Motion retargeting is also promising to increase interoperability of existing animation systems and motion databases, as they often differ in the structure of the skeleton(s) considered. Moreover, since the goal of motion retargeting is to abstract and transfer motion dynamics, effective solutions might provide expressive and powerful human motion models in which operations such as cleaning or editing are easier. In this article, we present a novel neural network architecture for retargeting that extracts an abstract representation of human motion agnostic to skeleton topology and morphology. Based on transformers, our model is able to encode and decode motion sequences with variable morphology and topology -- extending the current scope of retargeting -- while supporting skeleton topologies not seen during the training phase. More specifically, our model is structured as an autoencoder, and encoding and decoding are separately conditioned on skeleton templates to extract and control morphology and topology. Beyond motion retargeting, our model has many applications since our abstract representation is a convenient space to embed motion data from different sources. It may potentially be benefical to a number of data-driven methods, allowing them to combine scarce specialised motion datasets (e.g. with style or contact annotations) and larger general motion datasets, for improved performance and generalisation ability. Moreover, we show that our model can be useful for other applications beyond retargeting, including motion denoising and joint upsampling.
- Skeleton-Aware Networks for Deep Motion Retargeting. ACM Transactions on Graphics 39, 4, Article 62 (July 2020), 14 pages. https://doi.org/10.1145/3386569.3392462
- Learning Character-Agnostic Motion for Motion Retargeting in 2D. ACM Transactions on Graphics 38, 4, Article 75 (July 2019), 14 pages. https://doi.org/10.1145/3306346.3322999
- Adobe. 2021. Mixamo. https://www.mixamo.com Accessed: 2021-09-16.
- A Spatio-temporal Transformer for 3D Human Motion Prediction. (April 2020), 17 pages. arXiv:2004.08692
- Facial Animation with Disentangled Identity and Motion using Transformers. Computer Graphics Forum 41, 8 (Sept. 2022), 11 pages. https://doi.org/10.1111/cgf.14641
- Shape Transformers: Topology-Independent 3D Shape Models Using Transformers. Computer Graphics Forum 41, 2 (May 2022), 195–207. https://doi.org/10.1111/cgf.14468
- Kwang-Jin Choi and Hyeong-Seok Ko. 2000. Online Motion Retargetting. IEEE Transactions on Visualization and Computer Graphics 11, 5 (Jan. 2000), 223–235. https://doi.org/10.1002/1099-1778(200012)11:5<223::AID-VIS236>3.0.CO;2-5
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In 2019 Annual Conference of the North American Chapter of he Association for Computational Linguistics. Association for Computational Linguistics, 4171–4186. https://doi.org/10.18653/v1/n19-1423
- Stylistic Locomotion Modeling and Synthesis using Variational Generative Models. In International Conference on Motion, Interaction and Games. Association for Computing Machinery (ACM), Article 32, 10 pages. https://doi.org/10.1145/3359566.3360083
- Single-Shot Motion Completion with Transformer. (March 2021), 10 pages. arXiv:2103.00776
- XCiT: Cross-Covariance Image Transformers. In 35th International Conference on Neural Information Processing Systems, Vol. 34. Curran Associates Inc., 14 pages. https://proceedings.neurips.cc/paper/2021/file/a655fbe4b8d7439994aa37ddad80de56-Paper.pdf
- Michael Gleicher. 1998. Retargetting Motion to New Characters. In 25th International Conference on Computer Graphics and Interactive Techniques. Association for Computing Machinery (ACM), 33–42. https://doi.org/10.1145/280814.280820
- Robust Motion In-betweening. ACM Transactions on Graphics 39, 4, Article 60 (July 2020), 12 pages. https://doi.org/10.1145/3386569.3392480
- Real-Time Motion Retargeting to Highly Varied User-Created Morphologies. ACM Transactions on Graphics 27, 3 (Aug. 2008), 1–11. https://doi.org/10.1145/1360612.1360626
- A Deep Learning Framework for Character Motion Synthesis and Editing. ACM Transactions on Graphics 35, 4, Article 138 (July 2016), 11 pages. https://doi.org/10.1145/2897824.2925975
- Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments. IEEE Transactions on Pattern Analysis and Machine Intelligence 36, 7 (July 2014), 1325–1339. https://doi.org/10.1109/TPAMI.2013.248
- Motion Puzzle: Arbitrary Motion Style Transfer by Body Part. ACM Transactions on Graphics 41, 3, Article 33 (June 2022), 16 pages. https://doi.org/10.1145/3516429
- Alias-Free Generative Adversarial Networks. In 35th International Conference on Neural Information Processing Systems. Curran Associates Inc., 852–863. https://proceedings.neurips.cc/paper/2021/file/076ccd93ad68be51f23707988e934906-Paper.pdf
- A Style-Based Generator Architecture for Generative Adversarial Networks. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society/Computer Vision Foundation (CVF), 4396–4405. https://doi.org/10.1109/CVPR.2019.00453
- Analyzing and Improving the Image Quality of StyleGAN. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society/Computer Vision Foundation (CVF), 8107–8116. https://doi.org/10.1109/CVPR42600.2020.00813
- Motion Retargetting based on Dilated Convolutions and Skeleton-specific Loss Functions. Computer Graphics Forum 39, 2 (July 2020), 497–507. https://doi.org/10.1111/cgf.13947
- Diederik P. Kingma and Jimmy Lei Ba. 2014. Adam: A method for stochastic optimization. (Dec. 2014), 15 pages. arXiv:1412.6980
- Morphology-independent representation of motions for interactive human-like animation. Computer Graphics Forum 24, 3 (Oct. 2005), 343–351. https://doi.org/10.1111/j.1467-8659.2005.00859.x
- Context-based Style Transfer of Tokenized Gestures. Computer Graphics Forum 41, 8 (Sept. 2022), 11 pages. https://doi.org/10.1111/cgf.14645
- Jehee Lee and Sung Yong Shin. 1999. A hierarchical approach to interactive motion editing for human-like figures. In 26th International Conference on Computer Graphics and Interactive Techniques. Association for Computing Machinery (ACM), 39–48. https://doi.org/10.1145/311535.311539
- An Iterative Solution for Improving the Generalization Ability of Unsupervised Skeleton Motion Retargeting. Computers & Graphics 104 (April 2022), 129–139. https://doi.org/10.1016/j.cag.2022.04.001
- Bidirectional recurrent autoencoder for 3D skeleton motion data refinement. Computers & Graphics 81 (April 2019), 92–103. https://doi.org/10.1016/j.cag.2019.03.010
- A Perceptual-Based Noise-Agnostic 3D Skeleton Motion Data Refinement Network. IEEE Access 8 (March 2020), 52927–52940. https://doi.org/10.1109/ACCESS.2020.2980316
- PMnet: Learning of Disentangled Pose and Movement for Unsupervised Motion Retargeting. In 30th British Machine Vision Conference 2019. BMVA Press, Article 196, 13 pages. https://bmvc2019.org/wp-content/uploads/papers/0997-paper.pdf
- End-to-End Human Pose and Mesh Reconstruction with Transformers. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society/Computer Vision Foundation (CVF), 1954–1963. https://doi.org/10.1109/CVPR46437.2021.00199
- AMASS: Archive of Motion Capture As Surface Shapes. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE Computer Society/Computer Vision Foundation (CVF), 5441–5450. https://doi.org/10.1109/ICCV.2019.00554
- Correspondence-free online human motion retargeting. (Feb. 2023), 11 pages. arXiv:2302.00556
- Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision. In 2017 International Conference on 3D Vision (3DV). IEEE Computer Society, 506–516. https://doi.org/10.1109/3dv.2017.00064
- UnderPressure: Deep Learning for Foot Contact Detection, Ground Reaction Force Estimation and Footskate Cleanup. Computer Graphics Forum 41, 8 (Dec. 2022), 195–206. https://doi.org/10.1111/cgf.14635
- A Survey on Deep Learning for Skeleton-Based Human Animation. Computer Graphics Forum 41, 1 (Nov. 2021), 32 pages. https://doi.org/10.1111/cgf.14426
- Motion In-betweening via Deep Delta-Interpolator. (Jan. 2022), 11 pages. arXiv:2201.06701
- Motion In-Betweening via Two-Stage Transformers. ACM Transactions on Graphics 41, 6, Article 184 (Dec. 2022), 16 pages. https://doi.org/10.1145/3550454.3555454
- From Image to Stability: Learning Dynamics from Human Pose. In 16th European Conference on Computer Vision (ECCV). Springer International Publishing, 536–554. https://doi.org/10.1007/978-3-030-58592-1_32
- Efficient Neural Networks for Real-time Motion Style Transfer. Proceedings of the ACM on Computer Graphics and Interactive Techniques 2, 2, Article 13 (July 2019), 17 pages. https://doi.org/10.1145/3340254
- Leslie N. Smith. 2017. Cyclical Learning Rates for Training Neural Networks. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE Computer Society, 10. https://doi.org/10.1109/WACV.2017.58
- Seyoon Tak and Hyeong-Seok Ko. 2005. A Physically-Based Motion Retargeting Filter. ACM Transactions on Graphics 24, 1 (Jan. 2005), 98–117. https://doi.org/10.1145/1037957.1037963
- Attention is All You Need. In 31st International Conference on Neural Information Processing Systems. Curran Associates Inc., 6000–6010. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
- Learning-based pose edition for efficient and interactive design. Computer Animation and Virtual Worlds 32, 3-4 (June 2021), e2013. https://doi.org/10.1002/cav.2013
- Contact-Aware Retargeting of Skinned Motion. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE Computer Society/Computer Vision Foundation (CVF), 9700–9709. https://openaccess.thecvf.com/content/ICCV2021/papers/Villegas_Contact-Aware_Retargeting_of_Skinned_Motion_ICCV_2021_paper.pdf
- Neural Kinematic Networks for Unsupervised Motion Retargetting. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE Computer Society/Computer Vision Foundation (CVF), 8639–8648. https://doi.org/10.1109/CVPR.2018.00901