
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images (2109.05885v1)

Published 13 Sep 2021 in cs.CV

Abstract: This paper studies the task of estimating the 3D human poses of multiple persons from multiple calibrated camera views. Following the top-down paradigm, we decompose the task into two stages, i.e. person localization and pose estimation. Both stages are processed in a coarse-to-fine manner, and we propose three task-specific graph neural networks for effective message passing. For 3D person localization, we first use the Multi-view Matching Graph Module (MMG) to learn the cross-view association and recover coarse human proposals. The Center Refinement Graph Module (CRG) then refines the results via flexible point-based prediction. For 3D pose estimation, the Pose Regression Graph Module (PRG) learns both the multi-view geometry and the structural relations between human joints. Our approach achieves state-of-the-art performance on the CMU Panoptic and Shelf datasets with significantly lower computational complexity.
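
The abstract gives no implementation details, so the following is only an illustrative sketch of what "calibrated camera views" makes possible, not the paper's graph-based method: with a known 3x4 projection matrix per camera, a candidate 3D point (for example a person-center proposal) can be projected into every view. The function names, camera matrices, and test point are hypothetical and chosen purely for illustration.

```python
from typing import List, Sequence, Tuple

# A 3x4 pinhole projection matrix, P = K [R | t] (assumed known from calibration).
Matrix3x4 = Sequence[Sequence[float]]


def project_point(P: Matrix3x4, X: Sequence[float]) -> Tuple[float, float]:
    """Project a 3D point X = (x, y, z) to pixel coordinates using P."""
    xh = (X[0], X[1], X[2], 1.0)  # homogeneous coordinates
    u, v, w = (sum(row[i] * xh[i] for i in range(4)) for row in P)
    if w <= 0:
        raise ValueError("point is not in front of the camera")
    return (u / w, v / w)


def project_to_views(cameras: Sequence[Matrix3x4],
                     X: Sequence[float]) -> List[Tuple[float, float]]:
    """Project one 3D candidate into every calibrated view."""
    return [project_point(P, X) for P in cameras]


# Two hypothetical cameras sharing intrinsics (focal length 1000 px,
# principal point (320, 240)) and both looking along +z.
cam_a = [[1000, 0, 320, 0], [0, 1000, 240, 0], [0, 0, 1, 0]]
# Same orientation, camera center shifted 1 unit along +x (so t = (-1, 0, 0)).
cam_b = [[1000, 0, 320, -1000], [0, 1000, 240, 0], [0, 0, 1, 0]]

pixels = project_to_views([cam_a, cam_b], (0.5, 0.0, 5.0))
print(pixels)
```

Per-view image features sampled at such projected locations are one plausible input for cross-view reasoning, though how the MMG, CRG, and PRG modules actually consume them is not stated in the abstract.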

Authors (7)
  1. Size Wu (12 papers)
  2. Sheng Jin (69 papers)
  3. Wentao Liu (87 papers)
  4. Lei Bai (154 papers)
  5. Chen Qian (226 papers)
  6. Dong Liu (267 papers)
  7. Wanli Ouyang (358 papers)
Citations (39)
