MC-NeRF: Multi-Camera Neural Radiance Fields for Multi-Camera Image Acquisition Systems (2309.07846v4)
Abstract: Neural Radiance Fields (NeRF) use multi-view images for 3D scene representation, demonstrating remarkable performance. As one of the primary sources of multi-view images, multi-camera systems encounter challenges such as varying intrinsic parameters and frequent pose changes. Most previous NeRF-based methods assume a unique camera and rarely consider multi-camera scenarios. Besides, some NeRF methods that can optimize intrinsic and extrinsic parameters still remain susceptible to suboptimal solutions when these parameters are poor initialized. In this paper, we propose MC-NeRF, a method that enables joint optimization of both intrinsic and extrinsic parameters alongside NeRF. The method also supports each image corresponding to independent camera parameters. First, we tackle coupling issue and the degenerate case that arise from the joint optimization between intrinsic and extrinsic parameters. Second, based on the proposed solutions, we introduce an efficient calibration image acquisition scheme for multi-camera systems, including the design of calibration object. Finally, we present an end-to-end network with training sequence that enables the estimation of intrinsic and extrinsic parameters, along with the rendering network. Furthermore, recognizing that most existing datasets are designed for a unique camera, we construct a real multi-camera image acquisition system and create a corresponding new dataset, which includes both simulated data and real-world captured images. Experiments confirm the effectiveness of our method when each image corresponds to different camera parameters. Specifically, we use multi-cameras, each with different intrinsic and extrinsic parameters in real-world system, to achieve 3D scene representation without providing initial poses.
- “NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis” In Computer Vision – ECCV 2020 Springer International Publishing, 2020, pp. 405–421
- “Local light field fusion: practical view synthesis with prescriptive sampling guidelines” In ACM Trans. Graph. 38.4 Association for Computing Machinery, 2019
- “Neural sparse voxel fields” In Proceedings of the 34th International Conference on Neural Information Processing Systems, NIPS’20 Vancouver, BC, Canada: Curran Associates Inc, 2020
- “Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields” In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 5460–5469
- Johannes L. Schönberger and Jan-Michael Frahm “Structure-from-Motion Revisited” In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 4104–4113
- “CamP: Camera Preconditioning for Neural Radiance Fields” In ACM Trans. Graph. 42.6 Association for Computing Machinery, 2023
- “Self-Calibrating Neural Radiance Fields” In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 5826–5834
- “NeRF–: Neural Radiance Fields Without Known Camera Parameters”, 2022 arXiv:2102.07064 [cs.CV]
- “Local-to-global registration for bundle-adjusting neural radiance fields” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 8264–8273
- “BARF: Bundle-Adjusting Neural Radiance Fields” In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 5721–5731
- Olivier Faugeras “Three-dimensional computer vision: a geometric viewpoint” MIT press, 1993
- R. Tsai “A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses” In IEEE Journal on Robotics and Automation 3.4, 1987, pp. 323–344
- Zhengyou Zhang “Flexible camera calibration by viewing a plane from unknown orientations” In Proceedings of the Seventh IEEE International Conference on Computer Vision 1, 1999, pp. 666–673 vol.1
- “Automatic Upright Adjustment of Photographs With Robust Camera Calibration” In IEEE Transactions on Pattern Analysis and Machine Intelligence 36.5, 2014, pp. 833–844
- Menghua Zhai, Scott Workman and Nathan Jacobs “Detecting Vanishing Points Using Global Image Context in a Non-ManhattanWorld” In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 5657–5665
- Qian Chen, Haiyuan Wu and Toshikazu Wada “Camera Calibration with Two Arbitrary Coplanar Circles” In Computer Vision - ECCV 2004 Springer Berlin Heidelberg, 2004, pp. 521–532
- “DeepCalib: a deep learning approach for automatic intrinsic calibration of wide field-of-view cameras” In Proceedings of the 15th ACM SIGGRAPH European Conference on Visual Media Production New York, NY, USA: Association for Computing Machinery, 2018
- “A Perceptual Measure for Deep Single Image Camera and Lens Calibration” In IEEE Transactions on Pattern Analysis and Machine Intelligence 45.9, 2023, pp. 10603–10614
- “Shape from X: Psychophysics and Computation” In Computational Models of Visual Processing, 1991, pp. 305–330
- “Multi-View Stereo for Community Photo Collections”, 2007, pp. 1–8
- “DiffPoseNet: Direct Differentiable Camera Pose Estimation” In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 6835–6844
- “Camera pose estimation using voxel-based features for autonomous vehicle localization tracking” In 2022 37th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC), 2022, pp. 185–188
- “PVO: Panoptic Visual Odometry” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2023, pp. 9579–9589
- “Vitvo: Vision transformer based visual odometry with attention supervision” In 2023 18th International Conference on Machine Vision and Applications (MVA), 2023, pp. 1–5 IEEE
- A. Kendall, M. Grimes and R. Cipolla “PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization” In 2015 IEEE International Conference on Computer Vision (ICCV) IEEE Computer Society, 2015, pp. 2938–2946
- “Deep Keypoint-Based Camera Pose Estimation with Geometric Constraints” In 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 4950–4957
- “iNeRF: Inverting Neural Radiance Fields for Pose Estimation” In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) IEEE Press, 2021, pp. 1323–1330
- “NeRF++: Analyzing and Improving Neural Radiance Fields”, 2020 arXiv:2010.07492
- “NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections” In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2021, pp. 7206–7215
- “GRAF: generative radiance fields for 3D-aware image synthesis” In Proceedings of the 34th International Conference on Neural Information Processing Systems Curran Associates Inc., 2020
- “Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields” In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 5835–5844
- “TensoIR: Tensorial Inverse Rendering” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2023, pp. 165–174
- “EfficientNeRF - Efficient Neural Radiance Fields” In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 12892–12901
- “Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields” In 2023 IEEE/CVF International Conference on Computer Vision (ICCV) IEEE Computer Society, 2023, pp. 19640–19648
- “Instant Neural Graphics Primitives with a Multiresolution Hash Encoding” In ACM Trans. Graph. 41.4 New York, NY, USA: ACM, 2022, pp. 102:1–102:15
- “TensoRF: Tensorial Radiance Fields” In Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXII Springer-Verlag, 2022, pp. 333–350
- Changchang Wu “Towards Linear-Time Incremental Structure from Motion” In 2013 International Conference on 3D Vision - 3DV 2013, 2013, pp. 127–134
- “From Dusk Till Dawn: Modeling in the Dark” In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 5488–5496
- “Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes” In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 6494–6504
- “Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction and Pose Estimation” In Computer Vision – ECCV 2022 Springer Nature Switzerland, 2022, pp. 264–280
- “SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction” In British Machine Vision Conference, 2022
- “Nerfies: Deformable Neural Radiance Fields” In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 5845–5854
- “SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization” In Neural Information Processing Systems, 2021
- “Attention is All you Need” In Advances in Neural Information Processing Systems 30 Curran Associates, Inc., 2017
- “NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2023, pp. 4160–4169
- “SPARF: Neural Radiance Fields from Sparse and Noisy Poses” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2023, pp. 4190–4200
- “DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2023, pp. 24–34
- “IBRNet: Learning Multi-View Image-Based Rendering” In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Computer Society, 2021, pp. 4688–4697
- “pixelNeRF: Neural Radiance Fields from One or Few Images” In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 4576–4585
- “GNeRF: GAN-based Neural Radiance Field without Posed Camera” In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 6331–6341
- Richard I Hartley and Andrew Zisserman “Multiple View Geometry in Computer Vision” Cambridge University Press, 2003
- “ImageNet: A large-scale hierarchical image database” In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 248–255 DOI: 10.1109/CVPR.2009.5206848
- “AliceVision Meshroom: An open-source 3D reconstruction pipeline” In Proceedings of the 12th ACM Multimedia Systems Conference - MMSys ’21 ACM Press, 2021