SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-Resolution (2312.05799v3)
Abstract: Depth super-resolution (DSR) aims to restore a high-resolution (HR) depth map from a low-resolution (LR) one, and an RGB image is often used to guide the task. Recent image-guided DSR approaches mainly rebuild depth structure in the spatial domain. However, because the structure of LR depth is usually blurry, the spatial domain alone is insufficient for satisfactory results. In this paper, we propose the structure guided network (SGNet), a method that pays more attention to the gradient and frequency domains, both of which have an inherent ability to capture high-frequency structure. Specifically, we first introduce the gradient calibration module (GCM), which employs the accurate gradient prior of the RGB image to sharpen the LR depth structure. We then present the frequency awareness module (FAM), which recursively applies multiple spectrum differencing blocks (SDB), each of which propagates the precise high-frequency components of the RGB image into the LR depth. Extensive experiments on both real and synthetic datasets show that SGNet outperforms prior methods and achieves state-of-the-art results. Code and pre-trained models are available at https://github.com/yanzq95/SGNet.
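The two domains named in the abstract can be illustrated with a small NumPy sketch. This is not the SGNet implementation: GCM and SDB are learned network modules, whereas the code below only shows the hand-crafted signals they build on. The function names (`gradient_magnitude`, `high_frequency`, `upsample_nearest`), the fixed radial `cutoff`, and the synthetic step-edge image are all assumptions made for illustration.

```python
import numpy as np

def gradient_magnitude(img):
    """Finite-difference gradient magnitude of a 2-D array (hypothetical helper)."""
    gy, gx = np.gradient(img.astype(np.float64))
    return np.hypot(gx, gy)

def high_frequency(img, cutoff=0.1):
    """Zero out spectral components at or below `cutoff` (cycles per pixel).

    The fixed radial cutoff is an assumption; it stands in for a learned filter.
    """
    h, w = img.shape
    fy = np.fft.fftfreq(h)[:, None]   # vertical frequencies
    fx = np.fft.fftfreq(w)[None, :]   # horizontal frequencies
    radius = np.sqrt(fx ** 2 + fy ** 2)
    spectrum = np.fft.fft2(img.astype(np.float64))
    spectrum[radius <= cutoff] = 0.0  # drop DC and low frequencies
    return np.real(np.fft.ifft2(spectrum))

def upsample_nearest(lr, scale):
    """Nearest-neighbour upsampling by an integer factor."""
    return np.kron(lr, np.ones((scale, scale)))

# Synthetic HR "depth": a vertical step edge at column 14 of a 32x32 map.
hr = np.zeros((32, 32))
hr[:, 14:] = 1.0

# Simulate LR depth by 4x4 block averaging, then upsample back to 32x32.
lr = hr.reshape(8, 4, 8, 4).mean(axis=(1, 3))
up = upsample_nearest(lr, 4)

# The block-averaged edge is spread over several pixels, so its peak gradient is lower.
print("peak gradient, HR:", gradient_magnitude(hr).max())
print("peak gradient, upsampled LR:", gradient_magnitude(up).max())

# High-frequency residuals that a guidance signal would need to restore.
print("high-frequency energy, HR:", np.sum(high_frequency(hr) ** 2))
print("high-frequency energy, upsampled LR:", np.sum(high_frequency(up) ** 2))
```

In this toy setting the HR map plays the role that the RGB guidance plays in the paper: it carries a sharp edge that the degraded depth has lost, which is the kind of structure the gradient and frequency branches are meant to recover.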