- The paper proposes a progressive reconstruction strategy that predicts sub-band residuals at each pyramid level to efficiently generate high-resolution images.
- It lowers computational load by extracting features directly from the low-resolution input and cuts parameters via recursive layers, while deep supervision with the Charbonnier loss improves reconstruction quality.
- A single multi-scale model handles several upsampling factors and performs competitively on standard benchmarks, enabling real-time use in resource-constrained environments.
Overview of "Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks"
The paper "Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks" introduces an approach to single-image super-resolution (SISR) built on a deep Laplacian pyramid architecture. The method stands out for its combination of efficiency and accuracy, achieved through a progressive reconstruction strategy. The authors target the main computational bottlenecks of existing convolutional neural network (CNN)-based SISR methods, focusing on reducing both parameter count and computational load.
Key Contributions
- Progressive Reconstruction: The paper proposes reconstructing high-resolution (HR) images by progressively predicting sub-band residuals at multiple pyramid levels. This contrasts with prior methods that first upsample the low-resolution input with bicubic interpolation, which incurs heavy computation because the network must then process large intermediate feature maps.
- Low Computational Load: Extracting features directly from the low-resolution (LR) input keeps feature maps small, greatly improving computational efficiency. Deep supervision with the Charbonnier loss, a differentiable variant of the L1 penalty that is less sensitive to outliers than L2, further helps reduce common artifacts and leads to higher-quality reconstructions.
- Parameter Sharing and Reduction: Recursive layers share parameters across pyramid levels, reducing the number of parameters by 73% compared to the non-recursive design without compromising performance. This parameter efficiency facilitates real-time operation, striking a balance between speed and network capacity.
- Multi-Scale Training: For improved generalization across upsampling scales, the network is trained on multiple scales to naturally handle variations, thereby offering versatility in applications requiring adaptability to different resource constraints.
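The progressive reconstruction idea above can be sketched schematically: at each pyramid level the current estimate is upsampled by 2x and a predicted sub-band residual is added. This is a minimal illustration, not the authors' network; the nearest-neighbour upsampling and the placeholder residual predictors stand in for the paper's learned transposed convolutions and shared (recursive) feature-embedding branch.

```python
import numpy as np

def upsample2x(img):
    # Nearest-neighbour 2x upsampling stands in for the paper's
    # learned transposed-convolution upsampling layers.
    return img.repeat(2, axis=0).repeat(2, axis=1)

def laplacian_reconstruct(lr, residual_fns):
    """Progressively reconstruct an HR image from an LR input.

    At each pyramid level, the current estimate is upsampled by 2x
    and a predicted sub-band residual is added. `residual_fns` holds
    one residual predictor per level; applying the same function at
    every level mirrors the parameter sharing of recursive layers.
    """
    img = lr
    for predict_residual in residual_fns:
        up = upsample2x(img)
        img = up + predict_residual(up)  # sub-band residual correction
    return img

# 4x super-resolution = two 2x pyramid levels (zero residuals here).
lr = np.ones((4, 4))
hr = laplacian_reconstruct(lr, [lambda x: np.zeros_like(x)] * 2)
```

With two levels, a 4x4 input becomes a 16x16 output; in the real network each residual predictor is a deep convolutional branch trained with supervision at its own level.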
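The Charbonnier loss used for deep supervision is simple to state: rho(x) = sqrt(x^2 + eps^2), a smooth approximation of the L1 norm. A minimal NumPy sketch (the epsilon value here is a common default, not taken from the paper):

```python
import numpy as np

def charbonnier_loss(pred, target, eps=1e-3):
    """Charbonnier penalty: rho(x) = sqrt(x^2 + eps^2), averaged over
    all pixels. Behaves like L1 for large errors (robust to outliers)
    and like a smooth quadratic near zero, so it stays differentiable."""
    diff = pred - target
    return float(np.mean(np.sqrt(diff * diff + eps * eps)))
```

In the paper this loss is applied at every pyramid level (deep supervision), so intermediate predictions receive a direct training signal rather than only the final output.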
Quantitative and Qualitative Performance
The proposed framework was evaluated on standard benchmarks against state-of-the-art methods such as VDSR, DRCN, and DRRN. The results show favorable run-time efficiency alongside competitive quantitative and perceptual image quality, underscoring the network's ability to deliver accurate super-resolution outputs in significantly less computation time.
Implications and Future Directions
This research has noteworthy implications for real-time applications in resource-limited environments due to its efficient architecture. The progressive upsampling strategy not only reduces computational demands but also allows the same model to serve multiple super-resolution scales. This could spur new developments in video processing, particularly in streaming and surveillance systems where adaptive resource allocation is crucial.
Future work could explore integrating adversarial training techniques, as indicated by preliminary experiments involving GANs for enhanced perceptual quality. Moreover, the architecture's adaptation for tasks beyond super-resolution, such as inpainting or style transfer, presents an intriguing avenue for further exploration.
In conclusion, this paper contributes a robust and efficient strategy for SISR, leveraging a Laplacian Pyramid Network architecture to achieve impressive results with reduced computational burden. It stands as a significant step towards practical and scalable implementations of deep learning-based super-resolution techniques.