Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
124 tokens/sec
GPT-4o
8 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LADDER: An Efficient Framework for Video Frame Interpolation (2404.11108v1)

Published 17 Apr 2024 in cs.CV

Abstract: Video Frame Interpolation (VFI) is a crucial technique in various applications such as slow-motion generation, frame rate conversion, video frame restoration etc. This paper introduces an efficient video frame interpolation framework that aims to strike a favorable balance between efficiency and quality. Our framework follows a general paradigm consisting of a flow estimator and a refinement module, while incorporating carefully designed components. First of all, we adopt depth-wise convolution with large kernels in the flow estimator that simultaneously reduces the parameters and enhances the receptive field for encoding rich context and handling complex motion. Secondly, diverging from a common design for the refinement module with a UNet-structure (encoder-decoder structure), which we find redundant, our decoder-only refinement module directly enhances the result from coarse to fine features, offering a more efficient process. In addition, to address the challenge of handling high-definition frames, we also introduce an innovative HD-aware augmentation strategy during training, leading to consistent enhancement on HD images. Extensive experiments are conducted on diverse datasets, Vimeo90K, UCF101, Xiph and SNU-FILM. The results demonstrate that our approach achieves state-of-the-art performance with clear improvement while requiring much less FLOPs and parameters, reaching to a better spot for balancing efficiency and quality.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Depth-Aware Video Frame Interpolation. In IEEE Conference on Computer Vision and Pattern Recognition.
  2. Two deterministic half-quadratic regularization algorithms for computed imaging. In ICIP, volume 2, 168–172 vol.2.
  3. Video Frame Interpolation via Deformable Separable Convolution. In AAAI.
  4. Channel Attention Is All You Need for Video Frame Interpolation. In AAAI.
  5. Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs. arXiv preprint arXiv:2203.06717.
  6. Residual Conv-Deconv Grid Network for Semantic Segmentation. In Proceedings of the British Machine Vision Conference, 2017.
  7. Fourier space losses for efficient perceptual image super-resolution. In ICCV.
  8. Many-to-many Splatting for Efficient Video Frame Interpolation.
  9. Real-Time Intermediate Flow Estimation for Video Frame Interpolation. In Proceedings of the European Conference on Computer Vision (ECCV).
  10. Super slomo: High quality estimation of multiple intermediate frames for video interpolation. In CVPR.
  11. IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
  12. AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
  13. AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  14. Decoupled weight decay regularization. In ICLR.
  15. Video Frame Interpolation with Transformer. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  16. Montgomery, C. 1994. Xiph.org video test media (derf’s collection). In Online,https://media.xiph.org/video/derf/.
  17. Context-aware Synthesis for Video Frame Interpolation. In CVPR.
  18. Context-aware synthesis for video frame interpolation. In CVPR.
  19. Softmax Splatting for Video Frame Interpolation. In IEEE Conference on Computer Vision and Pattern Recognition.
  20. Video Frame Interpolation via Adaptive Convolution. In IEEE Conference on Computer Vision and Pattern Recognition.
  21. Video Frame Interpolation via Adaptive Separable Convolution. In IEEE International Conference on Computer Vision.
  22. BMBC: Bilateral Motion Estimation with Bilateral Cost Volume for Video Interpolation. In European Conference on Computer Vision.
  23. Asymmetric Bilateral Motion Estimation for Video Frame Interpolation. In International Conference on Computer Vision.
  24. Im-net for high resolution video frame interpolation. In CVPR.
  25. Large Kernel Matters – Improve Semantic Segmentation by Global Convolutional Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  26. FILM: Frame Interpolation for Large Motion. In European Conference on Computer Vision (ECCV).
  27. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Navab, N.; Hornegger, J.; III, W. M. W.; and Frangi, A. F., eds., Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015 - 18th International Conference Munich, Germany, October 5 - 9, 2015, Proceedings, Part III, volume 9351 of Lecture Notes in Computer Science, 234–241. Springer.
  28. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild. CoRR, abs/1212.0402.
  29. Video Compression through Image Interpolation. In ECCV.
  30. Optimizing Video Prediction via Video Frame Interpolation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition.
  31. Quadratic video interpolation. In NeurIPS.
  32. Video Enhancement with Task-Oriented Flow. International Journal of Computer Vision (IJCV), 127(8): 1106–1125.
  33. Extracting motion and appearance via inter-frame attention for efficient video frame interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5682–5692.
  34. View Synthesis by Appearance Flow. In European Conference on Computer Vision.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com