Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 37 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 10 tok/s Pro
GPT-5 High 15 tok/s Pro
GPT-4o 84 tok/s Pro
Kimi K2 198 tok/s Pro
GPT OSS 120B 448 tok/s Pro
Claude Sonnet 4 31 tok/s Pro
2000 character limit reached

LADDER: An Efficient Framework for Video Frame Interpolation (2404.11108v1)

Published 17 Apr 2024 in cs.CV

Abstract: Video Frame Interpolation (VFI) is a crucial technique in various applications such as slow-motion generation, frame rate conversion, video frame restoration etc. This paper introduces an efficient video frame interpolation framework that aims to strike a favorable balance between efficiency and quality. Our framework follows a general paradigm consisting of a flow estimator and a refinement module, while incorporating carefully designed components. First of all, we adopt depth-wise convolution with large kernels in the flow estimator that simultaneously reduces the parameters and enhances the receptive field for encoding rich context and handling complex motion. Secondly, diverging from a common design for the refinement module with a UNet-structure (encoder-decoder structure), which we find redundant, our decoder-only refinement module directly enhances the result from coarse to fine features, offering a more efficient process. In addition, to address the challenge of handling high-definition frames, we also introduce an innovative HD-aware augmentation strategy during training, leading to consistent enhancement on HD images. Extensive experiments are conducted on diverse datasets, Vimeo90K, UCF101, Xiph and SNU-FILM. The results demonstrate that our approach achieves state-of-the-art performance with clear improvement while requiring much less FLOPs and parameters, reaching to a better spot for balancing efficiency and quality.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Depth-Aware Video Frame Interpolation. In IEEE Conference on Computer Vision and Pattern Recognition.
  2. Two deterministic half-quadratic regularization algorithms for computed imaging. In ICIP, volume 2, 168–172 vol.2.
  3. Video Frame Interpolation via Deformable Separable Convolution. In AAAI.
  4. Channel Attention Is All You Need for Video Frame Interpolation. In AAAI.
  5. Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs. arXiv preprint arXiv:2203.06717.
  6. Residual Conv-Deconv Grid Network for Semantic Segmentation. In Proceedings of the British Machine Vision Conference, 2017.
  7. Fourier space losses for efficient perceptual image super-resolution. In ICCV.
  8. Many-to-many Splatting for Efficient Video Frame Interpolation.
  9. Real-Time Intermediate Flow Estimation for Video Frame Interpolation. In Proceedings of the European Conference on Computer Vision (ECCV).
  10. Super slomo: High quality estimation of multiple intermediate frames for video interpolation. In CVPR.
  11. IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
  12. AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
  13. AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  14. Decoupled weight decay regularization. In ICLR.
  15. Video Frame Interpolation with Transformer. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  16. Montgomery, C. 1994. Xiph.org video test media (derf’s collection). In Online,https://media.xiph.org/video/derf/.
  17. Context-aware Synthesis for Video Frame Interpolation. In CVPR.
  18. Context-aware synthesis for video frame interpolation. In CVPR.
  19. Softmax Splatting for Video Frame Interpolation. In IEEE Conference on Computer Vision and Pattern Recognition.
  20. Video Frame Interpolation via Adaptive Convolution. In IEEE Conference on Computer Vision and Pattern Recognition.
  21. Video Frame Interpolation via Adaptive Separable Convolution. In IEEE International Conference on Computer Vision.
  22. BMBC: Bilateral Motion Estimation with Bilateral Cost Volume for Video Interpolation. In European Conference on Computer Vision.
  23. Asymmetric Bilateral Motion Estimation for Video Frame Interpolation. In International Conference on Computer Vision.
  24. Im-net for high resolution video frame interpolation. In CVPR.
  25. Large Kernel Matters – Improve Semantic Segmentation by Global Convolutional Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  26. FILM: Frame Interpolation for Large Motion. In European Conference on Computer Vision (ECCV).
  27. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Navab, N.; Hornegger, J.; III, W. M. W.; and Frangi, A. F., eds., Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015 - 18th International Conference Munich, Germany, October 5 - 9, 2015, Proceedings, Part III, volume 9351 of Lecture Notes in Computer Science, 234–241. Springer.
  28. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild. CoRR, abs/1212.0402.
  29. Video Compression through Image Interpolation. In ECCV.
  30. Optimizing Video Prediction via Video Frame Interpolation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition.
  31. Quadratic video interpolation. In NeurIPS.
  32. Video Enhancement with Task-Oriented Flow. International Journal of Computer Vision (IJCV), 127(8): 1106–1125.
  33. Extracting motion and appearance via inter-frame attention for efficient video frame interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5682–5692.
  34. View Synthesis by Appearance Flow. In European Conference on Computer Vision.

Summary

We haven't generated a summary for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com