Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 58 tok/s
Gemini 2.5 Pro 52 tok/s Pro
GPT-5 Medium 12 tok/s Pro
GPT-5 High 17 tok/s Pro
GPT-4o 95 tok/s Pro
Kimi K2 179 tok/s Pro
GPT OSS 120B 463 tok/s Pro
Claude Sonnet 4 38 tok/s Pro
2000 character limit reached

Predictive Coding For Animation-Based Video Compression (2307.04187v1)

Published 9 Jul 2023 in cs.CV and cs.MM

Abstract: We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face motions with a compact set of sparse keypoints. However, these methods encode video in a frame-by-frame fashion, i.e. each frame is reconstructed from a reference frame, which limits the reconstruction quality when the bandwidth is larger. Instead, we propose a predictive coding scheme which uses image animation as a predictor, and codes the residual with respect to the actual target frame. The residuals can be in turn coded in a predictive manner, thus removing efficiently temporal dependencies. Our experiments indicate a significant bitrate gain, in excess of 70% compared to the HEVC video standard and over 30% compared to VVC, on a datasetof talking-head videos

Definition Search Book Streamline Icon: https://streamlinehq.com
References (18)
  1. “Ultra-low bitrate video conferencing using deep image animation,” in IEEE ICASSP, 2020.
  2. “A hybrid animation codec for low bitrate video conferencing,” in IEEE ICIP, 2022.
  3. “One-shot free-view neural talking-head synthesis for video conferencing,” in CVPR, 2021.
  4. “Low bandwidth video-chat compression using deep generative models,” arXiv preprint arXiv:2012.00328, 2020.
  5. “Beyond keypoint coding: Temporal evolution inference with compact feature representation for talking face video compression,” in DCC, 2022.
  6. “Robust ultralow bitrate video conferencing with second order motion coherency,” in 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP), 2022, pp. 1–6.
  7. “First order motion model for image animation,” in Neurips, 2019.
  8. “Real-time adaptive image compression,” arXiv preprint arXiv:1705.05823, 2017.
  9. “Variational image compression with a scale hyperprior,” arXiv preprint arXiv:1802.01436, 2018.
  10. “Generative adversarial networks for extreme learned image compression,” in IEEE ICCV, 2019.
  11. “Fvc: A new frame- work towards deep video compression in feature space,” in IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2021.
  12. “End-to-end optimized image compression,” Neurips, 2016.
  13. “Dvc: An end-to-end deep video compression framework,” in IEEE CVPR, 2019.
  14. “Vct: A video compression transformer,” in NeurIPS, 2022.
  15. J. Cleary and I. Witten, “Data compression using adaptive coding and partial string matching,” IEEE Transactions on Communications, vol. 32, no. 4, pp. 396–402, 1984.
  16. “The unreasonable effectiveness of deep features as a perceptual metric,” in CVPR, 2018.
  17. “Very deep convolutional networks for large scale image recognition,” in ICLR, 2015.
  18. “Image quality assessment: Unifying structure and texture similarity,” in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, vol. 44, pp. 2567–2581.
Citations (6)

Summary

We haven't generated a summary for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube