Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming (2403.10976v1)
Abstract: Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. XPSNR is observed to correlate better with the subjective quality of VVC-coded bitstreams. Towards this realization, we use an XGBoost-based model to predict the average XPSNR of VVC-coded bitstreams from the spatiotemporal complexity features of the video and the target encoding configuration. Based on the predicted XPSNR scores, we introduce a Quality-Aware Dynamic Resolution Adaptation (QADRA) framework for adaptive video streaming applications, where we determine the convex hull online. Furthermore, keeping the encoding and decoding times within an acceptable threshold is mandatory for smooth and energy-efficient streaming. Hence, QADRA determines the encoding resolution and quantization parameter (QP) for each target bitrate by maximizing XPSNR while constraining the maximum encoding and/or decoding time below a threshold. QADRA also implements a JND-based representation elimination algorithm to remove perceptually redundant representations from the bitrate ladder. QADRA is an open-source Python-based framework published under the GNU GPLv3 license.
GitHub: https://github.com/PhoenixVideo/QADRA
Online documentation: https://phoenixvideo.github.io/QADRA/
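The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the QADRA implementation or its API: the `Candidate` class, the `select_candidate` and `build_ladder` functions, the 1.0 dB JND threshold, and all numeric values are hypothetical placeholders. In the framework the XPSNR and time values would come from the trained predictors, whereas here they are hard-coded. The sketch assumes that, for each target bitrate, a set of candidate (resolution, QP) pairs with predicted XPSNR, encoding time, and decoding time is already available.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One hypothetical encoding option for a target bitrate."""
    bitrate_kbps: int
    height: int        # encoding resolution (number of lines)
    qp: int            # quantization parameter
    xpsnr_db: float    # predicted XPSNR (placeholder value)
    enc_time_s: float  # predicted encoding time (placeholder value)
    dec_time_s: float  # predicted decoding time (placeholder value)


def select_candidate(candidates, max_enc_s=None, max_dec_s=None):
    """Return the feasible candidate with the highest predicted XPSNR.

    A candidate is feasible if its predicted encoding and decoding times
    do not exceed the given thresholds (None disables a constraint).
    Returns None when no candidate is feasible.
    """
    feasible = [
        c for c in candidates
        if (max_enc_s is None or c.enc_time_s <= max_enc_s)
        and (max_dec_s is None or c.dec_time_s <= max_dec_s)
    ]
    if not feasible:
        return None
    return max(feasible, key=lambda c: c.xpsnr_db)


def build_ladder(candidates_by_bitrate, max_enc_s=None, max_dec_s=None,
                 jnd_db=1.0):
    """Build a bitrate ladder, skipping perceptually redundant rungs.

    A rung is dropped when its predicted XPSNR exceeds that of the last
    kept rung by less than jnd_db (an assumed threshold, in dB).
    """
    ladder = []
    for bitrate in sorted(candidates_by_bitrate):
        best = select_candidate(candidates_by_bitrate[bitrate],
                                max_enc_s, max_dec_s)
        if best is None:
            continue  # no option meets the time constraints
        if ladder and best.xpsnr_db - ladder[-1].xpsnr_db < jnd_db:
            continue  # less than one JND above the previous rung
        ladder.append(best)
    return ladder


# Placeholder predictions, keyed by target bitrate in kbps.
predictions = {
    1000: [Candidate(1000, 540, 34, 34.0, 2.0, 0.20),
           Candidate(1000, 1080, 42, 33.0, 6.0, 0.50)],
    1500: [Candidate(1500, 540, 31, 34.5, 2.5, 0.25),
           Candidate(1500, 1080, 39, 34.8, 7.0, 0.60)],
    3000: [Candidate(3000, 1080, 33, 37.0, 8.0, 0.70),
           Candidate(3000, 2160, 40, 38.0, 30.0, 1.50)],
}

ladder = build_ladder(predictions, max_enc_s=10.0, max_dec_s=1.0, jnd_db=1.0)
for rung in ladder:
    print(rung.bitrate_kbps, rung.height, rung.qp)
# prints:
# 1000 540 34
# 3000 1080 33
```

With these placeholder numbers, the 1500 kbps rung is dropped because its best option is only 0.8 dB above the 1000 kbps rung, and the 2160-line option at 3000 kbps is rejected because its predicted encoding time exceeds the 10 s limit.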
Authors:
- Amritha Premkumar
- Vignesh V Menon
- Adam Wieckowski
- Benjamin Bross
- Detlev Marpe
- Prajit T Rajendran