深度学习驱动的视频编码:方法、进展与展望
作者:
作者单位:

1四川大学电子信息学院,成都 610065;2四川大学网络空间安全学院,成都 610065

作者简介:

通讯作者:

基金项目:

国家自然科学基金(62271336,62211530110);四川省国际科技创新合作/港澳台科技创新合作项目(2024YFHZ0289);TCL科技创新基金(0020506107005);成都市重点研发支撑计划(2024-YF06-00079-HZ)。


Deep Learning-Driven Video Coding: Methods, Progress, and Perspectives
Author:
Affiliation:

1College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China;2School of Cyber Science and Engineering, Sichuan University, Chengdu 610065, China

Fund Project:

National Natural Science Foundation of China (Nos.62271336, 62211530110); The Key Research and Development Program of Sichuan Province (No.2024YFHZ0289); TCL Science and Technology Innovation Fund (No.0020506107005); The Key Research and Development Support Program of Chengdu (No.2024-YF06-00079-HZ).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    随着视频数据量的爆炸式增长,有限的网络带宽和高计算资源需求对视频传输与存储提出了严峻挑战。在此背景下,持续开发高效的视频编码方法以保障在资源受限条件下提供高质量视频服务具有至关重要的理论意义与应用价值。然而,传统混合视频编码框架已逐渐遭遇瓶颈,编码性能的进一步提升越来越困难。近年来,深度学习凭借其强大的非线性拟合与表征能力,为视频编码领域的优化带来了契机。本文对基于深度学习驱动的视频编码技术进行了系统而详细的分析。首先,简要介绍传统编码框架下的视频编码技术,并进一步探讨结合深度学习在帧内/帧间预测等关键模块中的优化;然后,重点讨论了基于深度学习的端到端视频编码框架的发展历程及关键技术路线,并对其性能进行对比分析;最后,进一步介绍深度学习在视频编码领域的重要研究成果,剖析现有技术所面临的挑战和局限性,并对未来视频编码技术的发展趋势进行了展望。

    Abstract:

    With the explosive growth of video data, limited network bandwidth and high computational demands pose significant challenges for video transmission and storage. In this context, the continuous development of efficient video coding methods is of critical theoretical significance and practical value, as it ensures the delivery of high-quality video services under resource-constrained conditions. However, traditional hybrid video coding frameworks have gradually reached performance bottlenecks, making further improvements in coding efficiency increasingly difficult. In recent years, deep learning, with its powerful nonlinear fitting and representation capabilities, has provided new opportunities for optimizing video coding. This paper presents a systematic and detailed analysis of deep learning-driven video coding technologies. First, we briefly introduce video coding techniques under conventional coding frameworks and further explore the optimization of key modules, such as intra- and inter-frame prediction, through deep learning. Then, we focus on the development and key technical routes of end-to-end video coding frameworks based on deep learning, providing a comparative analysis of their performance. Finally, we highlight significant research achievements of deep learning in the field of video coding, examine the challenges and limitations of existing techniques, and offer an outlook on future trends in video coding technologies.

    参考文献
    相似文献
    引证文献
引用本文

何小海,李鑫磊,魏海涛,毕晓东,聂尧佳,熊志娜,张皓彦,熊淑华.深度学习驱动的视频编码:方法、进展与展望[J].数据采集与处理,2026,(2):515-542

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2026-01-09
  • 最后修改日期:2026-02-26
  • 录用日期:
  • 在线发布日期: 2026-04-15