三维人脸生成技术综述
作者:
作者单位:

1北京交通大学信息科学研究所,北京100044;2视觉智能交叉创新教育部国际合作联合实验室,北京100044

作者简介:

通讯作者:

基金项目:

国家自然科学基金(62372033);北京市自然科学基金“海淀联合-重点”项目(L252025)。


A Survey on 3D Face Generation Technology
Author:
Affiliation:

1Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China;2Visual Intelligence + X International Cooperation Joint Laboratory of the Ministry of Education, Beijing 100044, China

Fund Project:

National Natural Science Foundation of China (No.62372033); Natural Science Foundation of Beijing (No.L252025).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    近年来,计算机视觉与图形学的快速发展推动了三维人脸生成技术的突破,尤其在以数字化身构建领域,三维视觉技术在互联网快速普及,受到了学术界和工业界的广泛关注。该技术通过从显式或隐式的底层表征中重建几何结构与纹理细节来合成逼真的多视角人脸图像,并在娱乐与交互应用中取得显著成果,如通过文本描述修改面部特征的属性编辑,或生成说话视频的说话人脸技术。但早期基于线性参数化模型的技术存在生成的真实感和细节表现不佳的问题,随后兴起的隐式神经表示技术虽然大幅提升了视觉质量,却面临计算成本高昂、难以实时交互的难题,这给实际部署与应用均带来了极大限制。为了克服速度与质量之间的矛盾,众多学者对基于显式高斯基元的新型表征以及基于概率扩散的生成模型进行了深入研究,并从不同视角提出了一系列混合生成方法。此外,生成技术仍面临小样本泛化困难、头部物理建模不完整与动态一致性不足等挑战,使其在实现完全写实与实时交互的道路上仍有很长一段距离。目前,三维人脸生成与驱动技术的研究仍处在发展期。本综述对迄今为止的主要研究工作进行了科学系统的总结与归纳,并对现有技术的局限性做简要分析。最后,探讨了三维人脸生成与应用技术的潜在挑战与发展方向,旨在为领域内未来的研究工作提供借鉴。

    Abstract:

    In recent years, benefiting from the rapid development of computer vision and graphics, 3D face generation technology has achieved significant breakthroughs; 3D vision technologies, such as digital avatar creation, have become increasingly popular on the internet, attracting extensive attention from both academia and industry. This generation technology synthesizes realistic multi-view face images by reconstructing geometric structures and texture details from explicit or implicit underlying representations. 3D face generation technology has sparked many related entertainment and interactive applications, such as using attribute editing technology to modify facial features via text descriptions, or using talking head generation technology to drive a static portrait to generate a talking video. However, early technologies based on linear parametric models suffered from poor realism and detail performance. And the subsequently emerging implicit neural representation technologies, while significantly improving visual quality, face the challenges of high computational costs and difficulty in achieving real-time interaction, which have brought great limitations to practical deployment and application. In order to overcome the contradiction between speed and quality, numerous scholars have conducted in-depth research on novel representations based on explicit Gaussian primitives and generative models based on probabilistic diffusion, and have proposed a series of hybrid generation methods from different perspectives. However, due to problems such as difficulty in generalizing from small sample data, incomplete modeling of full-head physical structures, and insufficient consistency in dynamic driving, there is still a long way to go for generation technology on the path to becoming fully photorealistic and capable of real-time interaction. In fact, research on 3D face generation and driving technology is still in a developmental stage, and the connotations and extensions of its technology are rapidly updating and iterating. This review provides a systematic summary of the main research works to date, along with a brief analysis of the limitations of current technologies. It also explores potential challenges and future directions for 3D face generation and application technologies, offering a guidance for future research.

    参考文献
    相似文献
    引证文献
引用本文

王伟,何一康,魏云超,赵耀.三维人脸生成技术综述[J].数据采集与处理,2026,(2):543-565

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2026-01-09
  • 最后修改日期:2026-02-26
  • 录用日期:
  • 在线发布日期: 2026-04-15