以人为中心的可信视觉智能
作者:
作者单位:

1西安电子科技大学电子工程学院,西安 710071;2重庆邮电大学计算机科学与技术学院,重庆 400065

作者简介:

通讯作者:

基金项目:

国家科技重大专项(2025ZD0123601);国家自然科学基金(62472060);重庆市自然科学基金(CSTB2024NSCQ-QCXMX0060, CSTB2023NSCQ-LZX0061,CSTB2023TIAD-STX0016)。


Human-Centered Trustworthy Visual Intelligence
Author:
Affiliation:

1School of Electronic Engineering, Xidian University, Xi’an 710071, China;2School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

Fund Project:

National Natural Science Foundation of China (No.62472060); Chongqing Natural Science Foundation (Nos. CSTB2024NSCQ-QCXMX0060, CSTB2023NSCQ-LZX0061); Chongqing Key Research and Development Program of Science and Technology Innovation (No.CSTB2023TIAD-STX0016).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    本文围绕以人为中心的可信视觉智能,系统总结其应用现状、关键技术与发展趋势。随着计算机视觉从感知走向高自主决策与物理执行,视觉智能系统在隐私、公平、鲁棒、透明与安全等方面的风险日益突出,当系统输出可能影响人的安全与权益时,单纯追求性能已难以满足可信需求。为此,本文从计算机视觉视角梳理可信视觉智能的内涵与演进,强调人作为数据主体、认知参与者与最终控制者的多重角色,并提出以信息空间、认知空间与物理空间为主线的统一框架,构建“关注于人—服务于人—受控于人”的递进体系。围绕数据分析、模型设计与系统应用3个层面,本文总结公平与隐私约束下以人为对象的视觉数据分析方法,稳健且负责任的模型设计策略,以及以透明与安全为核心的人机协同控制机制,并结合图像增强、视频分析、机器人操作与三维视觉感知等场景进行分析。最后讨论了鲁棒评估、跨场景泛化、协同治理与可持续部署等挑战与研究方向,为真实世界可信视觉智能系统提供了路线图。

    Abstract:

    This survey reviews human-centered trustworthy visual intelligence by summarizing its application landscape, key techniques, and emerging trends. As computer vision advances from perception to highly autonomous decision making and physical execution, risks related to privacy, fairness, robustness, transparency, and safety become increasingly salient. When system outputs may affect human safety and rights, performance optimization alone can no longer satisfy the requirements for trustworthiness. From a computer vision perspective, the paper traces the concept and evolution of trustworthy visual intelligence, emphasizing the multiple roles of humans as data subjects, cognitive participants, and ultimate controllers. A unified framework is then presented along three complementary spaces, information, cognitive, and physical, and a progressive paradigm is formulated that focuses on humans, serves humans, and remains under human control. The survey synthesizes human-oriented visual data analysis methods under fairness and privacy constraints, robust and responsible model design strategies, and human-machine collaborative control mechanisms centered on transparency and safety, with discussions across representative scenarios such as image enhancement, video analysis, robotic manipulation, and 3D visual perception. Finally, open challenges and future directions are outlined, including robustness evaluation, cross-scenario generalization, collaborative governance, and sustainable deployment, providing a roadmap for trustworthy visual intelligence in real-world systems.

    参考文献
    相似文献
    引证文献
引用本文

高新波,莫梦竟成,张灿,袁钰,张明珠,任路阳,李爽,冷佳旭.以人为中心的可信视觉智能[J].数据采集与处理,2026,(2):303-331

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2026-01-28
  • 最后修改日期:2026-02-28
  • 录用日期:
  • 在线发布日期: 2026-04-15