School of Control Science and Engineering, Shandong University, Jinan 250061, China
Clc Number:
TP391.41
Fund Project:
Article
|
Figures
|
Metrics
|
Reference
|
Related
|
Cited by
|
Materials
|
Comments
Abstract:
Multimodal continual learning (MMCL), as a significant research direction in the fields of machine learning and artificial intelligence, aims to achieve continuous knowledge accumulation and task adaptation through the integration of multiple modal data (such as images, text, audio, etc.). Compared with traditional single-modal learning methods, MMCL not only enables parallel processing of multi-source heterogeneous data, but also effectively retains existing knowledge while adapting to new task requirements, demonstrating immense application potential in intelligent systems. This paper provides a systematic review of multimodal continual learning. Firstly, the fundamental theoretical framework of MMCL is elaborated from three dimensions: Basic concepts, evaluation systems, and classical single-modal continual learning methods. Secondly, the advantages and challenges of MMCL in practical applications are thoroughly analyzed: Despite its significant advantages in multimodal information fusion, it still faces critical challenges such as modal imbalance and heterogeneous fusion, which not only constrain the performance of current methods but also indicate future research directions. Based on this, the paper then comprehensively reviews the research status and latest advancements in MMCL methods from four main aspects: Replay-based, regularization-based, parameter isolation-based, and large model-based approaches. Finally, a forward-looking perspective on the future development trends of MMCL is presented.
Reference
Related
Cited by
Get Citation
ZHANG Wei, QIAN Longyue, ZHANG Lin, LI Teng. Research Progress on Multimodal Continual Learning Methods[J].,2025,40(5):1122-1138.