School of Computer Science, Hangzhou Dianzi University, Hangzhou 310018, China
TP391.4
GONG Yuxuan, HAN Tingting. Emotional Video Captioning Based on Fine-Grained Visual and Audio-Visual Dual-Branch Fusion[J]. Journal of Data Acquisition and Processing,2025,40(5):1165-1176.
Copy
