Unsupervised Video Person Re-identification Based on Multiple Kernel Dilated Convolution

doi:10.16337/j.1004-9037.2024.05.011

Home > Archive>Volume 39, Issue 5, 2024 >1192-1203. DOI:10.16337/j.1004-9037.2024.05.011

Unsupervised Video Person Re-identification Based on Multiple Kernel Dilated Convolution
DOI:
                        10.16337/j.1004-9037.2024.05.011
                    
CSTR:
                        
Author:
                        
Affiliation:1.School of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China;2.Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou 730050, China;3.College of Mathematics and Computer Science, Northwest Minzu University, Lanzhou 730030, China
Clc Number:TP391
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Person re-identification aims to identify specific individuals across surveillance cameras， overcoming challenges such as pose variations， occlusions， and background noise that often lead to insufficient feature extraction. This paper proposes a novel unsupervised video-based person re-identification method that utilizes multi-kernel dilated convolution to provide a more comprehensive and accurate representation of individual differences and features. Initially， we employ a pre-trained ResNet50 as an encoder. To further enhance the encoder’s feature extraction capability， we introduce a multiple kernel dilated convolution module. Enlarging the receptive field of convolutional kernels allows the network to more effectively capture both local and global feature information， offering a more comprehensive depiction of a person’s appearance features. Subsequently， a decoder is employed to restore high-level semantic information to a more fundamental feature representation， thereby strengthening feature representation and improving system performance under complex imaging conditions. Finally， a multi-scale feature fusion module is introduced in the decoder output to merge features from adjacent layers， reducing semantic gaps between different feature channel layers and generating more robust feature representations. Offline experiments are conducted on three mainstream datasets， and results show that the proposed method achieves significant improvements in both accuracy and robustness.

Reference

Cited by

Get Citation

LIU Zhongmin, ZHANG Changkai, HU Wenjin. Unsupervised Video Person Re-identification Based on Multiple Kernel Dilated Convolution[J]. Journal of Data Acquisition and Processing,2024,39(5):1192-1203.

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:November 27,2023
Revised:February 26,2024
Adopted:
Online: October 14,2024
Published:

For Authors

Get Citation

Related Videos

Share

Article Metrics

History

Article QR Code