Fine-Grained Image Classification with Multi-channel Visual Attention
CSTR:
Author:
Affiliation:

National Engineering Laboratory of Speech and Language Information Processing, University of Science and Technology of China, Hefei, 230027, China

Clc Number:

TP391.4

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Visual attention mechanism has been commonly used in state-of-the-art fine-grained classification methods in recent years. However, most attention-based image classification systems only apply single-layer or part-specified attention feature, with simple multiplication-based attention applying method, which limits the information provided by the attention. This paper presents a multi-channel visual attention based fine-grained image classification system. Multi-channel attention features are extracted from the image and applied to low-level features, with subtraction of mean values corresponding to each layer of attention for high-order representation, making the model an end-to-end optimizable deep neural network architecture. On multiple commonly used fine-grained classification datasets, the presented method outperforms state-of-the-art methods with a large margin.

    Reference
    Related
    Cited by
Get Citation

Wang Peisen, Song Yan, Dai Lirong. Fine-Grained Image Classification with Multi-channel Visual Attention[J].,2019,34(1):157-166.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:April 28,2018
  • Revised:August 14,2018
  • Adopted:
  • Online: April 12,2019
  • Published:
Article QR Code