Recognition of Vietnamese Text in Natural Scene Based on Modified DAN
CSTR:
Author:
Affiliation:

1.Guangxi Key Laboratory of Image and Graphic Intelligent Processing(Guilin University of Electronic Technology), Guilin 541004, China;2.Guangxi Key Laboratory of Culture and Tourism Smart Technology(Guilin Tourism University), Guilin 541006 China

Clc Number:

TP391

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Vietnamese characters which are composed of Latin characters and diacritic symbols make recognition more challenging. On the one hand, diacritic symbols are more likely to lead to attention drift. On the other hand, Vietnamese characters include many categories, and the differences between characters are small, for example some characters only differ from diacritical symbols, which further increases difficulty of recognition. Based on the decoupled attention network (DAN) algorithm, this paper designs a visual feature and sequence feature fusion module (VSFM), which utilizes bidirectional gated recurrent unit (Bi-GRU) to model sequences in the horizontal and vertical directions, further alleviating attention drift and enhancing correlation between diacritics and Latin characters. And an enhanced decoupled text decoder module (ETDM) is designed, which employs more feature information to identify similar characters more effectively. A series of experiments validate the effectiveness of the proposed method.

    Reference
    Related
    Cited by
Get Citation

Wang Libing, Feng Yate, Wen Yimin. Recognition of Vietnamese Text in Natural Scene Based on Modified DAN[J].,2023,38(5):1058-1068.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:March 13,2022
  • Revised:April 19,2023
  • Adopted:
  • Online: September 25,2023
  • Published:
Article QR Code