Capture Methods of Gambling Related Illegal Websites in Massive Websites
CSTR:
Author:
Affiliation:

1.Department of Computer Information and Cyber Security, Jiangsu Police Institute, Nanjing 210031, China;2.Jiangsu Electronic Data Forensics and Analysis Engineering Research Center, Jiangsu Police Institute, Nanjing 210031, China;3.Key Laboratory of Digital Forensics of Jiangsu Provincial Public Security Department, Jiangsu Police Institute, Nanjing 210031, China;4.Cyber Security Guard Corps, Jiangsu Provincial Public Security Department, Nanjing 210024, China;5.Big Data Center, Nanjing Municipal Public Security Bureau, Nanjing 210005, China

Clc Number:

TP3

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Aiming at the problem of detecting illegal gambling websites in massive websites, this paper proposes a classification method based on BERT-BiLSTM and multi-classifier decision-level fusion. This method improves the classification performance by adopting the following steps. Firstly, it extracts the textual information considered with high priority, i.e., meta information in HTML head and hyperlink titles on a web page, to enhance the richness of textual features. Secondly, a novel text classification model based on BERT-BiLSTM is designed, and it is proved superior in learning better sentence feature representatives and boosting performance. At last, the decision-level fusion is performed on the classification results from multiple dimensions (i.e., website title, keywords, and page text) to further improve the performance and robustness of the entire system. Moreover, a variety of strategies generating suspicious domain names are used to improve the ability to actively detect illegal websites. Experimental results and running results in real cyberspace demonstrate the effectiveness of the proposed method.

    Reference
    Related
    Cited by
Get Citation

LIU Jiayin, YIN Jie, NIU Bowei, ZHUGE Chengchen, HE Haichen. Capture Methods of Gambling Related Illegal Websites in Massive Websites[J].,2021,36(5):1050-1061.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:October 09,2020
  • Revised:January 20,2021
  • Adopted:
  • Online: September 25,2021
  • Published:
Article QR Code