A Set Pair k-means Clustering Algorithm for Incomplete Information System
CSTR:
Author:
Affiliation:

1.College of Science, North China University of Science and Technology, Tangshan,063210,China;2.Qian’an College, North China University of Science and Technology, Tangshan, 063210,China;3.Key Laboratory of Data Science and Application of Hebei Province, Tangshan, 063210,China

Clc Number:

TP391

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    For the data clustering problem of incomplete information system, the set pair analysis theory is introduced into k-means clustering. At the same time, to better represent the relationship between the sample and the cluster, a set pair k-means(SPKM) clustering algorithm for incomplete information system is constructed. Firstly, a set pair distance measurement method is proposed according to set pair theory, and the measurement method is applied to the k-means algorithm to obtain the preliminary clustering results. Then, for samples belonging to multiple clusters at the same time, the samples are assigned into the boundary region of the corresponding clusters. And for samples belonging to only one cluster, it is assigned into the positive region or boundary region of the corresponding clusters. The clustering results are expressed by three parts, which are the positive region belonging to the cluster, the boundary region that may belong to the cluster and the negative region which does not belong to the cluster. Finally, six data sets in the UCI database and four contrast algorithms are selected for experimental evaluation. Experimental results show that the SPKM algorithm has good clustering performance in accuracy, F1 value, Jaccard coefficient, FMI and ARI.

    Reference
    Related
    Cited by
Get Citation

ZHANG Chunying, GAO Ruiyan, LIU Fengchun, WANG Jiahao, CHEN Song, FENG Xiaoze, Ren Jing. A Set Pair k-means Clustering Algorithm for Incomplete Information System[J].,2020,35(4):613-629.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:April 30,2020
  • Revised:July 10,2020
  • Adopted:
  • Online: July 25,2020
  • Published:
Article QR Code