الفهرس | Only 14 pages are availabe for public view |
Abstract This Thesis presents a cluster-and-label model using PSO to optimize the cluster centroid. In addition, labeled data are used to label cluster and guide clustering process. In some domains, the number of clusters in semi-supervised classification is unknown as in the Automatic Knowledgebase Construction. This thesis proposes an algorithm 2ESPSO3 to detect the number of clusters in the dataset by using PSO to optimize silhouette score. Then, the detected numbers of clusters are used in exploratory semi-supervised classification tasks with an unanticipated cluster |