Semi supervised clustering for Text Clustering

N.Saranya

抽象的

Semi supervised clustering for Text Clustering

N.Saranya

Based on clustering algorithm Affinity Propagation (AP) I present this paper a semisupervised text clustering algorithm, called Seeds Affinity Propagation (SAP). There are two main contributions in my approach: 1) a similarity metric that captures the structural information of texts, and 2) seed construction method to improve the semisupervised clustering process. To study the performance and efficiency of the new algorithm, I applied it to the benchmark data and compared it to two state-of-the-art clustering algorithms, namely, k-means algorithm and the original AP algorithm. Furthermore, I have analyzed the individual impact of the two proposed contributions. Results show that the proposed similarity metric is more effective in text clustering and the proposed semisupervised strategy achieves both better clustering results and faster convergence. The complete SAP algorithm obtains higher F-measure and lower entropy, improves significantly clustering execution time (25 times faster) in respect that k-means, and provides enhanced robustness compared with all other methods.

免责声明: 此摘要通过人工智能工具翻译，尚未经过审核或验证

期刊亮点

CDMA/GSM Communication Protocol 人工智能图案/图像识别先进的计算架构冷静科技基于代理的中间件安全系统宽带与智能网络开源软件数据仓库数据库安全数据结构无线传感器机器人技术生物信息学和计算生物学网格计算自主和上下文感知计算自组织网络自适应雷达技术高级数值算法

索引于

哥白尼索引

学术钥匙

引用因子

宇宙IF

参考搜索

哈姆达大学

世界科学期刊目录

国际创新期刊影响因子（IIJIF）

国际组织研究所 (I2OR)

宇宙

国际期刊

制药科学医学科学工程普通科学

国际计算机与通信工程创新研究杂志

抽象的

Semi supervised clustering for Text Clustering

期刊亮点

索引于

国际期刊

地址