Simple and Efficient Way to Cluster Documents for Growing Database

Dikhtiarenko Oleksandr, Biloshchytskyi Andrii

Abstract

In this article we describe a new method of clustering text documents. Each document is characterized by a frequency table of the words it contains. The tables are built from term frequencies and then cleaned of words that do not characterize a specific document because they are common to all or most of the document set. To identify such words, we calculate the percentage of documents in which each word occurs (the document frequency, the quantity that inverse document frequency is derived from). The objectives of this publication are to determine whether the frequency dictionary of a document can serve as its semantic characteristic and to define a clustering method based on frequency tables.
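The frequency tables described above can be illustrated with a minimal sketch. It is not the authors' implementation: the tokenizer, the function names, and the `max_doc_share` threshold are assumptions made for illustration, since the abstract does not state how words are split or what share of documents makes a word "common".

```python
from collections import Counter
import re


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def frequency_tables(documents, max_doc_share=0.8):
    """Build one term-frequency table per document.

    Words that occur in more than max_doc_share of all documents are dropped.
    max_doc_share is a hypothetical threshold, not a value from the article.
    """
    tokenized = [tokenize(doc) for doc in documents]

    # Document frequency: the number of documents in which each word occurs.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    n_docs = len(documents)
    common = {w for w, df in doc_freq.items() if df / n_docs > max_doc_share}

    # Term frequency per document, with the common words removed.
    return [Counter(t for t in tokens if t not in common) for tokens in tokenized]


docs = [
    "the cat saw another cat",
    "the dog chased the cat",
    "the stock market fell",
]
tables = frequency_tables(docs, max_doc_share=0.8)
```

In this toy set the word "the" occurs in every document, so it is removed from all three tables, while "cat" occurs in two of three documents and is kept. Because the tables are plain word counts, a document added later only needs its own table plus an update of the document-frequency counts, which is one plausible reason the approach suits a growing database.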

International Journal of Innovative Research in Computer and Communication Engineering
