Towards Facts Extraction from Texts in the
Polish Language

Tomasz Boiński; Adam Brzeski

抽象的

Towards Facts Extraction from Texts in the Polish Language

Tomasz Boiński, Adam Brzeski

The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text. The analysis includes morphosyntactic tagging, named entity extraction and relation identification. The results acquired for an exemplary Wikipedia text is presented. We indicate the free word formation principle as the main difficulty in the Polish texts analysis. At the same time satisfactory performance of the tagging and analysis tools for the Polish language was confirmed in the conducted experiment.

免责声明: 此摘要通过人工智能工具翻译，尚未经过审核或验证

期刊亮点

CDMA/GSM Communication Protocol 人工智能图案/图像识别先进的计算架构冷静科技基于代理的中间件安全系统宽带与智能网络开源软件数据仓库数据库安全数据结构无线传感器机器人技术生物信息学和计算生物学网格计算自主和上下文感知计算自组织网络自适应雷达技术高级数值算法

索引于

哥白尼索引

学术钥匙

引用因子

宇宙IF

参考搜索

哈姆达大学

世界科学期刊目录

国际创新期刊影响因子（IIJIF）

国际组织研究所 (I2OR)

宇宙

国际期刊

制药科学医学科学工程普通科学

国际计算机与通信工程创新研究杂志

抽象的

Towards Facts Extraction from Texts in the Polish Language

期刊亮点

索引于

国际期刊

地址