CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

SGF (Semantic Graphs Fusion ): A Knowledge based Representation of Textual Resources for Text Mining Applications

عنوان مقاله: SGF (Semantic Graphs Fusion ): A Knowledge based Representation of Textual Resources for Text Mining Applications
شناسه ملی مقاله: JR_JIST-7-2_001
منتشر شده در شماره 2 دوره 7 فصل در سال 1398
مشخصات نویسندگان مقاله:

Morteza Jaderyan - Department of Computer Engineering, Bu Ali Sina University, Hamedan, Iran
Hassan Khotanlou - Department of Computer Engineering, Bu Ali Sina University, Hamedan, Iran

خلاصه مقاله:
The proper representation of textual documents has been the greatest challenge in text mining applications. In this paper, a knowledge-based representation model for text analysis applications is introduced. The proposed functionalities of the system are achieved by integrating structured knowledge in the core components of the system. The semantic, lexical, syntactical and structural features are identified by the pre-processing module. The enrichment module is introduced to identify contextually similar concepts and concept maps for improving the representation. The information content of documents and the enriched contents are then fused (merged) into the graphical structure of a semantic network to form a unified and comprehensive representation of documents. The ۲۰Newsgroup and Reuters-۲۱۵۷۸ datasets are used for evaluation. The evaluation results suggest that the proposed method exhibits a high level of accuracy, recall and precision. The results also indicate that even when a small portion of the information content is available, the proposed method performs well in standard text mining applications.

کلمات کلیدی:
Semantic document representation; Ontology; Knowledge base (KB); Semantic network; Information fusion

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1142409/