SGF (Semantic Graphs Fusion ): A Knowledge based Representation of Textual Resources for Text Mining Applications

سال انتشار: 1398
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 205

فایل این مقاله در 15 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JIST-7-2_001

تاریخ نمایه سازی: 23 دی 1399

چکیده مقاله:

The proper representation of textual documents has been the greatest challenge in text mining applications. In this paper, a knowledge-based representation model for text analysis applications is introduced. The proposed functionalities of the system are achieved by integrating structured knowledge in the core components of the system. The semantic, lexical, syntactical and structural features are identified by the pre-processing module. The enrichment module is introduced to identify contextually similar concepts and concept maps for improving the representation. The information content of documents and the enriched contents are then fused (merged) into the graphical structure of a semantic network to form a unified and comprehensive representation of documents. The ۲۰Newsgroup and Reuters-۲۱۵۷۸ datasets are used for evaluation. The evaluation results suggest that the proposed method exhibits a high level of accuracy, recall and precision. The results also indicate that even when a small portion of the information content is available, the proposed method performs well in standard text mining applications.

نویسندگان

Morteza Jaderyan

Department of Computer Engineering, Bu Ali Sina University, Hamedan, Iran

Hassan Khotanlou

Department of Computer Engineering, Bu Ali Sina University, Hamedan, Iran