Ontology Creation and Population for Natural Language Processing Domain

Publication year: 1397 (Solar Hijri)
Document type: Journal article
Language: English

The full text of this article is available as a 10-page PDF file.

National scientific document ID:

JR_IJWR-1-2_007

Indexing date: 21 Ordibehesht 1399 (Solar Hijri)

Abstract:

In this paper, we describe our proposed methodology for constructing an ontology of the natural language processing (NLP) domain. We use a semi-automatic method, combining rule-based and machine learning techniques, to construct and populate an ontology with bilingual (English-Persian) concept labels (a lexicon), and we evaluate it manually. The methodology yields a complete ontology of the NLP domain with 1333 classes (covering concepts, tools, applications, etc.), 88 object properties, and 2437 annotation assertions on those classes. The ontology is populated with about 428K NLP-related papers and 38K authors, together with about 5M "is Related to" relations between papers and ontology classes and 1M "is Author of" relations between papers and authors. The evaluation results show that the ontology achieves good results. The instantiation is intended to enable applications to find experts, publications, and institutions (such as universities or research laboratories) related to various topics in the NLP field.

Authors

Niloofar Naderian

Computer Science and Engineering Faculty, Shahid Beheshti University, Tehran, Iran

Mehrnoush Shamsfard

Faculty of Computer Science and Engineering, Shahid Beheshti University of Technology, Tehran, Iran

Razieh Adelkhah

Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran