CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

An Auto-Indexing Method for Persian Text

عنوان مقاله: An Auto-Indexing Method for Persian Text
شناسه ملی مقاله: ITCOMI01_025
منتشر شده در همایش جامع بین المللی کامپیوتر، فناوری اطلاعات و مهندسی برق در سال 1396
مشخصات نویسندگان مقاله:

Maryam Moasheri - Department of Computer, Arak Branch, Islamic Azad University, Arak, Iran

خلاصه مقاله:
This paper studies an approach to automatic indexing Persian context based on Persian grammatical rules in order to produce back-of-the-book index. Automatic indexing means automatically extract or select words from a document to create index. In this work, in order to present an approach for automatic indexing, SVM (Support Vector Machine) has been used to produce an intelligent system. The corpus has been applied is Bijankhan corpus which is a manually tagged Persian text collection. To evaluate proposed system, a book entitled Natural Low was considered as test set, while the index section of this book was done manually by human agent and compared with the automatic system. In this study, achieved precision and recall, were 53% and 90%, respectively.

کلمات کلیدی:
Persian context, Automatic indexing, SVM

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/773360/