An Auto-Indexing Method for Persian Text
عنوان مقاله: An Auto-Indexing Method for Persian Text
شناسه ملی مقاله: ITCOMI01_025
منتشر شده در همایش جامع بین المللی کامپیوتر، فناوری اطلاعات و مهندسی برق در سال 1396
شناسه ملی مقاله: ITCOMI01_025
منتشر شده در همایش جامع بین المللی کامپیوتر، فناوری اطلاعات و مهندسی برق در سال 1396
مشخصات نویسندگان مقاله:
Maryam Moasheri - Department of Computer, Arak Branch, Islamic Azad University, Arak, Iran
خلاصه مقاله:
Maryam Moasheri - Department of Computer, Arak Branch, Islamic Azad University, Arak, Iran
This paper studies an approach to automatic indexing Persian context based on Persian grammatical rules in order to produce back-of-the-book index. Automatic indexing means automatically extract or select words from a document to create index. In this work, in order to present an approach for automatic indexing, SVM (Support Vector Machine) has been used to produce an intelligent system. The corpus has been applied is Bijankhan corpus which is a manually tagged Persian text collection. To evaluate proposed system, a book entitled Natural Low was considered as test set, while the index section of this book was done manually by human agent and compared with the automatic system. In this study, achieved precision and recall, were 53% and 90%, respectively.
کلمات کلیدی: Persian context, Automatic indexing, SVM
صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/773360/