PTokenizer: POS Tagger Tokenizer
سال انتشار: 1395
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 597
فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_JKBEI-2-7_006
تاریخ نمایه سازی: 9 خرداد 1396
چکیده مقاله:
By the advent of new information sources and the expansion of text data, natural language processing (NLP) has become one of the key parts of all the systems dealing with human written texts, and part of speech (POS) tagging is an inseparable part of all NLP tasks. As a result, it is of the paramount importance to enhance the accuracy of POS tagging. In this paper, applying language model and statistical information, we introduce a new approach to tokenize sentences and prepare them to be labeled by POS taggers. An evaluation shows that the proposed method yields a precision of 98 percent for tokenizing, and
کلیدواژه ها:
نویسندگان
Saeed Rahmani
Department of Computer and IT Engineering, Shiraz University, Shiraz, Iran
Seyyed Mostafa Fakhrahmad
Department of Computer and IT Engineering, Shiraz University, Shiraz, Iran
Mohammad Hadi Sadredini
Department of Computer and IT Engineering, Shiraz University, Shiraz, Iran