Construction of an Annotated Corpus for Kurdish Abstractive Text Summarization

Publication year: 1401 (Solar Hijri)
Document type: Conference paper
Language: English

The full text of this paper is available for download as a 5-page PDF.

National scientific document ID:

CEITCONF06_009

Indexing date: 26 Khordad 1402

Abstract:

Automatic text summarization has recently become an essential task in natural language processing (NLP). However, developing summarization systems requires datasets for proper evaluation, and this requirement holds for less-resourced languages too. In this research, the first freely available annotated corpus for evaluating abstractive Kurdish text summarization systems is produced and presented. News articles were used as the source material for the dataset. In addition, a transformer-based abstractive Kurdish text summarization model has been developed for the first time, to be evaluated on this dataset. The current work can serve as a baseline for future research.

Authors

Fatemeh Daneshfar

Assistant Professor, Department of Computer Engineering and Information Technology, University of Kurdistan, Sanandaj, Iran

Pedram Yamini

Bachelor Student, Department of Computer Engineering and Information Technology, University of Kurdistan, Sanandaj, Iran

Abouzar Ghorbani

PhD Student, Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran