Claim Detection in Persian Twitter Posts

سال انتشار: 1403
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 106

فایل این مقاله در 10 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_ITRC-16-3_004

تاریخ نمایه سازی: 4 آبان 1403

چکیده مقاله:

The proliferation of false information on social media has profound negative impacts across various aspects of people's lives. To mitigate these effects, numerous studies have focused on developing automated factchecking systems aimed at enhancing the accuracy and reliability of news and information. Claim detection, recognized as the initial stage in constructing such systems, has been explored in several languages. In our paper, we introduce a corpus of Persian tweets annotated with ۱۱ labels derived from linguistic analysis, representing different types of claims. Additionally, we establish a baseline claim detection model to assess the dataset. This study frames claim detection as a classification task and employs a transformer-based approach to train a multi-label classifier capable of identifying various types of claims in Persian texts.

نویسندگان

Mohammad Hadi Bokaei

ICT Research Institute Tehran, Iran

Minoo Nassajian

Alumni of Computational Linguistics, Sharif University of Technology

Mona Davoudi Shamsi

ICT Research Institute Tehran, Iran