Stance Detection Dataset for Persian Tweets

Stance detection aims to identify an author's stance towards a specific topic which has become a critical component in applications such as fake news detection, claim validation, author profiling, etc. However, while the stance is easily detected by humans, machine learning models are falling short of this task. In the English language, due to having large and appropriate e datasets, relatively good accuracy has been achieved in this field, but in the Persian language, due to the lack of data, we have not made significant progress in stance detection. So, in this paper, we present a stance detection dataset that contains ۳۸۱۳ labeled tweets. We provide a detailed description of the newly created dataset and develop deep learning models on it. Our best model achieves a macro-average F۱-score of ۵۸%. Moreover, our dataset can facilitate research in some fields in Persian such as cross-lingual stance detection, author profiling, etc.

کلیدواژه ها:

stance detection ، fake news ، social media ، twitter ، Persian dataset ، author profiling

نویسندگان

Mohammad Hadi Bokaei

ICT Research Institute (ITRC) Tehran, Iran

Mojgan Farhoodi

ICT Research Institute (ITRC) Tehran, Iran

Mona Davoudi

ICT Research Institute (ITRC) Tehran, Iran

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/1595026

شناسه ملی سند علمی:

JR_ITRC-14-4_006

تاریخ نمایه سازی: 8 بهمن 1401

نحوه استناد به مقاله:

در صورتی که می خواهید در اثر پژوهشی خود به این مقاله ارجاع دهید، به سادگی می توانید از عبارت زیر در بخش منابع و مراجع استفاده نمایید:

Bokaei, Mohammad Hadi and Farhoodi, Mojgan and Davoudi, Mona,1401,Stance Detection Dataset for Persian Tweets,https://civilica.com/doc/1595026

در داخل متن نیز هر جا که به عبارت و یا دستاوردی از این مقاله اشاره شود پس از ذکر مطلب، در داخل پارانتز، مشخصات زیر نوشته می شود.
برای بار اول: (1401, Bokaei, Mohammad Hadi؛ Mojgan Farhoodi and Mona Davoudi)
برای بار دوم به بعد: (1401, Bokaei؛ Farhoodi and Davoudi)
برای آشنایی کامل با نحوه مرجع نویسی لطفا بخش راهنمای سیویلیکا (مرجع دهی) را ملاحظه نمایید.

علم سنجی و رتبه بندی مقاله

مشخصات مرکز تولید کننده این مقاله به صورت زیر است:

رتبه علمی پژوهشگاه ارتباطات و فناوری اطلاعات

نوع مرکز: پژوهشگاه دولتی

تعداد مقالات: 914

در بخش علم سنجی پایگاه سیویلیکا می توانید رتبه بندی علمی مراکز دانشگاهی و پژوهشی کشور را بر اساس آمار مقالات نمایه شده مشاهده نمایید.