ParsiAzma Challenges on Persian Text Analysis in Social Media

سال انتشار: 1403
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 35

فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_ITRC-16-3_001

تاریخ نمایه سازی: 4 آبان 1403

چکیده مقاله:

The ParsiAzma[۱] challenges in ۲۰۲۳ focused on Improving Persian text analysis in social media. We designed four shared tasks: stance detection, sentiment analysis, emotion detection, and claim detection in social media posts. The goal of these challenges was to bring together various teams to develop the best models for these challenges and to establish a standard test platform for future Persian language research. A total of ۲۸ teams participated, competing to solve the specified tasks. The most effective models in all shared tasks utilized the BERT model. Text embedding was first obtained using a BERT[۲]-based model, followed by final predictions with either an MLP[۳] or CNN[۴]. Additionally, several meta-classifiers were developed as fusion models to leverage the strengths of individual models. The best results based on accuracy criteria for the four challenges—stance detection, sentiment analysis, emotion recognition, and claim detection—were ۰.۶۷, ۰.۶۷, ۰.۴۵, and ۰.۵۶, respectively. These results indicate that emotion detection has lower accuracy than the other three tasks, highlighting its complexity.   [۱] https://parsiazma.ir/ [۲] Bidirectional Encoder Representations from Transformers [۳] Multi-Layer Perceptron [۴] Convolutional Neural Network

نویسندگان

Mojgan Farhoodi

ICT Research Institute Tehran, Iran

Maryam Mahmoudi

ICT Research Institute Tehran, Iran

Mohammad Hadi Bokaei

ICT Research Institute Tehran, Iran