Political Sentiment Analysis of Persian Tweets Using CNN-LSTM Model

  • سال انتشار: 1402
  • محل انتشار: فصلنامه بین المللی وب پژوهی، دوره: 6، شماره: 1
  • کد COI اختصاصی: JR_IJWR-6-1_004
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 53
دانلود فایل این مقاله

نویسندگان

Mohammad Dehghani

Electrical and Computer Engineering Department, University of Tehran, Tehran, Iran;

Zahra Yazdanparast

Electrical and Computer Engineering Department, Tarbiat Modares University, Tehran, Iran;

چکیده

Sentiment analysis is the process of identifying and categorizing people’s emotions or opinions regarding various topics. The analysis of Twitter sentiment has become an increasingly popular topic in recent years. In this paper, we present several machine learning and a deep learning model to analysis sentiment of Persian political tweets. Our analysis was conducted using Bag of Words and ParsBERT for word representation. We applied Gaussian Naive Bayes, Gradient Boosting, Logistic Regression, Decision Trees, Random Forests, as well as a combination of CNN and LSTM to classify the polarities of tweets. The results of this study indicate that deep learning with ParsBERT embedding performs better than machine learning. The CNN-LSTM model had the highest classification accuracy with ۸۹ percent on the first dataset and ۷۱ percent on the second dataset. Due to the complexity of Persian, it was a difficult task to achieve this level of efficiency. The main objective of our research was to reduce the training time while maintaining the model's performance. As a result, several adjustments were made to the model architecture and parameters. In addition to achieving the objective, the performance was slightly improved as well.

کلیدواژه ها

Sentiment analysis, Persian, Machine Learning, Deep Learning, Twitter

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.