A Hybrid Approach to Sentiment Analysis of Iranian Stock Market User’s Opinions

  • سال انتشار: 1402
  • محل انتشار: ماهنامه بین المللی مهندسی، دوره: 36، شماره: 3
  • کد COI اختصاصی: JR_IJE-36-3_018
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 310
دانلود فایل این مقاله

نویسندگان

M. Ahangari

Department of Computer Engineering, Faculty of Engineering, Golestan University, Gorgan, Iran

A. Sebti

Department of Computer Engineering, Faculty of Engineering, Golestan University, Gorgan, Iran

چکیده

With the significant growth of social media, individuals and organizations are increasingly using public opinion in these media to make their own decisions. The purpose of sentiment analysis is to automatically extract people’s sentiments from those social networks. Social networks related to financial markets, including stock markets, have recently attracted the attention of many individuals and organizations. people in these networks share their opinions and ideas about each share in the form of a post or tweet. In fact, sentiment analysis in this field is the assessment of people’s attitude towards each share. There are different approaches in sentiment analysis, in this article, a hybrid approach is proposed for sentiment analysis. In this way the feature vector used in machine learning is obtained from a lexicon that is automatically extracted from user’s tweets. This lexicon is made by using stock price information related to user’s opinion. Also, by using the next day’s price information of each share, amendments were suggested to this lexicon. Therefore, the lexicon generated for the feature vector was constructed in three ways, and all three methods reported about an ۸% improvement over the baseline method in terms of F-score. The baseline method that is considered for this work, is the Persian version of SentiStrength lexicon which is designed for general purpose.

کلیدواژه ها

Sentiment analysis, Opinion mining, Lexicon Creation, Persian Lexicon, Support Vector Machine

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.