Investigating Sentiment Analysis of News in Stock Market Prediction Utilizing Machine Learning Technique

  • سال انتشار: 1403
  • محل انتشار: مجله بین المللی ارتباطات و فناوری اطلاعات، دوره: 16، شماره: 3
  • کد COI اختصاصی: JR_ITRC-16-3_007
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 159
دانلود فایل این مقاله

نویسندگان

Golshid Ranjbaran

Department of Electrical and Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

Mohammad-Shahram Moin

ICT Research Institute, Tehran, Iran

Sasan H Alizadeh

ICT Research Institute, Tehran, Iran

Abbas Koochari

Department of Electrical and Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

چکیده

In the stock market, which is a dynamic, complex, nonlinear and non-parametric environment, accurate prediction is crucial for trading strategy. It is assumed that news articles affect the stock market. We investigated the relationship[۱] between headline’s sentiment of news and their impact on stock prices changes. To show this relationship, we applied the sentiment data and the price difference between the day before the news was published and the day of the news, to machine learning regression and classification models. Regression is used to predict changes and classification is used to decide whether to buy or sell stocks.  We used three stock datasets named Apple, Amazon and AXP and the results are shown in the mentioned dataset that using news with negative sentiments can make predictions just as correctly as using news with both positive and negative sentiments. In regression and classification models, Random Forest outperformed other machine learning algorithms in predicting stock price changes using news sentiment analysis. Additionally, we depicted that the results of computer and human tagging were almost similar, showing that using computer tools for text tagging will allow to tag text much more quickly and easily.

کلیدواژه ها

News, stock price prediction, machine learning, Regression model, Classification model

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.