ML Based Social Media Data Emotion Analyzer and Sentiment Classifier with Enriched Preprocessor

  • Year of publication: 1400 (Solar Hijri)
  • Published in: Journal of Information Technology Management (quarterly), Volume: 13, Issue: 5
  • Dedicated COI code: JR_JITM-13-5_002
  • Article language: English

Authors

Kothandan

Research Scholar, Computer Science Engineering, Bharath University, Chennai, India.

Murugesan

Provost, Bharath University, Chennai, India.

Abstract

Sentiment analysis, or opinion mining, is a natural language processing (NLP) method for computationally identifying and categorizing user opinions expressed in text. It is mainly used to determine whether a user's opinions, emotions, appraisals, or judgments toward a specific event, topic, or product are positive, negative, or neutral. In this approach, large volumes of digital data generated online by blogs and social media websites are gathered and analyzed to discover insights and support business decisions. Social media are web-based applications designed to let people share digital content quickly and efficiently in real time. Many people think of social media as apps on their smartphone or tablet, but this communication tool originated on computers. It has become an essential and inseparable part of human life. Most businesses use social media to market products, promote brands, connect with current customers, and foster new business. Online social media data is pervasive: people post their opinions and sentiments about products, events, and other people as short text messages. For example, Twitter is an online social networking service where users post and interact with short messages called "tweets." Social media has therefore become a promising source for businesses to discover people's sentiments and opinions about a particular event or product. This paper focuses on the development of a Multinomial Naïve Bayes based social media data emotion analyzer and sentiment classifier. It also explains the enriched methods used in pre-processing, and covers various machine learning techniques, the steps for using the text classifier, and different types of language models.
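As a rough illustration of the kind of pipeline the abstract describes, the following is a minimal sketch of a tweet preprocessor feeding a Multinomial Naïve Bayes classifier with Laplace smoothing, written in plain Python. It is not the authors' implementation: the abstract does not list the actual preprocessing steps, so the cleaning rules here (lowercasing, dropping URLs and @mentions, keeping hashtag words) are assumptions, as are the names `preprocess`, `MultinomialNB`, and `TRAIN` and the toy training tweets.

```python
import math
import re
from collections import Counter, defaultdict

# Assumed cleaning rules; the paper's "enriched" preprocessing may differ.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
TOKEN_RE = re.compile(r"[a-z']+")


def preprocess(text):
    """Lowercase, drop URLs and @mentions, keep hashtag words, tokenize."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = text.replace("#", " ")
    return TOKEN_RE.findall(text)


class MultinomialNB:
    """Multinomial Naive Bayes over word counts with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        self.vocab = set()
        for doc, label in zip(docs, labels):
            tokens = preprocess(doc)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def log_scores(self, doc):
        """Return log P(class) + sum of log P(word | class) for each class."""
        # Words never seen in training are ignored.
        tokens = [t for t in preprocess(doc) if t in self.vocab]
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.class_counts.items():
            total = sum(self.word_counts[label].values())
            score = math.log(n_docs / self.total_docs)
            for t in tokens:
                count = self.word_counts[label][t]
                score += math.log(
                    (count + self.alpha) / (total + self.alpha * vocab_size)
                )
            scores[label] = score
        return scores

    def predict(self, doc):
        scores = self.log_scores(doc)
        return max(scores, key=scores.get)


# Hypothetical toy data, for illustration only.
TRAIN = [
    ("I love this phone, great battery!", "positive"),
    ("What a wonderful and happy day #blessed", "positive"),
    ("Great service, love it", "positive"),
    ("I hate the new update, terrible bugs", "negative"),
    ("Worst support ever, awful experience @brand", "negative"),
    ("Terrible battery, I hate it http://example.com/x", "negative"),
]

clf = MultinomialNB().fit([t for t, _ in TRAIN], [l for _, l in TRAIN])
print(clf.predict("love this great day"))
```

The same structure extends to emotion classes (for example joy, anger, sadness) by changing the labels, and to a neutral class by adding neutral examples. Working in log space avoids numeric underflow on longer texts.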

Keywords

Machine learning, Multinomial naive Bayes, Emotion analysis, Language models, Opinion Mining (OM), Sentiment Analysis (SA), Twitter
