Sentiment Analysis User Comments On E-commerce Online Sale Websites
- سال انتشار: 1398
- محل انتشار: فصلنامه بین المللی وب پژوهی، دوره: 2، شماره: 2
- کد COI اختصاصی: JR_IJWR-2-2_001
- زبان مقاله: انگلیسی
- تعداد مشاهده: 621
نویسندگان
M.Sc. in Information Technology Engineering, Ghiaseddin Jamshid Kashani University, Abyek, Iran
Faculty Member of Electrical and Computer Department, Faculty of Computer, Eyvanekey University, Semnan, Iran
چکیده
E-commerce websites, based on their structural ontology, provides access to a wide range of options and the ability to deal directly with manufacturers to receive cheaper products and services as well as receiving comments and ideas of the users on the provided products and services. This is a valuable source of information, which includes a large number of user reviews. It is difficult to check the bulk of the comments published manually and non-automatically. Hence, sentiment analysis is an automated and relatively new field of study, which extracts and analyzes people s attitudes and emotions from the context of the comments. The primary objective of this research is to analyze the content of users comments on online sale e-commerce websites of handcraft products. Sentiment analysis techniques were used at sentence level and machine learning approach. First, the pre-processing steps and TF-IDF method were implemented on the comments text. Next, the comments text were classified into two groups of products and services comments using Support Vector Machine (SVM) algorithm with 99.2% accuracy. Finally, the sentiment of comments was classified into three groups of positive, negative and neutral using XGBoost algorithm. The results showed, 95.23% and 95.12% accuracies for classification of sentiments in comments about products and services, respectively.کلیدواژه ها
Machine Learning, Opinion mining, Online Reviews, Sentiment Classification, TF-IDF, Xgboostاطلاعات بیشتر در مورد COI
COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.
کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.