Social Media Toxic Content Filtering System using SOIR Model

Bhandari; Navalakhe; Prajapati

Social Media Toxic Content Filtering System using SOIR Model

محل انتشار: فصلنامه مدیریت فناوری اطلاعات، دوره: 15، شماره: 6

سال انتشار: 1402

نوع سند: مقاله ژورنالی

زبان: انگلیسی

مشاهده: 146

فایل این مقاله در 17 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/1635717

شناسه ملی سند علمی:

JR_JITM-15-6_006

تاریخ نمایه سازی: 29 فروردین 1402

چکیده مقاله:

Social media is a popular data source in the research community. It provides different opportunities to design practical applications to favor humanity and society. A significant amount of people consumes social media content. Thus, sometimes content promoters and influencers publish misleading and toxic content. Therefore, this paper proposes an unhealthy content filtering system using the information retrieval model SOIR to identify and remove poisonous content from social media. The Semantic query Optimization-based Information Retrieval (SOIR) uses Fuzzy C Means (FCM) clustering to produce a particular data structure. To incorporate a query generation technique for the generation of multiple queries to increase the probability of correct outcomes. The SOIR model is modified in this work to utilize the model with the social media toxic content filtering model. The model uses linguistic and semantically information to craft new feature sets. The Part of Speech (POS) tagging is used to construct the linguistic feature. Finally, the pattern-matching algorithm is designed to classify the tweets as toxic or nontoxic. Based on lexical and semantic analysis of similar semantic queries (Tweets), it is identified with the class labels of the tweets. Twitter text posts are used to create training and test samples in this context. Here, a total of ۲۰۰۲ tweets are used for the experiment. The experimental study has been carried out with the different I.R. models (K-NN, Cosine) based on precision, recall, and F۱-Score demonstrating the superiority of the proposed classification model

کلیدواژه ها:

Text mining ، Semantic Knowledge ، information retrieval ، Sentiment analysis ، Lexical Pattern Analysis

نویسندگان

Bhandari

Department of Applied Mathematics, Indore Institute of Engineering and Technology, Indore, India

Navalakhe

Department of Applied Mathematics and Computational Science, Shri G. S. Institute of Technology and Science, Indore, India.

Prajapati

Department of Applied Mathematics, Indore Institute of Engineering and Technology, Indore, India.

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :

Agarwal, S. (۲۰۱۳, December). Data mining: Data mining concepts and ...
Bergamaschi, S., Domnori, E., Guerra, F., Orsini, M., Lado, R. ...
Bohra, A., Vijay, D., Singh, V., Akhtar, S. S., & ...
Chahal, M. (۲۰۱۶). Information retrieval using Jaccard similarity coefficient. Int. ...
Fotovatikhah, F., Herrera, M., Shamshirband, S., Chau, K. W., Faizollahzadeh ...
Irfan, M. R., Fauzi, M. A., Tibyani, T., & Mentari, ...
Jianqiang, Z., & Xiaolin, G. (۲۰۱۷). Comparison research on text ...
Kreiss, D., & McGregor, S. C. (۲۰۱۹). The "arbiters of ...
Matcha, W., Gašević, D., Uzir, A., Jovanović, J., Pardo, A., ...
Nafis, N. S. M., & Awang, S. (۲۰۲۱). An enhanced ...
Onan, A., & Toçoğlu, M. A. (۲۰۲۱). Weighted word embeddings ...
Pasquier, C., da Costa Pereira, C., & Tettamanzi, A. G. ...
Wang, H., Zhang, Q., & Yuan, J. (۲۰۱۷). Semantically enhanced ...
Xu, H., Yang, W., & Wang, J. (۲۰۱۵). Hierarchical emotion ...
Yassine, S., Kadry, S., & Sicilia, M. A. (۲۰۲۲). Detecting ...

نمایش کامل مراجع