Ensemble Feature Selection and Machine Learning Approaches for Phishing Website Detection Using

سال انتشار: 1403
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 47

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

AISOFT02_028

تاریخ نمایه سازی: 17 فروردین 1404

چکیده مقاله:

Phishing, a prevalent social engineering attack, targets users' sensitive information by mimicking legitimate websites to deceive users into divulging personal data. Despite various detection techniques, zero-day phishing attacks remain challenging to identify. This study aims to improve phishing detection through advanced feature selection methods and ensemble machine learning techniques. We propose a novel approach that integrates multiple feature extraction models, including LIME, Kernel SHAP, Tree SHAP, and Information Gain, to enhance feature selection and improve classification accuracy. Three datasets —ISCX-URL-۲۰۱۶, Hannousse & Yahiouche, and UCI —were utilized for evaluation. The ensemble learning techniques of voting and stacking were employed, combining classifiers including Random Forest, XGBoost, and Support Vector Machine (SVM). Results indicate that our stacking-based model outperforms traditional methods, achieving high accuracy across various datasets. This approach demonstrates the potential for robust detection of phishing websites with reduced computational complexity.

نویسندگان

Nasim Yazdanjo

Dept. of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University, Shiraz, Iran

Mostafa Fakhrahmad

Dept. of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University, Shiraz, Iran