Recognizing phishing websites based on a bayesian combiner

سال انتشار: 1400
محل انتشار: مجله آنالیز غیر خطی و کاربردها، دوره: 12، شماره: 0
کد COI اختصاصی: JR_IJNAA-12-0_061
زبان مقاله: انگلیسی
تعداد مشاهده: 149

دانلود فایل این مقاله

نویسندگان

- -

Department of Electrical Engineering, Shams Higher Education Institute, Iran.

- -

Department of Computer Engineering, West Tehran Branch, Islamic Azad University, Tehran, Iran.

- -

Department of Computer Engineering, West Tehran Branch, Islamic Azad University, Tehran, Iran.

- -

Department of Electrical Engineering, Islamic Azad University, Garmsar Branch, Semnan, Iran.

- -

Department of Electrical Engineering, Technical and Vocational University (TVU), Tehran, Iran.

چکیده

Phishing is a social engineering technique used to deceive users, which means trying to obtain confidential information such as username, password or bank account information. One of the most important challenges on the Internet today is the risk of phishing attack and Internet scams. These attacks cost the United States billions of dollars a year. Therefore, researchers have made great efforts to identify and combat such attacks. Accordingly, the present study aims to evaluate the methods of identifying phishing websites. This research is applied in terms of its objectives and descriptive-analytical in nature. In this article, the classification approach is used to identify phishing websites. From a machine learning point of view, if a suitable strategy is used, the ensemble of votes of different classifiers can be used to increase the accuracy of classification. In the method proposed in this paper, three inherently different ensemble classifiers, called bagging, AdaBoost, and rotation forest are employed. In this method, the stacked generalization strategy is used as an ensemble strategy. A relatively new dataset is employed to evaluate the performance of the proposed method. The database was added to the UCI Database in ۲۰۱۵ and uses ۳۰ features that appear to be appropriate for distinguishing phishing and non-phishing websites. The present study uses ۱۰-fold-cross-validation method as an evaluation strategy. The numerical results indicate that the proposed method can be used as a promising method for detecting phishing websites. It is worth mentioning that in this method, an F-score of ۹۶.۳ is resulted, which is a good result in detecting phishing.

کلیدواژه ها

Phishing, Classification, Ensembling, Stacked generalization

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.