Investigating the effect of data augmentation on the performance of machine learning and deep learning methods in detecting fraudulent credit card transactions

  • Publication year: 1401 (Solar Hijri calendar)
  • Published in: 7th National Conference and 1st International Conference on Distributed Computing and Big Data Processing
  • COI code: DCBDP07_063
  • Paper language: English

Authors

Hosein Fanai

Faculty of Information Technology and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran

Hossein Abbasimehr

Faculty of Information Technology and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran

Abstract

With the growth of e-banking in recent years, the rate of fraud in credit card transactions has increased, so a fraud detection system is of particular importance to financial institutions. The datasets used in fraud detection always suffer from class imbalance. Previous research has used various methods to build classification models. In this research, we investigate the effect of data augmentation on the performance of conventional machine learning methods and deep learning methods. For this purpose, we employ widely used machine learning techniques, including decision tree, support vector machine, and random forest, along with two deep neural network models. The experimental results show that data augmentation improves the performance of the random forest in terms of F1-measure. The random forest achieves the best performance among the compared methods. The results also indicate that, in general, data augmentation increases the recall of the models but decreases their precision. In addition, data augmentation reduces the performance of the deep learning methods.
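
The abstract reports results in terms of precision, recall, and F1-measure on class-imbalanced data. The sketch below shows how these three metrics are computed from predictions and how a simple augmentation step can rebalance the classes. It is illustrative only: the abstract does not name the augmentation method used, so random oversampling of the minority class is an assumption here, and the function names `random_oversample` and `precision_recall_f1` are hypothetical.

```python
import random
from collections import Counter


def random_oversample(rows, labels, minority=1, seed=0):
    """Duplicate randomly chosen minority rows until the classes are balanced.

    Assumption: random oversampling stands in for the paper's (unspecified)
    data augmentation method.
    """
    rng = random.Random(seed)
    minority_rows = [r for r, y in zip(rows, labels) if y == minority]
    majority_count = sum(1 for y in labels if y != minority)
    deficit = majority_count - len(minority_rows)
    if not minority_rows or deficit <= 0:
        # Nothing to sample from, or already balanced: return copies unchanged.
        return list(rows), list(labels)
    extra = [rng.choice(minority_rows) for _ in range(deficit)]
    return rows + extra, labels + [minority] * len(extra)


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1 for the positive (fraud) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy data: 98 legitimate transactions (label 0) and 2 fraudulent ones (label 1).
rows = [[float(i)] for i in range(100)]
labels = [0] * 98 + [1] * 2
balanced_rows, balanced_labels = random_oversample(rows, labels)
class_counts = Counter(balanced_labels)

# A prediction that catches every fraud case but raises one false alarm:
# recall is perfect while precision drops.
precision, recall, f1 = precision_recall_f1([1, 1, 0, 0], [1, 1, 1, 0])
```

In the toy prediction at the end, recall is 1.0 while precision is 2/3, which mirrors the kind of trade-off the abstract describes: a model that flags more transactions as fraud catches more true fraud cases at the cost of more false alarms.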

Keywords

Fraud detection, Classification, Data augmentation, Deep learning
