Improvement of the performance of machine learning algorithms in predicting breast cancer

سال انتشار: 1402
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 54

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_IJIMI-12-1_004

تاریخ نمایه سازی: 14 آذر 1402

چکیده مقاله:

Introduction: Breast cancer is one of the most common cancers among women compared to all other ones. Machine learning (ML) techniques can bring a large contribute on the process of prediction and early diagnosis of breast cancer, became a research hotspot and has been proved as a strong technique. Using ML models performed on multidimensional dataset, this article aims to find the most efficient and accurate ML models for tumor classification prediction. Material and Methods: Several supervised ML algorithms were utilized to diagnosis and prediction of cancer tumor such as Logistic Regression Decision Tree, Random Forest and KNN. The algorithms are applied to a dataset taken from the UCI repository including ۶۹۹ samples. The dataset includes Breast cancer features. To enhance the algorithms’ performance, these features are analyzed, the feature importance score and cross validation are considered. In this research, ML algorithms improved coupled by limited and selective features to produce high classification accuracy in tumor classification. Results: As a result of evaluation, Logistic Regression algorithm with accuracy value equal to ۹۹.۱۴%, AUC ROC equal to ۹۹.۶%, Extra Tree algorithm with accuracy value equal to ۹۹.۱۴% and AUC ROC equal to ۹۹.۱% have better performance than other algorithms. Therefore, these techniques can be useful for diagnosis and prediction of cancer tumor and prescribe it correctly. Conclusion: The technique of ML can be used in medicine for analyzing the related data collections to a disease and its prediction. The area under the ROC curve and evaluating criteria related to a number of classifying algorithms of ML to evaluate breast cancer and indeed, the diagnosis and prediction of breast cancer is compared to determine the most appropriate classifier.

نویسندگان

Maryam Poornajaf

Faculty Member, Department of Computer Engineering, Technical and Vocational University (TVU), Tehran, Iran

Sajad Yousefi

Faculty Member, Department of Electrical Engineering, Technical and Vocational University (TVU), Tehran, Iran