A Comparative Study of Multilayer Neural Network and C۴.۵Decision Tree Models for Predicting the Risk of Breast Cancer

سال انتشار: 1397
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 135

متن کامل این مقاله منتشر نشده است و فقط به صورت چکیده یا چکیده مبسوط در پایگاه موجود می باشد.
توضیح: معمولا کلیه مقالاتی که کمتر از ۵ صفحه باشند در پایگاه سیویلیکا اصل مقاله (فول تکست) محسوب نمی شوند و فقط کاربران عضو بدون کسر اعتبار می توانند فایل آنها را دریافت نمایند.

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_ARCHB-5-1_001

تاریخ نمایه سازی: 24 خرداد 1400

چکیده مقاله:

Background: Diagnosing breast cancer at an early stage can have a great impacton cancer mortality. One of the fundamental problems in cancer treatment is the lackof a proper method for early detection, which may lead to diagnostic errors. Usingdata analysis techniques can significantly help in early diagnosis of the disease. Thepurpose of this study was to evaluate and compare the efficacy of two data miningtechniques, i.e., multilayer neural network and C۴.۵, in early diagnosis of breastcancer. Methods: A data set from Motamed Cancer Institute's breast cancer researchclinic, Tehran, containing ۲۸۶۰ records related to breast cancer risk factors wereused. Of the records, ۱۱۴۱ (۴۰%) were related to malignant changes and breastcancer and ۱۷۱۹ (۶۰%) to benign tumors. The data set was analyzed usingperceptron neural network and decision tree algorithms, and was split into two atraining data set (۷۰%) and a testing data set (۳۰%) using Rapid Miner ۵.۲. Results: For neural networks, accuracy was ۸۰.۵۲%, precision ۸۸.۹۱%, andsensitivity ۹۰.۸۸%; and for decision tree, accuracy was ۸۰.۹۸%, precision ۸۰.۹۷%,and sensitivity ۸۹.۳۲%. Results indicated that both algorithms have acceptablecapabilities for analyzing breast cancer data. Conclusion: Although both models provided good results, neural networkshowed more reliable diagnosis for positive cases. Data set type and analysismethod affect results. On the other hand, information about more powerful riskfactors of breast cancer, such as genetic mutations, can provide models with highcoverage.

نویسندگان

Soolmaz Sohrabi

Shahid Beheshti University of Medical Sciences, Department of Medical Informatics, Tehran, Iran

Alireza Atashi

Department of E-Health, Virtual School, Tehran University of Medical Sciences, Tehran, Iran- Medical Informatics Department, Breast Cancer Research Center, Motamed cancer institute (ACECR), Tehran, Iran

Ali Dadashi

Mashhad University of Medical Sciences, Department Of Medical Informatics, Mashhad, Iran

Sina Marashi

Department of E-Health, Virtual School, Tehran University of Medical Sciences, Tehran, Iran