Breast Cancer Diagnosis from Perspective of Class Imbalance

  • سال انتشار: 1398
  • محل انتشار: مجله فیزیک پزشکی ایران، دوره: 16، شماره: 3
  • کد COI اختصاصی: JR_IJMP-16-3_008
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 176
دانلود فایل این مقاله

نویسندگان

Jue Zhang

Scholl of Information and Technology, Northwest University, Xi&#۰۳۹;an,China

Li Chen

shool of Information and Technology, Northwest Nniversity, Xi&#۰۳۹;an, Chian

چکیده

Introduction: Breast cancer is the second cause of mortality among women. Early detection is the only rescue to reduce the risk of breast cancer mortality. Traditional methods cannot effectively diagnose tumor since they are based on the assumption of well-balanced dataset.. However, a hybrid method can help to alleviate the two-class imbalance problem existing in the diagnosis of breast cancer and establish a more accurate diagnosis. Material and Methods: The proposed hybrid approach was based on improved Laplacian score (LS) andK-nearest neighbor (KNN) algorithms called LS-KNN. An improved LS algorithm was used for obtaining the optimal feature subset. The KNN with automatic K was utilized for classifying the data which guaranteed the effectiveness of the proposed method by reducing the computational effort and making the classification more faster. The effectiveness of LS-KNN was also examined on two biased-representative breast cancer datasets using classification accuracy, sensitivity, specificity, G-mean, and Matthews correlation coefficient. Results: Applying the proposed algorithm on two breast cancer datasets indicated that the efficiency of the new method was higher than the previously introduced methods. The obtained values of accuracy, sensitivity, specificity, G-mean, and Matthews correlation coefficient were ۹۹.۲۷%, ۹۹.۱۲%, ۹۹.۵۱%, ۹۹.۴۲%, respectively. Conclusion: Experimental results showed that the proposed approach worked well with breast cancer datasets and could be a good alternative to the well-known machine learning methods

کلیدواژه ها

Breast Cancer, classification, imbalance, Computer aided diagnosis

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.