A novel algorithm applied to classify imbalanced data in Breast Cancer Dataset

Publication year: 1393 (Solar Hijri)
Document type: Conference paper
Language: English

The full text of this paper is available as a 15-page PDF.

National scientific document ID:

MHAA01_045

Indexing date: 17 Esfand 1393

Abstract:

The classification of imbalanced data is of great practical importance. In such data, the class that matters most for the application (the minority class) contains far fewer instances than the class that does not (the majority class). Several methods have been proposed to classify data of this type. One common strategy is to increase the number of minority-class instances relative to the majority class. In this paper, we propose a new algorithm for classifying 5-year data of cancer patients, a dataset that is imbalanced. The proposed algorithm combines the SMOTE algorithm, the Imperialist Competitive Algorithm (ICA), and several well-known classifiers. Its performance is assessed using G-mean, accuracy, specificity, and sensitivity. The results show that the SMOTE+ICA+C5 combination gives the best result in classifying the imbalanced data, which indicates that it is an effective approach to imbalanced data classification.
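The two ideas the abstract relies on, oversampling the minority class and scoring with measures that are sensitive to imbalance, can be illustrated with a short sketch. This is not the authors' implementation: the function names `smote_like_oversample` and `imbalance_metrics`, the toy data, and the parameter defaults are all assumptions made for illustration, and the ICA and C5 stages are not shown. The oversampling function follows the general SMOTE idea of interpolating between a minority sample and one of its nearest minority neighbours, and it assumes at least two minority samples.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (the SMOTE idea).
    Hypothetical helper; requires at least two minority samples."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Nearest neighbours of `base` among the other minority samples.
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position on the segment from base to neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


def imbalance_metrics(y_true, y_pred, positive=1):
    """Sensitivity, specificity, accuracy and G-mean for binary labels,
    where `positive` marks the minority class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": (tp + tn) / len(pairs),
        "g_mean": math.sqrt(sensitivity * specificity),
    }


# Toy data: 2 minority (label 1) and 8 majority (label 0) cases.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
always_majority = [0] * 10  # a classifier that ignores the minority class
print(imbalance_metrics(y_true, always_majority))

minority_points = [(1.0, 1.0), (2.0, 1.5), (1.5, 2.0)]
print(smote_like_oversample(minority_points, n_new=4))
```

On the toy labels, a classifier that always predicts the majority class reaches an accuracy of 0.8 but a sensitivity and G-mean of 0, which is why accuracy alone is a poor measure on imbalanced data and why the abstract reports it alongside G-mean, specificity, and sensitivity.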

Keywords:

Breast cancer, Classification, ICA, Synthetic Minority Over-sampling Technique

Authors

Aref Tahmasb

Graduate student, Shahid Bahonar University of Kerman

Ali Akbar Niknafs

Assistant Professor, Shahid Bahonar University of Kerman

Hamid Mirvaziri

Assistant Professor, Shahid Bahonar University of Kerman
