A Multiple Kernel Learning based Model with Clustered Features for Cancer Stage Detection using Gene Datasets

  • Year of publication: 1402 (Solar Hijri calendar)
  • Published in: International Journal of Engineering, Volume 36, Issue 11
  • COI (CIVILICA Object Identifier): JR_IJE-36-11_008
  • Article language: English

Authors

A. Mohammadjani

Department of Electrical and Computer Engineering, Nooshirvani University of Technology, Babol, Iran

F. Zamani

Department of Electrical and Computer Engineering, Nooshirvani University of Technology, Babol, Iran

Abstract

Genomic data is used in various fields of medicine, including the diagnosis, prediction, and treatment of diseases. Detecting the stage of cancer progression is crucial for treating patients because the mortality rate is higher when cancer is diagnosed at a late stage. Furthermore, the type of treatment varies with the cancer stage. This paper presents a Multiple Kernel Learning based algorithm that predicts the stage of cancer from genomic data. Because genomic data is high-dimensional, the curse of dimensionality may degrade stage prediction. To reduce the dimension, the proposed algorithm first clusters the features. Then, the original data samples are partitioned into smaller subsets of reduced dimension according to the computed feature clusters. Afterward, a kernel matrix is calculated for each subset. The kernel matrices are weighted and then combined linearly. Finally, a cancer stage prediction model is trained using the combined kernel matrix and a Support Vector Machine. The proposed algorithm is compared with baseline methods. It achieves higher classification accuracy than the other methods in 13 of the 15 cancer groups from The Cancer Genome Atlas (TCGA) program dataset.
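
The steps described in the abstract (cluster the features, compute one kernel matrix per feature cluster, weight the kernels, combine them linearly) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which clustering method, kernel function, or weighting rule is used, so the sketch assumes plain k-means over feature columns, an RBF kernel, and weights proportional to kernel-target alignment. All function names and the toy data are hypothetical.

```python
import math

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def cluster_features(X, k, iters=20):
    """Group the columns (features) of X into at most k clusters with k-means.

    Assumption: k-means with the first k features as initial centres;
    the paper's actual feature clustering method is not given in the abstract.
    """
    d = len(X[0])
    cols = [[row[j] for row in X] for j in range(d)]  # one vector per feature
    centers = [cols[j][:] for j in range(k)]
    labels = [0] * d
    for _ in range(iters):
        # Assign every feature to its nearest centre.
        labels = [min(range(k), key=lambda c: sq_dist(col, centers[c]))
                  for col in cols]
        # Move each centre to the mean of its member features.
        for c in range(k):
            members = [cols[j] for j in range(d) if labels[j] == c]
            if members:
                centers[c] = [sum(v) / len(members) for v in zip(*members)]
    groups = [[j for j in range(d) if labels[j] == c] for c in range(k)]
    return [g for g in groups if g]  # drop empty clusters

def rbf_kernel(X, feats, gamma=None):
    """RBF kernel matrix computed only on the features listed in feats."""
    gamma = 1.0 / len(feats) if gamma is None else gamma
    sub = [[row[f] for f in feats] for row in X]  # reduced-dimension subset
    return [[math.exp(-gamma * sq_dist(a, b)) for b in sub] for a in sub]

def alignment(K, y):
    """Kernel-target alignment, with target +1 for same-class pairs, else -1."""
    n = len(y)
    num = sum(K[i][j] * (1.0 if y[i] == y[j] else -1.0)
              for i in range(n) for j in range(n))
    norm_k = math.sqrt(sum(v * v for row in K for v in row))
    return num / (norm_k * n)  # the target matrix has Frobenius norm n

def combined_kernel(X, y, k):
    """Return feature groups, kernel weights, and the weighted kernel sum."""
    groups = cluster_features(X, k)
    kernels = [rbf_kernel(X, g) for g in groups]
    # Assumption: weights proportional to non-negative alignment scores.
    scores = [max(alignment(K, y), 0.0) for K in kernels]
    total = sum(scores)
    if total > 0:
        weights = [s / total for s in scores]
    else:
        weights = [1.0 / len(kernels)] * len(kernels)
    n = len(X)
    K = [[sum(w * Km[i][j] for w, Km in zip(weights, kernels))
          for j in range(n)] for i in range(n)]
    return groups, weights, K

# Toy data: 6 samples, 4 features, two hypothetical stage labels.
X = [
    [0.1, 0.2, 5.0, 1.0],
    [0.2, 0.1, 1.0, 4.0],
    [0.0, 0.3, 3.0, 2.0],
    [2.1, 2.0, 4.0, 1.5],
    [2.0, 2.2, 1.5, 3.5],
    [2.3, 1.9, 2.5, 2.5],
]
y = [0, 0, 0, 1, 1, 1]
groups, weights, K = combined_kernel(X, y, k=2)
```

The combined matrix `K` could then be passed to any SVM implementation that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`. The sketch stops before that step to stay free of external dependencies.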

Keywords

Machine Learning, Multiple Kernel Learning, Bioinformatics, Cancer Stage, Dimension Reduction, The Cancer Genome Atlas (TCGA)
