Machine learning models for diagnostic classification of hepatitis C tests

  • سال انتشار: 1400
  • محل انتشار: مجله پیشگامان انفورماتیک سلامت، دوره: 10، شماره: 1
  • کد COI اختصاصی: JR_IJIMI-10-1_020
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 100
دانلود فایل این مقاله

نویسندگان

Oladosu Oyebisi Oladimeji

Department of Computer Science, University of Ibadan, Ibadan, Nigeria

Abimbola Oladimeji

Department of Chemistry, University of Ibadan, Ibadan, Nigeria

Oladimeji Olayanju

Department of Computer Science and Information Technology, University of Bowen, Iwo, Nigeria

چکیده

Introduction: Hepatitis C is a chronic infection caused by hepatitis c virus - a blood borne virus. Therefore, the infection occurs through exposure to small quantities of blood. It has been estimated by World Health Organization (WHO) to have affected ۷۱ million people worldwide. This infection costs individual, groups and government a lot because no vaccine has been gotten yet for the treatment. This disease is likely to continue to affect more people because it’s long asymptotic phase which makes its early detection not feasible.Material and Methods: In this study, we have presented machine learning models to automatically classify the diagnosis test of hepatitis and also ranked the test features in order to know how they contribute to the classification which help in decision making process by the health care industry. The synthetic minority oversampling technique (SMOTE) was used to solve the problem of imbalance dataset.Results: The models were evaluated based on metrics such as Matthews correlation coefficient, F-measure, Precision-Recall curve and Receiver Operating Characteristic Area Under Curve. We found that using SMOTE techniques helped raise performance of the predictive models. Also, random forest (RF) had the best performance based on Matthews correlation coefficient (۰.۹۹), F-measure (۰.۹۹), Precision-Recall curve (۱.۰۰) and Receiver Operating Characteristic Area Under Curve (۰.۹۹).Conclusion: This discovery has the potential to impact on clinical practice, when health workers aim at classifying diagnosis result of disease at its early stage.

کلیدواژه ها

Hepatitis C, Machine Learning, Health Data Analytics, Decision Making

مقالات مرتبط جدید

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.