Confidence‑Based Misdiagnosis Analysis in Machine Learning Models for Diabetes Prediction

سال انتشار: 1404
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 43

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

DTIS03_038

تاریخ نمایه سازی: 28 اردیبهشت 1405

چکیده مقاله:

This paper presents a confidence‑based framework for examining diagnostic errors in machine learning models for diabetes prediction. Instead of looking only at accuracy, the framework focuses on how each model represents and handles uncertainty. To evaluate this idea, three models with different probabilistic characteristics—Gaussian Naive Bayes, LogitBoost, and Logistic Regression—were trained on the Pima Indian Diabetes dataset. Using a margin‑based criterion, errors were categorized into Severe Misdiagnosis and Near‑Miss cases. The analysis shows that although the models achieve similar accuracy (LogitBoost ۰.۷۱۹, Logistic Regression ۰.۷۵۸, Naive Bayes ۰.۷۳۹), the three models produce fundamentally different misdiagnosis patterns. LogitBoost yields the highest number of severe errors (۴۲ vs. ۶ and ۱۴), while Logistic Regression and Naive Bayes generate substantially more near‑miss errors (۳۱ and ۲۶ vs. ۱). These results suggest that trust‑oriented error analysis can reveal hidden weaknesses in predictive models and support the development of safer medical diagnosis systems.

نویسندگان

Asieh Khosravanian

Department of Computer Engineering, University of Larestan, Lar, Iran.