CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Missing data imputation using supervised learning methods

عنوان مقاله: Missing data imputation using supervised learning methods
شناسه ملی مقاله: JR_JSMTA-2-1_011
منتشر شده در در سال 1400
مشخصات نویسندگان مقاله:

Behzad Rezaei Shiri - School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran
Samaneh Eftekhari Mahabadi - School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran

خلاصه مقاله:
Missing data is a very common problem in all research fields. Case deletion is a simple way to handle incomplete data sets which could mislead to biased statistical results. A more reliable approach to handle missing values is imputation which allows covariate-dependent missing mechanism, as well. This paper aims to prepare guidance for researchers facing missing data problems by comparing various imputation methods including machine learning techniques, to achieve better results in supervised learning tasks. A benchmark dataset has experimented and the results are compared by applying popular classifiers over varying missing mechanisms and rates on this benchmark dataset.

کلمات کلیدی:
Imputation, Machine learning algorithms, Missing data, Missing mechanism

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1441900/