Induction of decision trees by looking to data sequentially and using error correction rule

سال انتشار: 1395
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 553

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ICIKT08_048

تاریخ نمایه سازی: 5 بهمن 1395

چکیده مقاله:

Decision trees are common algorithms in machine learning. Traditionally, these algorithms make trees recursively and at each step, they inspect data to induce the part of the tree. However decision trees are famous for their instability and high variance in error. In this paper a solution which adds error correction rule to a traditional decision tree algorithm is examined. In fact an algorithm which we call it, ECD3 is introduced. Algorithm of ECD3 inspects data sequentially in an iterative manner and updates tree only when it finds an erroneous observation. This method was first proposed by Dr. Utgoff but not implemented. In this paper, the method is developed and several experiments are performed to evaluate the method. We found that in most cases, performance of ECD3 is comparable to its predecessors. However ECD3 has some benefits over them. First, sizes of its trees are significantly smaller. Second, on average, variance of error in ECD3 is lower. Furthermore, ECD3 automatically chooses part of data for induction of the tree and sets aside others. This capability can be exploited for prototype selection in various learning algorithms. To explain these observations, we use inductive bias and margin definitions in our theories. We introduce a new definition of margin in ordinary decision trees based on shape, size and splitting criteria in trees. We show that how ECD3 expands the margins and enhances precision over test data.

کلیدواژه ها:

نویسندگان

NargesSadat Bathaeian

Computer engineering department Bu-Ali Sina University Hamedan, I.R. of Iran

Muharram Mansoorizadeh

Computer engineering department Bu-Ali Sina University Hamedan, I.R. of Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • _ _ trees: _ ...
  • Alpaydin, E. (2004) Introduction to Machine Learming. Massachusets Institute of ...
  • Mitchell, T. (1997) Machine Learning. McGraw-Hil. ...
  • Li RH, Belford GG (2002) Instability of decision tree ...
  • Zurada, J., (2010) Could Decision Tres Improve the ...
  • Kweku-Muat Osei-Bryson, Kendall Giles, (2006) Splitting methods for decision tre ...
  • نمایش کامل مراجع