XML Document Clustering Based on Common Tag Names Anywhere in the Structure

سال انتشار: 1388
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 1,570

فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

CSICC14_065

تاریخ نمایه سازی: 24 خرداد 1388

چکیده مقاله:

One of the most effective ways to extract knowledge from large information resources is applying data mining methods. Since the amount of information on the Internet is exploding, using XML documents is common as they have many advantages. Knowledge extraction from XML documents is a way to provide more utilizable results. XCLS is one of the most efficient algorithms for XML documents clustering. In this paper we represent a new algorithm for clustering XML documents. This algorithm is an improvement over XCLS algorithm which tries to obviate its problems. We implemented both algorithms and evaluated their clustering quality and running time on the same data sets. In both cases, it is shown that the performance of the new algorithm is better.

نویسندگان

Mohamad Alishahi

Islamic Azad University Mashhad Branch

Mehdi Ravakhah

Islamic Azad University Mashhad Branch

Baharak Shakeriaski

Islamic Azad University Ramsar Branch

Mahmud Naghibzade

Ferdowsi university of Mashhad Computer Department