The Application of Clustering Methods in Information Retrieval Systems

سال انتشار: 1389
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 2,020

فایل این مقاله در 12 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

IDMC04_012

تاریخ نمایه سازی: 15 دی 1389

چکیده مقاله:

The explosion of on-line information has given rise to many query-based search engines (such as Alta Vista) and manually constructed topic hierarchies (such as Yahoo!). But with the current growth rate in the amount of information, query results grow incomprehensibly large and manual classification in topic hierarchies creates an immense information bottleneck. Therefore, these tools are rapidly becoming inadequate for addressing users’ information needs. Clustering techniques are generally applied for finding unobvious relations and structures in data sets. In this paper, we implement an integrated approach to information retrieval which combines some techniques of clustering in order to evaluate of the effect of clustering methods in the retrieval performance. To capture the relationships among index terms, vector space model are used. Three clustering methods are adapted (such as Modified K-means, Lloyd’s and Splitting algorithm) to the task of clustering documents with respect to the index terms. Our implemented system combines clustering approach with traditional relevance feedback approach of retrieval. The performance evaluation of clustering based IR system is carried out, and a comparison with a traditional IR system is presented. Precision and recall are defined and applied as quantitative evaluation measures of the results. The experiments with various test document sets have shown that in most cases clustering based IR system performs better than the traditional IR system. A series of experiments have been conducted in order to validate this approach; descriptions of those experiments along with the results are presented.

نویسندگان

Aboozar Kalantari Soltanieh

Master Student, School of Computer Science, Physics and Mathematics, Linnaeus university, Vaxjo, Sweden

Ali Haydar

Assistant Professor, Computer Engineering department, Girne American University, Northern Cyprus

Kamil Dimililer

Assistant Professor, Computer Engineering department, Girne American University, Northern Cyprus