A New Approach to Improve the Accuracy of the TF_IDF Ranking Algorithm in Text Retrieval
محل انتشار: چهارمین کنفرانس بین المللی وب پژوهی
سال انتشار: 1397
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 580
فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
IRANWEB04_011
تاریخ نمایه سازی: 24 شهریور 1397
چکیده مقاله:
Today, the World Wide Web is considered as the largest source of data with the help of Web search engines, as one of the most useful tools for extracting information. Due to the web growth, providing information related to user queries by search engines is very difficult. Also, the effectiveness of the information retrieval systems is largely dependent on term-weighting. Therefore, search engines use different web mining techniques to rank search results. For this purpose, various ranking algorithms are presented.In this research, the weighting algorithm TF_ IDF is used to rank the documents. By introducing the entropy parameter related to the number of user query words in the text of the documents, the accuracy of the ranking of the documents in the information retrieval is evaluated. The remarkable points obtained from the surveys on standard questions provide a new approach to increasing the efficiency of text search systems, which the responses from subsequent experiments demonstrate its validation. The proposed approach in this paper uses the Standard Web collections and the results show that it can significantly increase the accuracy of retrieval in terms of the volume of test data collection
کلیدواژه ها:
نویسندگان
Azize Nemati
Graduate Student, Department of Computer Engineering, Golestan University Gorgan
Soheila Karbasi
Assistant Professor, Department of Computer Engineering, Golestan University, Gorgan