The Surfer Model with a Hybrid Approach to Ranking the Web Pages

  • Publication year: 1395 (Solar Hijri)
  • Published in: Journal of Information Systems and Telecommunication (quarterly), Volume 4, Issue 3
  • Dedicated COI code: JR_JIST-4-3_008
  • Article language: English

Authors

Javad Paksima

Department of Engineering, Payame Noor Yazd University, Yazd, Iran

Homa Khajeh

Department of Engineering, Science and Art University, Yazd, Iran

Abstract

Search begins with users, who seek results relevant to their queries. To meet their needs, thousands of webpages must be ranked, which requires an efficient algorithm that places the relevant webpages at the top ranks. In information retrieval, given the vast amount of information on the World Wide Web, it is highly important to design a ranking algorithm that returns results pertaining to the user's query. In this paper, a ranking method is proposed with a hybrid approach, which considers both the content and the connections of pages. The proposed model is a smart surfer that passes or hops from the current page to one of the externally linked pages with respect to their content. A probability, which is obtained using learning automata along with the content of and links to pages, is used to select the webpage to hop to. For a transition to another page, the content of the pages linked to it is used. As the surfer moves among the pages, the PageRank score of a page is recursively calculated. Two standard datasets, TD2003 and TD2004, were used to evaluate and investigate the proposed method; they are subsets of the LETOR3 dataset. The results indicated the superior performance of the proposed approach over other methods introduced in this area.
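To make the idea of a content-aware surfer concrete, the following is a minimal sketch, not the authors' algorithm. It assumes that each page already has a non-negative relevance score for the query (the hypothetical `relevance` input) and lets the surfer follow an out-link with probability proportional to the relevance of the linked page. The learning automata step described in the abstract is not reproduced here; the function name, the damping value of 0.85, and the handling of dangling pages are all assumptions made for illustration.

```python
from typing import Dict, List


def content_aware_pagerank(
    links: Dict[str, List[str]],
    relevance: Dict[str, float],
    damping: float = 0.85,
    iterations: int = 50,
) -> Dict[str, float]:
    """Power iteration for a surfer that prefers relevant out-links.

    links:     page -> list of pages it links to (hypothetical input format)
    relevance: page -> non-negative content score for the query (assumed given)
    """
    pages = sorted(set(links) | {t for ts in links.values() for t in ts})
    n = len(pages)
    total_rel = sum(relevance.get(p, 0.0) for p in pages)

    # Random-jump distribution: proportional to relevance, uniform if all zero.
    jump = {
        p: (relevance.get(p, 0.0) / total_rel if total_rel > 0 else 1.0 / n)
        for p in pages
    }

    score = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new = {p: (1.0 - damping) * jump[p] for p in pages}
        for p in pages:
            targets = links.get(p, [])
            weights = [relevance.get(t, 0.0) for t in targets]
            w_sum = sum(weights)
            if not targets or w_sum == 0:
                # Dangling page, or no relevant out-link: fall back to a jump.
                for q in pages:
                    new[q] += damping * score[p] * jump[q]
            else:
                # Hop to a linked page with probability proportional
                # to that page's content relevance.
                for t, w in zip(targets, weights):
                    new[t] += damping * score[p] * w / w_sum
        score = new
    return score


if __name__ == "__main__":
    toy_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
    toy_relevance = {"a": 0.2, "b": 0.1, "c": 0.7}
    for page, value in sorted(content_aware_pagerank(toy_links, toy_relevance).items()):
        print(page, round(value, 4))
```

In this sketch the scores always sum to 1, because every page passes on its full score either through its weighted out-links or through the jump distribution. A learning-based variant would adapt the transition probabilities over time instead of fixing them from the relevance scores.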

Keywords

Ranking; Web Pages; Surfer Model; Learning Automata; Information Retrieval
