A New Approach To Focused Crawling: Combination of Text summarizing With Neural Networks and vector space model

سال انتشار: 1392
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 709

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_ACSIJ-2-3_005

تاریخ نمایه سازی: 24 فروردین 1393

چکیده مقاله:

Focused crawlers are programs designed to browse the Web and download pages on a specific topic. They are used for answering user queries or for building digital libraries on a topic specified by the user. In this article we will show how summarizing of web pages is needed for improving performance of a crawler which uses vector space model to rank the web pages. A neural network is trained to learn the relevant characteristics of sentences that should be included in the summary of a web page. Then the neural network will be used as a filter to summarize web pages. Finally, the crawler will use vector space model to rank summaries instead of web pages

نویسندگان

fahim mohammadi

Department of Information Technology, Institute for Advanced Studies in Basic Sciences (IASBS),Zanjan, Iran