Web Page Streams and Relevance Propagation for Topic Distillation

سال انتشار: 1392
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 140

فایل این مقاله در 12 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_ITRC-6-1_005

تاریخ نمایه سازی: 22 فروردین 1401

چکیده مقاله:

Over the past decade, several studies in field of relevance propagation models have been proposed to improve quality of web search, which include hyperlink-based score propagation, hyperlinkbased term propagation and popularity-based relevance propagation models; however, all of them have used low precision content similarity functions in the propagation process and their throughputs are not entirely satisfactory. In this paper, two stream-based content similarity functions that could be used to derive new relevance propagation models were introduced. In the proposed content similarity functions, the web page was split to different streams with different degrees of importance and the text of each web page was divided between these streams. To evaluate the proposed relevance propagation models, Letor ۳.۰ (including two standard web test collections) was used in the experiments. It was concluded that splitting web pages as different streams could provide significant improvement in relevance propagation models.