文章基本信息

标题：The test collection for navigational retrieval on WWW data-Design and characteristics
作者：Keizo Oyama ; Haruko Ishikawa ; Koji Eguchi 等
期刊名称：Progress in Informatics
印刷版ISSN：1349-8614
电子版ISSN：1349-8606
出版年度：2005
期号：1
页码：59-73
DOI：10.2201/NiiPi.2005.1.5
出版社：National Institute of Informatics
摘要：This paper describes the design and characteristics of a test collection for navigational retrieval of WWW data that was built through the WEB Task of the Fourth NTCIR Workshop to evaluate the retrieval effectiveness of Web search systems. This reusable test collection consists of 100 gigabytes of Web document data and 300 topics of various types and corresponding relevance judgments. Among the several types of �Navigational Retrieval�, we selected the �Known Item Search�, which simulates a situation where a user searches for one or a few �representative Web pages� of a known item. It is assumed that the user knows about the item but may not have seen its Web page. Relevance judgments were performed on the probable documents mainly from the viewpoint of representativeness of respective known items represented by the topics. Using the judgment results, several evaluation measures were applied to various retrieval results. Based on the evaluation results, relationships among the types of topics, Web-page styles and search methods are discussed. The stability of the evaluation results with different numbers of topics is also analyzed.
关键词：Web information retrieval; evaluation methods; test collections