
Article information

  • Title: Operationalising "Websites": lexically, semantically or topologically?
  • Authors: Viv Cothey; Isidro Aguillo; Natalia Arroyo
  • Journal: Cybermetrics: International Journal of Scientometrics, Informetrics and Bibliometrics
  • Electronic ISSN: 1137-5019
  • Year of publication: 2006
  • Volume: 10
  • Issue: 1
  • Publisher: Centro de Informacion y Documentacion Cientifica
  • Abstract:

    Methods to investigate the structure of the Web graph in order to better understand its properties are of interest to many researchers. The scale and complexity of the Web-page digraph is typically managed by aggregating or clustering individual Web-pages to form "Websites". It is the properties of these Websites which then become the focus of research. The most popular Web-page clustering technique is "lexical" and uses the URL syntax to assign Web-pages to "Websites". Semantic clustering, that is, clustering Web-pages according to the similarity of their content, has also been proposed. In this paper we consider a third approach to Web-page clustering, which is based on the topological properties of the Web-page within the Web-page digraph. We present the technique and report the results of an experiment to compare the use of URL-lexically and topologically determined Websites in two sub-domains, one within the Spanish country-level domain and the other within the UK country-level domain of the Web.

  • Keywords: World Wide Web; Web graph; Website; topology; cliques; clusters