期刊名称:International Journal of Computer Science and Network Security
印刷版ISSN:1738-7906
出版年度:2008
卷号:8
期号:11
页码:169-176
出版社:International Journal of Computer Science and Network Security
摘要:In this paper, we will discuss on the strategic issues of exploring, harvesting and integrating the Deep Web. We will then develop a novel interdisciplinary stochastic model for a Deep Web Search Engine which can detect and rank the contents optimally. Our efforts aim at opening up to users by building Generator. On this information grand voyage, the Generator will address the challenges of exploring, harvesting and integrating of the Deep Web. First, to make the Web systematically accessible: our Generator will focus on the discovery, modeling and structuring of databases on the Web to develop a search engine, in order to help users find sources useful for their information need. Second, to make the Web uniformly usable: the Generator will help users to make optimal choice of keywords. Based on these insights, we design a stochastic model and employ an interdisciplinary approach consisting stochastic and optimization techniques. In addition, we consider three types of keywords: text-based, image-based and hybrid-based. Experimental and simulation results are given for illustrations.