Journal: International Journal of Computer Science & Technology
Print ISSN: 2229-4333
Online ISSN: 0976-8491
Year: 2011
Volume: 2
Issue: 2 (Version 1)
Publisher: Ayushmaan Technologies
Abstract: In web search engines, information retrieval is challenging because queries are short, ambiguous, and noisy. This can be addressed by classifying queries into appropriate categories. In this paper we propose a web query classification system based on a state space tree, a hierarchical arrangement of categories as states at different levels. The user's query is passed to the Yahoo directory search, and the resulting categories are extracted as features for further processing. The extracted features are mapped to the target categories using direct mapping, WordNet mapping, and glossary mapping. The frequency with which each target category term is matched by the various mapping techniques is recorded at the corresponding nodes of the state space tree. Performing a best-first search on the state space tree yields a ranked list of categories. Compared with manual classification, this technique achieved a precision of 0.66.
Keywords: Web Query; Classification; Intermediate categories; State space tree
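
The ranking step described in the abstract (match frequencies recorded at tree nodes, then a best-first search producing a ranked category list) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class `CategoryNode`, the functions `record_match` and `best_first_rank`, the example category names, and the choice to count a match only at the matched node are all assumptions made for illustration, since this record does not give the paper's data structures or scoring details.

```python
import heapq
from dataclasses import dataclass, field
from itertools import count


@dataclass
class CategoryNode:
    """One state in a hypothetical category tree."""
    name: str
    hits: int = 0  # how often a mapping step matched this category
    children: list["CategoryNode"] = field(default_factory=list)


def record_match(root, path):
    """Walk (and create if needed) the nodes along `path`, then count one
    match at the final node. Counting only at the final node is an
    assumption of this sketch."""
    node = root
    for name in path:
        for child in node.children:
            if child.name == name:
                node = child
                break
        else:
            child = CategoryNode(name)
            node.children.append(child)
            node = child
    node.hits += 1


def best_first_rank(root):
    """Best-first traversal: always expand the frontier node with the most
    hits, emitting every visited category that has at least one hit."""
    tie = count()  # unique tie-breaker so nodes themselves are never compared
    frontier = [(0, next(tie), root, ())]
    ranking = []
    while frontier:
        _, _, node, path = heapq.heappop(frontier)
        if node is not root and node.hits > 0:
            ranking.append(("/".join(path), node.hits))
        for child in node.children:
            # heapq is a min-heap, so negate hits to pop the largest first
            heapq.heappush(
                frontier, (-child.hits, next(tie), child, path + (child.name,))
            )
    return ranking
```

In this sketch, each successful mapping (direct, WordNet, or glossary) would call `record_match` with the path of the matched target category, and `best_first_rank` would then be called once per query. Because the search only expands children after visiting their parent, a heavily matched subcategory under a rarely matched parent appears later in the list than a global sort by hit count would place it.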