Abstract: Categorizing geographic features from text means assigning text documents that describe geographic features to predefined categories. This paper presents a new approach to this task that begins at the level of the individual document. Latent semantic analysis captures the semantic context of the text and groups similar geographic features into the predefined categories. Incorporating ontologies into latent semantic analysis brings domain knowledge into the categorization process. The proposed approach can assign each geographic feature to more than one category and can identify a set of key concepts that represent each category. An experimental evaluation of the proposed approach showed promising results in categorizing geographic features from text.
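The idea of categorizing documents with latent semantic analysis can be illustrated with a minimal sketch. It is not the paper's implementation: the toy documents, the category seed texts (standing in here for ontology concepts), the rank `k`, the similarity threshold, and all function names are assumptions made for illustration. The sketch builds a term-document matrix, takes a truncated SVD, projects each text into the latent space, and assigns a text to every category whose seed text is similar enough, so one feature may receive several categories.

```python
import re
import numpy as np

# Hypothetical toy corpus: short descriptions of geographic features.
DOCS = [
    "river flows through valley into lake",
    "lake water drains into river and stream",
    "mountain peak rises above ridge and valley",
    "ridge connects mountain peak to summit",
]

# Hypothetical category seed texts, standing in for ontology concepts.
CATEGORIES = {
    "hydrography": "river lake stream water",
    "terrain": "mountain peak ridge valley summit",
}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def term_vector(text, vocab):
    """Raw term-frequency vector of `text` over `vocab`."""
    v = np.zeros(len(vocab))
    for w in tokenize(text):
        if w in vocab:
            v[vocab[w]] += 1.0
    return v


def fit_lsa(texts, k):
    """Build a term-document matrix and return (vocab, U_k) from its SVD."""
    words = sorted({w for t in texts for w in tokenize(t)})
    vocab = {w: i for i, w in enumerate(words)}
    A = np.column_stack([term_vector(t, vocab) for t in texts])
    U, _, _ = np.linalg.svd(A, full_matrices=False)
    return vocab, U[:, : min(k, U.shape[1])]


def cosine(a, b):
    """Cosine similarity; 0.0 when either vector is all zeros."""
    na, nb = np.linalg.norm(a), np.linalg.norm(b)
    return float(a @ b / (na * nb)) if na > 0 and nb > 0 else 0.0


def categorize(text, categories, vocab, U_k, threshold=0.5):
    """Return every category whose seed text is close to `text` in latent space."""
    doc = U_k.T @ term_vector(text, vocab)  # project onto the k latent dimensions
    labels = []
    for name, seed in categories.items():
        cat = U_k.T @ term_vector(seed, vocab)
        if cosine(doc, cat) >= threshold:
            labels.append(name)
    return sorted(labels)


if __name__ == "__main__":
    # Fit on documents plus seed texts so that seed terms are in the vocabulary.
    vocab, U_k = fit_lsa(DOCS + list(CATEGORIES.values()), k=2)
    for d in DOCS:
        print(d, "->", categorize(d, CATEGORIES, vocab, U_k))
```

Because assignment uses a threshold rather than picking the single best match, a description that mixes vocabulary from several categories can be placed in more than one of them. The threshold and the rank `k` are tuning choices in this sketch, not values taken from the paper.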