Journal title: Electronic Journal of Applied Statistical Analysis : Decision Support Systems and Services Evaluation
Electronic ISSN: 2037-3627
Publication year: 2011
Volume: 2
Issue: 1
Pages: 54-64
DOI: 10.1285/i2037-3627v2n1p54
Language: English
Publisher: Università del Salento
Abstract: This work proposes a methodology for the semi-automatic derivation of knowledge from document collections. To extract relevant information from documents, a process integrating both statistical and lexical approaches is applied. We propose a strategy for the semantic evaluation of the extracted index terms, in order to ensure a good correspondence between the information searched for and the information retrieved. Accordingly, we propose a system for the extraction and assessment of the peculiar lexicon. The system can be used to define an ontological model for the semantic processing of a corpus of documents belonging to a specialist domain.