期刊名称:Journal of Computer Applications in Archaeology
电子版ISSN:2514-8362
出版年度:2019
卷号:2
期号:1
页码:21-30
DOI:10.5334/jcaa.33
摘要:In this paper, we present the results of user requirement solicitation for a search system of grey literature in archaeology, specifically Dutch excavation reports. This search system uses Named Entity Recognition and Information Retrieval techniques to create an effective and effortless search experience. Specifically, we used Conditional Random Fields to identify entities, with an average accuracy of 56%. This is a baseline result, and we identified many possibilities for improvement. These entities were indexed in ElasticSearch and a user interface was developed on top of the index. This proof of concept was used in user requirement solicitation and evaluation with a group of end users. Feedback from this group indicated that there is a dire need for such a system, and that the first results are promising.
关键词:Grey Literature; Named Entity Recognition; Information Retrieval; Big Data; Machine Learning