Journal title: Journal of Theoretical and Applied Information Technology
Print ISSN: 1992-8645
Electronic ISSN: 1817-3195
Publication year: 2017
Volume: 95
Issue: 19
Pages: 4973
Publisher: Journal of Theoretical and Applied
Abstract: Some languages are rich in morphological variation but poor in computational linguistic resources. Ngoko Javanese is a morphologically rich language in which words take many variant forms. This paper describes a stemming algorithm for Ngoko Javanese. A Ngoko Javanese stemmer is useful in information retrieval. With this algorithm, the root of a word can be recovered from its surface form. We use a hybrid of rule-based and string-matching techniques. Special rules are created to remove the prefixes and suffixes of Ngoko Javanese terms. The algorithm was tested on hundreds of Ngoko Javanese words, and the results show an accuracy of about 67%.
Keywords: Javanese Language; Stemming; String Matching; Rule-Based Algorithm
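
Illustrative sketch: the abstract describes a hybrid of affix-stripping rules and string matching but does not list the rules themselves. The Python below is a minimal sketch of how such a hybrid might look, not the authors' implementation. The affix lists (`PREFIXES`, `SUFFIXES`), the root lexicon (`ROOTS`), and the function name `stem` are all assumptions introduced here for illustration.

```python
# Hypothetical affix lists for illustration only; the paper's actual
# rule set is not given in the abstract.
PREFIXES = ("di", "tak", "kok", "ke", "sa", "pa")
SUFFIXES = ("ake", "ane", "ne", "an", "e", "i")

# Tiny hypothetical root lexicon used for the string-matching step.
ROOTS = {"tuku", "jupuk", "omah", "pangan"}


def stem(word, roots=ROOTS):
    """Return a root for `word`, or the word itself if none is found.

    Rule-based step: generate candidates by stripping a known prefix,
    a known suffix, or both.
    String-matching step: accept a candidate only if it matches an
    entry in the root lexicon exactly.
    """
    word = word.lower()

    # A word that is already a known root is returned unchanged,
    # which guards against over-stemming (e.g. "pangan").
    if word in roots:
        return word

    # The empty string stands for "strip nothing" on that side.
    prefix_options = [""] + [p for p in PREFIXES if word.startswith(p)]
    suffix_options = [""] + [s for s in SUFFIXES if word.endswith(s)]

    for prefix in prefix_options:
        for suffix in suffix_options:
            candidate = word[len(prefix):len(word) - len(suffix)]
            if candidate in roots:
                return candidate

    # No lexicon match: leave the word as it is.
    return word


if __name__ == "__main__":
    for w in ("dituku", "omahe", "dijupukake", "pangan"):
        print(w, "->", stem(w))
```

Checking every candidate against a lexicon keeps the stripping rules from firing on words that merely look affixed; in this sketch "pangan" is kept whole even though it begins with "pa" and ends with "an". The cost is that any root missing from the lexicon is returned unstemmed.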