Journal: International Journal of Computers and Communications
Print ISSN: 2074-1294
Publication year: 2012
Volume: 6
Issue: 1
Pages: 26-34
Publisher: University Press
Abstract: Offering e-Government services to citizens depends primarily on civil registry data. Searching for a citizen’s data in the civil registry is a common service, carried out by string search algorithms using unique keywords such as the citizen’s name and surname. Some Albanian consonants are pronounced similarly, which makes it hard to find citizens whose names sound alike but are spelled differently. This paper presents a novel approach that extends a string searching algorithm for Albanian names in the Kosovo Civil Registry. It compares the results of Levenshtein distance, American Soundex, and the extended Soundex algorithm on a database of 271,000 citizens of Prishtina municipality. The extended algorithm accommodates basic rules of Albanian pronunciation, and its accuracy and efficiency are better than those of Levenshtein distance and American Soundex.
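
Illustrative sketch (not from the paper): for readers unfamiliar with the two baseline methods named in the abstract, the Python below gives a minimal version of Levenshtein distance and standard American Soundex. The function names `levenshtein` and `soundex` are hypothetical, and the sketch assumes plain A-Z input, so letters such as "ë" or "ç" are simply dropped. The paper's extended Soundex rules are not given in this record and are not reproduced here.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions, substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


# Standard American Soundex consonant groups.
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name: str) -> str:
    """Four-character American Soundex code; assumes A-Z letters only."""
    letters = [ch for ch in name.upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    result = [letters[0]]
    last_code = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != last_code:
            result.append(code)
        last_code = code  # a vowel resets the run, so a repeat is coded again
    return "".join(result)[:4].ljust(4, "0")
```

Both functions compare spellings using rules tuned for English, which is the limitation the abstract describes for names that sound alike in Albanian but are spelled differently.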