期刊名称:The International Arab Journal of Information Technology
印刷版ISSN:1683-3198
出版年度:2004
卷号:1
期号:1
出版社:Zarqa Private University
摘要:Word prediction methodologies depend heavily on the statistical approach that uses the unigram, bigram, and the trigram of words. However, the construction of the N-gram model requires a very large size of memory, which is beyond the capability of many existing computers. Beside this, the approximation reduces the accuracy of word prediction. In this paper, we suggest to use a cluster of computers to build an Optimal Binary Search Tree (OBST) that will be used for the statistical approach in word prediction. The OBST will contain extra links so that the bigram and the trigram of the language will be presented. In addition, we suggest the incorporation of other enhancements to achieve optimal performance of word prediction. Our experimental results showed that the suggested approach improves the keystroke saving
关键词:Bigram; cluster computing; N-gram; unigram; trigram; word frequency; word prediction