期刊名称:International Journal of Database Management Systems
印刷版ISSN:0975-5985
电子版ISSN:0975-5705
出版年度:2014
卷号:6
期号:1
页码:61
DOI:10.5121/ijdms.2014.6105
出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:In post genomic era with the advent of new technologies a huge amount of complex molecular data aregenerated with high throughput. The management of this biological data is definitely a challenging taskdue to complexity and heterogeneity of data for discovering new knowledge. Issues like managing noisyand incomplete data are needed to be dealt with. Use of data mining in biological domain has made itsinventory success. Discovering new knowledge from the biological data is a major challenge in datamining technique. The novelty of the proposed model is its combined use of intelligent techniques to classifythe protein sequence faster and efficiently. Use of FFT, fuzzy classifier, String weighted algorithm, gramencoding method, neural network model and rough set classifier in a single model and in an appropriateplace can enhance the quality of the classification system .Thus the primary challenge is to identify andclassify the large protein sequences in a very fast and easy but intellectual way to decrease the timecomplexity and space complexity.