期刊名称:Karbala International Journal of Modern Science
印刷版ISSN:2405-609X
电子版ISSN:2405-609X
出版年度:2015
卷号:1
期号:4
页码:212-224
DOI:10.1016/j.kijoms.2015.11.006
语种:English
出版社:Elsevier
摘要:Abstract Membrane cholesterol plays an important role in modulating the function of several membrane proteins. From these proteins, a special cholesterol binding motif is reported to which the membrane cholesterol binds and modulates their activity. This consensus motif is either seen as a forward pattern known as CRAC (L/V-X(1-5)-Y-X(1-5)-R/K) and/or as a backward pattern CARC (R/K-X(1-5)-Y-X(1-5)-L/V). This as such is a low consensus motif as substituting amino acid in the unconserved positions of the motif (‘X’) yields many combinations. In order to obtain a better consensus motif for cholesterol binding, it is worthwhile to look for the same within a membrane proteins superfamily (ABC transporters, GPCRs, etc.) and assign them as a signature motif. Therefore, in the current work an attempt was made to identify the distribution of this motif in all seven helices of GPCR family and assign a consensus signature motif for an individual helix using a novel Fuzzy C-Means (FCM) approach. The workflow proceeds in four phases; first, GPCR protein sequences were extracted from UniProt database that contains seven transmembrane (TM) helices and a cholesterol dictionary has been designed for different window sizes. In second phase; those sequences are filtered which starts with R/K or L/V using both CRAC and CARC cholesterol recognition methods leading to discovery of filtered cholesterol motifs. Third phase leads to identification of significant cholesterol motifs using FCM algorithm by computing the membership of sequences to different motifs and pattern matching with different helices. Finally those uncovered cholesterol motifs that matched with TM helices were analyzed. From the results we report an algorithm that can efficiently identify and assign cholesterol signature motifs in GPCR protein sequences that can be further extended to other membrane proteins.