期刊名称:Proceedings of the National Academy of Sciences
印刷版ISSN:0027-8424
电子版ISSN:1091-6490
出版年度:2022
卷号:119
期号:6
DOI:10.1073/pnas.2114092119
语种:English
出版社:The National Academy of Sciences of the United States of America
摘要:Significance
Pre-messenger RNA (pre-mRNA) splicing is a key regulatory step in gene expression. The splicing reaction is mediated by the spliceosome, a dynamic complex comprising five small nuclear ribonucleoproteins (snRNPs), which assembles onto each intron in multiple steps. We present detailed structural analysis and supporting functional data of an important protein–RNA interaction between human U1 and U2 snRNP. Our structure shows that an intrinsically disordered arginine-glycine (RGG/RG)–rich motif of a U2 snRNP subunit forms an RNA-sequence–specific connection with U1 snRNP. This study broadens the functional scope of unstructured RGG/RG-rich motifs in RNA binding proteins and provides a molecular basis of early steps of spliceosome assembly, which may help develop innovative therapeutic strategies against diseases originating from splicing defects.
In mammals, the structural basis for the interaction between U1 and U2 small nuclear ribonucleoproteins (snRNPs) during the early steps of splicing is still elusive. The binding of the ubiquitin-like (UBL) domain of SF3A1 to the stem-loop 4 of U1 snRNP (U1-SL4) contributes to this interaction. Here, we determined the 3D structure of the complex between the UBL of SF3A1 and U1-SL4 RNA. Our crystallography, NMR spectroscopy, and cross-linking mass spectrometry data show that SF3A1-UBL recognizes, sequence specifically, the GCG/CGC RNA stem and the apical UUCG tetraloop of U1-SL4. In vitro and in vivo mutational analyses support the observed intermolecular contacts and demonstrate that the carboxyl-terminal arginine-glycine-glycine-arginine (RGGR) motif of SF3A1-UBL binds sequence specifically by inserting into the RNA major groove. Thus, the characterization of the SF3A1-UBL/U1-SL4 complex expands the repertoire of RNA binding domains and reveals the capacity of RGG/RG motifs to bind RNA in a sequence-specific manner.