1
|
Sha M, Parveen Rahamathulla M. Splice site recognition - deciphering Exon-Intron transitions for genetic insights using Enhanced integrated Block-Level gated LSTM model. Gene 2024; 915:148429. [PMID: 38575098 DOI: 10.1016/j.gene.2024.148429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 03/26/2024] [Accepted: 04/01/2024] [Indexed: 04/06/2024]
Abstract
Bioinformatics is a contemporary interdisciplinary area focused on analyzing the growing number of genome sequences. Gene variants are differences in DNA sequences among individuals within a population. Splice site recognition is a crucial step in the process of gene expression, where the coding sequences of genes are joined together to form mature messenger RNA (mRNA). These genetic variants that disrupt genes are believed to be the primary reason for neuro-developmental disorders like ASD (Autism Spectrum Disorder) is a neuro-developmental disorder that is diagnosed in individuals, families, and society and occurs as the developmental delay in one among the hundred genes that are associated with these disorders. Missense variants, premature stop codons, or deletions alter both the quality and quantity of encoded proteins. Predicting genes within exons and introns presents main challenges, such as dealing with sequencing errors, short reads, incomplete genes, overlapping, and more. Although many traditional techniques have been utilized in creating an exon prediction system, the primary challenge lies in accurately identifying the length and spliced strand location classification of exons in conjunction with introns. From now on, the suggested approach utilizes a Deep Learning algorithm to analyze intricate and extensive genomic datasets. M-LSTM is utilized to categorize three binary combinations (EI as 1, IE as 2, and none as 3) using spliced DNA strands. The M-LSTM system is able to sequence extensive datasets, ensuring that long information can be stored without any impact on the current input or output. This enables it to recognize and address long-term connections and problems with rapidly increasing gradients. The proposed model is compared internally with Naïve Bayes and Random Forest to assess its efficacy. Additionally, the proposed model's performance is forecasted by utilizing probabilistic parameters like recall, F1-score, precision, and accuracy to assess the effectiveness of the proposed system.
Collapse
|
2
|
Gao P, Zhao Y, Xu G, Zhong Y, Sun C. Unique features of conventional and nonconventional introns in Euglena gracilis. BMC Genomics 2024; 25:595. [PMID: 38872102 PMCID: PMC11170887 DOI: 10.1186/s12864-024-10495-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 06/05/2024] [Indexed: 06/15/2024] Open
Abstract
BACKGROUND Nuclear introns in Euglenida have been understudied. This study aimed to investigate nuclear introns in Euglenida by identifying a large number of introns in Euglena gracilis (E. gracilis), including cis-spliced conventional and nonconventional introns, as well as trans-spliced outrons. We also examined the sequence characteristics of these introns. RESULTS A total of 28,337 introns and 11,921 outrons were identified. Conventional and nonconventional introns have distinct splice site features; the former harbour canonical GT/C-AG splice sites, whereas the latter are capable of forming structured motifs with their terminal sequences. We observed that short introns had a preference for canonical GT-AG introns. Notably, conventional introns and outrons in E. gracilis exhibited a distinct cytidine-rich polypyrimidine tract, in contrast to the thymidine-rich tracts observed in other organisms. Furthermore, the SL-RNAs in E. gracilis, as well as in other trans-splicing species, can form a recently discovered motif called the extended U6/5' ss duplex with the respective U6s. We also describe a novel type of alternative splicing pattern in E. gracilis. The tandem repeat sequences of introns in this protist were determined, and their contents were comparable to those in humans. CONCLUSIONS Our findings highlight the unique features of E. gracilis introns and provide insights into the splicing mechanism of these introns, as well as the genomics and evolution of Euglenida.
Collapse
|
3
|
Kramárek M, Souček P, Réblová K, Grodecká L, Freiberger T. Splicing analysis of STAT3 tandem donor suggests non-canonical binding registers for U1 and U6 snRNAs. Nucleic Acids Res 2024; 52:5959-5974. [PMID: 38426935 PMCID: PMC11162779 DOI: 10.1093/nar/gkae147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 02/02/2024] [Accepted: 02/16/2024] [Indexed: 03/02/2024] Open
Abstract
Tandem donor splice sites (5'ss) are unique regions with at least two GU dinucleotides serving as splicing cleavage sites. The Δ3 tandem 5'ss are a specific subclass of 5'ss separated by 3 nucleotides which can affect protein function by inserting/deleting a single amino acid. One 5'ss is typically preferred, yet factors governing particular 5'ss choice are not fully understood. A highly conserved exon 21 of the STAT3 gene was chosen as a model to study Δ3 tandem 5'ss splicing mechanisms. Based on multiple lines of experimental evidence, endogenous U1 snRNA most likely binds only to the upstream 5'ss. However, the downstream 5'ss is used preferentially, and the splice site choice is not dependent on the exact U1 snRNA binding position. Downstream 5'ss usage was sensitive to exact nucleotide composition and dependent on the presence of downstream regulatory region. The downstream 5'ss usage could be best explained by two novel interactions with endogenous U6 snRNA. U6 snRNA enables the downstream 5'ss usage in STAT3 exon 21 by two mechanisms: (i) binding in a novel non-canonical register and (ii) establishing extended Watson-Crick base pairing with the downstream regulatory region. This study suggests that U6:5'ss interaction is more flexible than previously thought.
Collapse
|
4
|
Zheng R, Dunlap M, Bobkov GOM, Gonzalez-Figueroa C, Patel KJ, Lyu J, Harvey SE, Chan TW, Quinones-Valdez G, Choudhury M, Le Roux CA, Bartels MD, Vuong A, Flynn RA, Chang HY, Van Nostrand EL, Xiao X, Cheng C. hnRNPM protects against the dsRNA-mediated interferon response by repressing LINE-associated cryptic splicing. Mol Cell 2024; 84:2087-2103.e8. [PMID: 38815579 PMCID: PMC11204102 DOI: 10.1016/j.molcel.2024.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 01/09/2024] [Accepted: 05/07/2024] [Indexed: 06/01/2024]
Abstract
RNA splicing is pivotal in post-transcriptional gene regulation, yet the exponential expansion of intron length in humans poses a challenge for accurate splicing. Here, we identify hnRNPM as an essential RNA-binding protein that suppresses cryptic splicing through binding to deep introns, maintaining human transcriptome integrity. Long interspersed nuclear elements (LINEs) in introns harbor numerous pseudo splice sites. hnRNPM preferentially binds at intronic LINEs to repress pseudo splice site usage for cryptic splicing. Remarkably, cryptic exons can generate long dsRNAs through base-pairing of inverted ALU transposable elements interspersed among LINEs and consequently trigger an interferon response, a well-known antiviral defense mechanism. Significantly, hnRNPM-deficient tumors show upregulated interferon-associated pathways and elevated immune cell infiltration. These findings unveil hnRNPM as a guardian of transcriptome integrity by repressing cryptic splicing and suggest that targeting hnRNPM in tumors may be used to trigger an inflammatory immune response, thereby boosting cancer surveillance.
Collapse
|
5
|
D'Incal CP, Annear DJ, Elinck E, van der Smagt JJ, Alders M, Dingemans AJM, Mateiu L, de Vries BBA, Vanden Berghe W, Kooy RF. Loss-of-function of activity-dependent neuroprotective protein (ADNP) by a splice-acceptor site mutation causes Helsmoortel-Van der Aa syndrome. Eur J Hum Genet 2024; 32:630-638. [PMID: 38424297 PMCID: PMC11153555 DOI: 10.1038/s41431-024-01556-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 01/19/2024] [Accepted: 01/25/2024] [Indexed: 03/02/2024] Open
Abstract
Mutations in ADNP result in Helsmoortel-Van der Aa syndrome. Here, we describe the first de novo intronic deletion, affecting the splice-acceptor site of the first coding ADNP exon in a five-year-old girl with developmental delay and autism. Whereas exome sequencing failed to detect the non-coding deletion, genome-wide CpG methylation analysis revealed an episignature suggestive of a Helsmoortel-Van der Aa syndrome diagnosis. This diagnosis was further supported by PhenoScore, a novel facial recognition software package. Subsequent whole-genome sequencing resolved the three-base pair ADNP deletion c.[-5-1_-4del] with transcriptome sequencing showing this deletion leads to skipping of exon 4. An N-terminal truncated protein could not be detected in transfection experiments with a mutant expression vector in HEK293T cells, strongly suggesting this is a first confirmed diagnosis exclusively due to haploinsufficiency of the ADNP gene. Pathway analysis of the methylome indicated differentially methylated genes involved in brain development, the cytoskeleton, locomotion, behavior, and muscle development. Along the same line, transcriptome analysis identified most of the differentially expressed genes as upregulated, in line with the hypomethylated CpG episignature and confirmed the involvement of the cytoskeleton and muscle development pathways that are also affected in patient cell lines and animal models. In conclusion, this novel mutation for the first time demonstrates that Helsmoortel-Van der Aa syndrome can be caused by a loss-of-function mutation. Moreover, our study elegantly illustrates the use of EpiSignatures, WGS and Phenoscore as novel complementary diagnostic tools in case a of negative WES result.
Collapse
|
6
|
Leckie KM, Sawler J, Kapos P, MacKenzie JO, Giles I, Baynes K, Lo J, Baute GJ, Celedon JM. Loss of daylength sensitivity by splice site mutation in Cannabis pseudo-response regulator. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:2020-2036. [PMID: 38525679 DOI: 10.1111/tpj.16726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/08/2024] [Accepted: 03/10/2024] [Indexed: 03/26/2024]
Abstract
Photoperiod insensitivity (auto-flowering) in drug-type Cannabis sativa circumvents the need for short day (SD) flowering requirements making outdoor cultivation in high latitudes possible. However, the benefits of photoperiod insensitivity are counterbalanced by low cannabinoid content and poor flower quality in auto-flowering genotypes. Despite recent studies in cannabis flowering, a mechanistic understanding of photoperiod insensitivity is still lacking. We used a combination of genome-wide association study and genetic fine-mapping to identify the genetic cause of auto-flowering in cannabis. We then used gene expression analyses and transient transformation assays to characterize flowering time control. Herein, we identify a splice site mutation within circadian clock gene PSEUDO-RESPONSE REGULATOR 37 (CsPRR37) in auto-flowering cannabis. We show that CsPRR37 represses FT expression and its circadian oscillations transition to a less repressive state during SD as compared to long days (LD). We identify several key circadian clock genes whose expression is altered in auto-flowering cannabis, particularly under non-inductive LD. Research into the pervasiveness of this mutation and others affecting flowering time will help elucidate cannabis domestication history and advance cannabis breeding toward a more sustainable outdoor cultivation system.
Collapse
|
7
|
Buerer L, Clark NE, Welch A, Duan C, Taggart AJ, Townley BA, Wang J, Soemedi R, Rong S, Lin CL, Zeng Y, Katolik A, Staley JP, Damha MJ, Mosammaparast N, Fairbrother WG. The debranching enzyme Dbr1 regulates lariat turnover and intron splicing. Nat Commun 2024; 15:4617. [PMID: 38816363 PMCID: PMC11139901 DOI: 10.1038/s41467-024-48696-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 05/05/2024] [Indexed: 06/01/2024] Open
Abstract
The majority of genic transcription is intronic. Introns are removed by splicing as branched lariat RNAs which require rapid recycling. The branch site is recognized during splicing catalysis and later debranched by Dbr1 in the rate-limiting step of lariat turnover. Through generation of a viable DBR1 knockout cell line, we find the predominantly nuclear Dbr1 enzyme to encode the sole debranching activity in human cells. Dbr1 preferentially debranches substrates that contain canonical U2 binding motifs, suggesting that branchsites discovered through sequencing do not necessarily represent those favored by the spliceosome. We find that Dbr1 also exhibits specificity for particular 5' splice site sequences. We identify Dbr1 interactors through co-immunoprecipitation mass spectrometry. We present a mechanistic model for Dbr1 recruitment to the branchpoint through the intron-binding protein AQR. In addition to a 20-fold increase in lariats, Dbr1 depletion increases exon skipping. Using ADAR fusions to timestamp lariats, we demonstrate a defect in spliceosome recycling. In the absence of Dbr1, spliceosomal components remain associated with the lariat for a longer period of time. As splicing is co-transcriptional, slower recycling increases the likelihood that downstream exons will be available for exon skipping.
Collapse
|
8
|
Sarka K, Katzman S, Zahler AM. A role for SNU66 in maintaining 5' splice site identity during spliceosome assembly. RNA (NEW YORK, N.Y.) 2024; 30:695-709. [PMID: 38443114 PMCID: PMC11098459 DOI: 10.1261/rna.079971.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 02/21/2024] [Indexed: 03/07/2024]
Abstract
In spliceosome assembly, the 5' splice site is initially recognized by U1 snRNA. U1 leaves the spliceosome during the assembly process, therefore other factors contribute to the maintenance of 5' splice site identity as it is loaded into the catalytic site. Recent structural data suggest that human tri-snRNP 27K (SNRP27) M141 and SNU66 H734 interact to stabilize the U4/U6 quasi-pseudo knot at the base of the U6 snRNA ACAGAGA box in pre-B complex. Previously, we found that mutations in Caenorhabditis elegans at SNRP-27 M141 promote changes in alternative 5'ss usage. We tested whether the potential interaction between SNRP-27 M141 and SNU-66 H765 (the C. elegans equivalent position to human SNU66 H734) contributes to maintaining 5' splice site identity during spliceosome assembly. We find that SNU-66 H765 mutants promote alternative 5' splice site usage. Many of the alternative 5' splicing events affected by SNU-66(H765G) overlap with those affected SNRP-27(M141T). Double mutants of snrp-27(M141T) and snu-66(H765G) are homozygous lethal. We hypothesize that mutations at either SNRP-27 M141 or SNU-66 H765 allow the spliceosome to load alternative 5' splice sites into the active site. Tests with mutant U1 snRNA and swapped 5' splice sites indicate that the ability of SNRP-27 M141 and SNU-66 H765 mutants to affect a particular 5' splice alternative splicing event is dependent on both the presence of a weaker consensus 5'ss nearby and potentially nearby splicing factor binding sites. Our findings confirm a new role for the C terminus of SNU-66 in maintenance of 5' splice site identity during spliceosome assembly.
Collapse
|
9
|
McCue K, Burge CB. An interpretable model of pre-mRNA splicing for animal and plant genes. SCIENCE ADVANCES 2024; 10:eadn1547. [PMID: 38718117 PMCID: PMC11078188 DOI: 10.1126/sciadv.adn1547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Accepted: 04/04/2024] [Indexed: 05/12/2024]
Abstract
Pre-mRNA splicing is a fundamental step in gene expression, conserved across eukaryotes, in which the spliceosome recognizes motifs at the 3' and 5' splice sites (SSs), excises introns, and ligates exons. SS recognition and pairing is often influenced by protein splicing factors (SFs) that bind to splicing regulatory elements (SREs). Here, we describe SMsplice, a fully interpretable model of pre-mRNA splicing that combines models of core SS motifs, SREs, and exonic and intronic length preferences. We learn models that predict SS locations with 83 to 86% accuracy in fish, insects, and plants and about 70% in mammals. Learned SRE motifs include both known SF binding motifs and unfamiliar motifs, and both motif classes are supported by genetic analyses. Our comparisons across species highlight similarities between non-mammals, increased reliance on intronic SREs in plant splicing, and a greater reliance on SREs in mammalian splicing.
Collapse
|
10
|
Holm LL, Doktor TK, Flugt KK, Petersen US, Petersen R, Andresen B. All exons are not created equal-exon vulnerability determines the effect of exonic mutations on splicing. Nucleic Acids Res 2024; 52:4588-4603. [PMID: 38324470 PMCID: PMC11077056 DOI: 10.1093/nar/gkae077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 01/05/2024] [Accepted: 01/26/2024] [Indexed: 02/09/2024] Open
Abstract
It is now widely accepted that aberrant splicing of constitutive exons is often caused by mutations affecting cis-acting splicing regulatory elements (SREs), but there is a misconception that all exons have an equal dependency on SREs and thus a similar vulnerability to aberrant splicing. We demonstrate that some exons are more likely to be affected by exonic splicing mutations (ESMs) due to an inherent vulnerability, which is context dependent and influenced by the strength of exon definition. We have developed VulExMap, a tool which is based on empirical data that can designate whether a constitutive exon is vulnerable. Using VulExMap, we find that only 25% of all exons can be categorized as vulnerable, whereas two-thirds of 359 previously reported ESMs in 75 disease genes are located in vulnerable exons. Because VulExMap analysis is based on empirical data on splicing of exons in their endogenous context, it includes all features important in determining the vulnerability. We believe that VulExMap will be an important tool when assessing the effect of exonic mutations by pinpointing whether they are located in exons vulnerable to ESMs.
Collapse
|
11
|
Malard F, Wolter AC, Marquevielle J, Morvan E, Ecoutin A, Rüdisser S, Allain FT, Campagne S. The diversity of splicing modifiers acting on A-1 bulged 5'-splice sites reveals rules for rational drug design. Nucleic Acids Res 2024; 52:4124-4136. [PMID: 38554107 PMCID: PMC11077090 DOI: 10.1093/nar/gkae201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 12/07/2023] [Accepted: 03/07/2024] [Indexed: 04/01/2024] Open
Abstract
Pharmacological modulation of RNA splicing by small molecules is an emerging facet of drug discovery. In this context, the SMN2 splicing modifier SMN-C5 was used as a prototype to understand the mode of action of small molecule splicing modifiers and propose the concept of 5'-splice site bulge repair. In this study, we combined in vitro binding assays and structure determination by NMR spectroscopy to identify the binding modes of four other small molecule splicing modifiers that switch the splicing of either the SMN2 or the HTT gene. Here, we determined the solution structures of risdiplam, branaplam, SMN-CX and SMN-CY bound to the intermolecular RNA helix epitope containing an unpaired adenine within the G-2A-1G+1U+2 motif of the 5'-splice site. Despite notable differences in their scaffolds, risdiplam, SMN-CX, SMN-CY and branaplam contact the RNA epitope similarly to SMN-C5, suggesting that the 5'-splice site bulge repair mechanism can be generalised. These findings not only deepen our understanding of the chemical diversity of splicing modifiers that target A-1 bulged 5'-splice sites, but also identify common pharmacophores required for modulating 5'-splice site selection with small molecules.
Collapse
|
12
|
Atkinson R, Georgiou M, Yang C, Szymanska K, Lahat A, Vasconcelos EJR, Ji Y, Moya Molina M, Collin J, Queen R, Dorgau B, Watson A, Kurzawa-Akanbi M, Laws R, Saxena A, Shyan Beh C, Siachisumo C, Goertler F, Karwatka M, Davey T, Inglehearn CF, McKibbin M, Lührmann R, Steel DH, Elliott DJ, Armstrong L, Urlaub H, Ali RR, Grellscheid SN, Johnson CA, Mozaffari-Jovin S, Lako M. PRPF8-mediated dysregulation of hBrr2 helicase disrupts human spliceosome kinetics and 5´-splice-site selection causing tissue-specific defects. Nat Commun 2024; 15:3138. [PMID: 38605034 PMCID: PMC11009313 DOI: 10.1038/s41467-024-47253-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 03/19/2024] [Indexed: 04/13/2024] Open
Abstract
The carboxy-terminus of the spliceosomal protein PRPF8, which regulates the RNA helicase Brr2, is a hotspot for mutations causing retinitis pigmentosa-type 13, with unclear role in human splicing and tissue-specificity mechanism. We used patient induced pluripotent stem cells-derived cells, carrying the heterozygous PRPF8 c.6926 A > C (p.H2309P) mutation to demonstrate retinal-specific endophenotypes comprising photoreceptor loss, apical-basal polarity and ciliary defects. Comprehensive molecular, transcriptomic, and proteomic analyses revealed a role of the PRPF8/Brr2 regulation in 5'-splice site (5'SS) selection by spliceosomes, for which disruption impaired alternative splicing and weak/suboptimal 5'SS selection, and enhanced cryptic splicing, predominantly in ciliary and retinal-specific transcripts. Altered splicing efficiency, nuclear speckles organisation, and PRPF8 interaction with U6 snRNA, caused accumulation of active spliceosomes and poly(A)+ mRNAs in unique splicing clusters located at the nuclear periphery of photoreceptors. Collectively these elucidate the role of PRPF8/Brr2 regulatory mechanisms in splicing and the molecular basis of retinal disease, informing therapeutic approaches.
Collapse
|
13
|
Liang Q, Zhang Z, Ding B, Shao Y, Ding Q, Dai J, Hu X, Wu W, Wang X. A noncanonical splicing variant c.875-5 T > G in von Willebrand factor causes in-frame exon skipping and type 2A von Willebrand disease. Thromb Res 2024; 236:51-60. [PMID: 38387303 DOI: 10.1016/j.thromres.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Revised: 01/16/2024] [Accepted: 02/01/2024] [Indexed: 02/24/2024]
Abstract
INTRODUCTION A novel variant involving noncanonical splicing acceptor site (c.875-5 T > G) in propeptide coding region of von Willebrand factor (VWF) was identified in a patient with type 2A von Willebrand disease (VWD), who co-inherited with a null variant (p.Tyr271*) and presented characteristic discrepancy of plasma level of VWF antigen and activity, and a selective reduction of both intermediate-molecular-weight (IMWMs) and high-molecular-weight VWF multimers (HMWMs). MATERIALS AND METHODS VWF mRNA transcripts obtained from peripheral leukocytes and platelets of the patients were investigated to analyze the consequence of c.875-5 T > G on splicing. The impact of the variant on expression and multimer assembly was further analyzed by in vitro expression studies in AtT-20 cells. The intracellular processing of VWF mutant and the Weibel-Palade bodies (WPBs) formation was evaluated by immunofluorescence staining and electron microscopy. RESULTS The mRNA transcript analysis revealed that c.875-5 T > G variant led to exon 8 skipping and an in-frame deletion of 41 amino acids in the D1 domain of VWF (p.Ser292_Glu333delinsLys), yielding a truncated propeptide. Consistent with the patient's laboratory manifestations, the AtT-20 cells transfected with mutant secreted less VWF, with the VWF antigen level in conditioned medium 47 % of wild-type. A slight retention in the endoplasmic reticulum was observed for the mutant. Almost complete loss of IMWMs and HMWMs in the medium and impaired WPBs formation in the cell, indicating truncated VWF propeptide lost its chaperon-like function for VWF multimerization and tubular storage. CONCLUSIONS The VWF splicing site variant (c.875-5 T > G) causes propeptide truncation, severely compromising VWF multimer assembly and tubular storage.
Collapse
|
14
|
Zhang H, Xin M, Lin L, Chen C, Balestra D, Ding Q. Pleiotropic effects of different exonic nucleotide changes at the same position contribute to hemophilia B phenotypic variation. J Thromb Haemost 2024; 22:975-989. [PMID: 38184202 DOI: 10.1016/j.jtha.2023.12.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/29/2023] [Accepted: 12/29/2023] [Indexed: 01/08/2024]
Abstract
BACKGROUND The disease-causing effects of genetic variations often depend on their location within a gene. Exonic changes generally lead to alterations in protein production, secretion, activity, or clearance. However, owing to the overlap between proteins and splicing codes, missense variants can also affect messenger RNA splicing, thus adding a layer of complexity and influencing disease phenotypes. OBJECTIVES To extensively characterize a panel of 13 exonic variants in the F9 gene occurring at 6 different factor IX positions and associated with varying severities of hemophilia B (HB). METHODS Computational predictions, splicing analysis, and recombinant factor IX assays were exploited to characterize F9 variants. RESULTS We demonstrated that 5 (38%) of 13 selected F9 exonic variants have pleiotropic effects. Although bioinformatic approaches accurately classified effects, extensive experimental assays were required to elucidate and deepen the molecular mechanisms underlying the pleiotropic effects. Importantly, their characterization was instrumental in developing tailored RNA therapeutics based on engineered U7 small nuclear RNA to mask cryptic splice sites and compensatory U1 small nuclear RNA to enhance exon definition. CONCLUSION Overall, albeit a multitool bioinformatic approach suggested the molecular effects of multiple HB variants, the deep investigation of molecular mechanisms revealed insights into the HB phenotype-genotype relationship, enabling accurate classification of HB variants. Importantly, knowledge of molecular mechanisms allowed the development of tailored RNA therapeutics, which can also be translated to other genetic diseases.
Collapse
|
15
|
Waldock WJ, Taylor LJ, Sperring S, Staurenghi F, Martinez-Fernandez de la Camara C, Whitfield J, Clouston P, Yusuf IH, MacLaren RE. A hypomorphic variant of choroideremia is associated with a novel intronic mutation that leads to exon skipping. Ophthalmic Genet 2024; 45:210-217. [PMID: 38273808 DOI: 10.1080/13816810.2023.2270554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 10/09/2023] [Indexed: 01/27/2024]
Abstract
INTRODUCTION Molecular confirmation of pathogenic sequence variants in the CHM gene is required prior to enrolment in retinal gene therapy clinical trials for choroideremia. Individuals with mild choroideremia have been reported. The molecular basis of genotype-phenotype associations is of clinical relevance since it may impact on selection for retinal gene therapy. METHODS AND MATERIALS Genetic testing and RNA analysis were undertaken in a patient with mild choroideremia to confirm the pathogenicity of a novel intronic variant in CHM and to explore the mechanism underlying the mild clinical phenotype. RESULTS A 42-year-old male presented with visual field loss. Fundoscopy and autofluorescence imaging demonstrated mild choroideremia for his age. Genetic analysis revealed a variant at a splice acceptor site in the CHM gene (c.1350-3C > G). RNA analysis demonstrated two out-of-frame transcripts, suggesting pathogenicity, without any detectable wildtype transcripts. One of the two out-of-frame transcripts is present in very low levels in healthy controls. DISCUSSION Mild choroideremia may result from +3 or -3 splice site variants in CHM. It is presumed that the resulting mRNA transcripts may be partly functional, thereby preventing the development of the null phenotype. Choroideremia patients with such variants may present challenges for gene therapy since there may be residual transcript activity which could result in long-lasting visual function which is atypical for this disease.
Collapse
|
16
|
Bai R, Yuan M, Zhang P, Luo T, Shi Y, Wan R. Structural basis of U12-type intron engagement by the fully assembled human minor spliceosome. Science 2024; 383:1245-1252. [PMID: 38484052 DOI: 10.1126/science.adn7272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Accepted: 02/09/2024] [Indexed: 03/19/2024]
Abstract
The minor spliceosome, which is responsible for the splicing of U12-type introns, comprises five small nuclear RNAs (snRNAs), of which only one is shared with the major spliceosome. In this work, we report the 3.3-angstrom cryo-electron microscopy structure of the fully assembled human minor spliceosome pre-B complex. The atomic model includes U11 small nuclear ribonucleoprotein (snRNP), U12 snRNP, and U4atac/U6atac.U5 tri-snRNP. U11 snRNA is recognized by five U11-specific proteins (20K, 25K, 35K, 48K, and 59K) and the heptameric Sm ring. The 3' half of the 5'-splice site forms a duplex with U11 snRNA; the 5' half is recognized by U11-35K, U11-48K, and U11 snRNA. Two proteins, CENATAC and DIM2/TXNL4B, specifically associate with the minor tri-snRNP. A structural analysis uncovered how two conformationally similar tri-snRNPs are differentiated by the minor and major prespliceosomes for assembly.
Collapse
|
17
|
Zhang Z, Kumar V, Dybkov O, Will CL, Urlaub H, Stark H, Lührmann R. Cryo-EM analyses of dimerized spliceosomes provide new insights into the functions of B complex proteins. EMBO J 2024; 43:1065-1088. [PMID: 38383864 PMCID: PMC10943123 DOI: 10.1038/s44318-024-00052-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 01/25/2024] [Accepted: 01/26/2024] [Indexed: 02/23/2024] Open
Abstract
The B complex is a key intermediate stage of spliceosome assembly. To improve the structural resolution of monomeric, human spliceosomal B (hB) complexes and thereby generate a more comprehensive hB molecular model, we determined the cryo-EM structure of B complex dimers formed in the presence of ATP γ S. The enhanced resolution of these complexes allows a finer molecular dissection of how the 5' splice site (5'ss) is recognized in hB, and new insights into molecular interactions of FBP21, SNU23 and PRP38 with the U6/5'ss helix and with each other. It also reveals that SMU1 and RED are present as a heterotetrameric complex and are located at the interface of the B dimer protomers. We further show that MFAP1 and UBL5 form a 5' exon binding channel in hB, and elucidate the molecular contacts stabilizing the 5' exon at this stage. Our studies thus yield more accurate models of protein and RNA components of hB complexes. They further allow the localization of additional proteins and protein domains (such as SF3B6, BUD31 and TCERG1) whose position was not previously known, thereby uncovering new functions for B-specific and other hB proteins during pre-mRNA splicing.
Collapse
|
18
|
Lai H, Lyu M, Ruan H, Liu Y, Liu T, Lei S, Xiao Y, Zhang S, Ying B. Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis. Comput Biol Med 2024; 171:108187. [PMID: 38402840 DOI: 10.1016/j.compbiomed.2024.108187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 02/07/2024] [Accepted: 02/18/2024] [Indexed: 02/27/2024]
Abstract
BACKGROUND Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. METHOD Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance. RESULTS A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5' splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3' splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set. CONCLUSION GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research.
Collapse
|
19
|
Li C, Zhang R, Pan F, Xin Q, Shi X, Guo W, Qiao D, Wang Z, Zhang Y, Liu X, Zhang Y, Shao L. Functional analysis of the CTNS gene exonic variants predicted to affect splicing. Clin Genet 2024; 105:323-328. [PMID: 38009794 DOI: 10.1111/cge.14460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/26/2023] [Accepted: 11/15/2023] [Indexed: 11/29/2023]
Abstract
Cystinosis is a severe, monogenic systemic disease caused by variants in CTNS gene. Currently, there is growing evidence that exonic variants in many diseases can affect pre-mRNA splicing. The impact of CTNS gene exonic variants on splicing regulation may be underestimated due to the lack of routine studies at the RNA level. Here, we analyzed 59 exonic variants in the CTNS gene using bioinformatics tools and identified candidate variants that may induce splicing alterations by minigene assays. We identified six exonic variants that induce splicing alterations by disrupting the ratio of exonic splicing enhancers/exonic splicing silencers (ESEs/ESSs) or by interfering with the recognition of classical splice sites, or both. Our results help in the correct molecular characterization of variants in cystinosis and inform emerging therapies. Furthermore, our work suggests that the combination of in silico and in vitro assays facilitates to assess the effects of DNA variants driving rare genetic diseases on splicing regulation and will enhance the clinical utility of variant functional annotation.
Collapse
|
20
|
Waye JS, Hanna M, Nakamura L, Walker L, Eng B, Nfonsam LE. Splice Acceptor Mutation [ HBB:c.93-2A > T] in a Patient with Hb S/β 0-Thalassemia. Hemoglobin 2024; 48:116-117. [PMID: 38360540 DOI: 10.1080/03630269.2024.2314075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 01/24/2024] [Indexed: 02/17/2024]
Abstract
We report a case of Hb S/β0-thalassemia (Hb S/β0-thal) in a patient who is a compound heterozygote for the Hb Sickle mutation (HBB:c.20A > T) and a mutation of the canonical splice acceptor sequence of IVS1 (AG > TG, HBB:c.93-2A > T). This is the fifth mutation involving the AG splice acceptor site of IVS1, all of which prevent normal splicing and cause β0-thal.
Collapse
|
21
|
Ishigami Y, Wong MS, Martí-Gómez C, Ayaz A, Kooshkbaghi M, Hanson SM, McCandlish DM, Krainer AR, Kinney JB. Specificity, synergy, and mechanisms of splice-modifying drugs. Nat Commun 2024; 15:1880. [PMID: 38424098 PMCID: PMC10904865 DOI: 10.1038/s41467-024-46090-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 02/10/2024] [Indexed: 03/02/2024] Open
Abstract
Drugs that target pre-mRNA splicing hold great therapeutic potential, but the quantitative understanding of how these drugs work is limited. Here we introduce mechanistically interpretable quantitative models for the sequence-specific and concentration-dependent behavior of splice-modifying drugs. Using massively parallel splicing assays, RNA-seq experiments, and precision dose-response curves, we obtain quantitative models for two small-molecule drugs, risdiplam and branaplam, developed for treating spinal muscular atrophy. The results quantitatively characterize the specificities of risdiplam and branaplam for 5' splice site sequences, suggest that branaplam recognizes 5' splice sites via two distinct interaction modes, and contradict the prevailing two-site hypothesis for risdiplam activity at SMN2 exon 7. The results also show that anomalous single-drug cooperativity, as well as multi-drug synergy, are widespread among small-molecule drugs and antisense-oligonucleotide drugs that promote exon inclusion. Our quantitative models thus clarify the mechanisms of existing treatments and provide a basis for the rational development of new therapies.
Collapse
|
22
|
Kwon YS, Jin SW, Song H. Global analysis of binding sites of U2AF1 and ZRSR2 reveals RNA elements required for mutually exclusive splicing by the U2- and U12-type spliceosome. Nucleic Acids Res 2024; 52:1420-1434. [PMID: 38088204 PMCID: PMC10853781 DOI: 10.1093/nar/gkad1180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 11/18/2023] [Accepted: 12/05/2023] [Indexed: 02/10/2024] Open
Abstract
Recurring mutations in genes encoding 3' splice-site recognition proteins, U2AF1 and ZRSR2 are associated with human cancers. Here, we determined binding sites of the proteins to reveal that U2-type and U12-type splice sites are recognized by U2AF1 and ZRSR2, respectively. However, some sites are spliced by both the U2-type and U12-type spliceosomes, indicating that well-conserved consensus motifs in some U12-type introns could be recognized by the U2-type spliceosome. Nucleotides flanking splice sites of U12-type introns are different from those flanking U2-type introns. Remarkably, the AG dinucleotide at the positions -1 and -2 of 5' splice sites of U12-type introns with GT-AG termini is not present. AG next to 5' splice site introduced by a single nucleotide substitution at the -2 position could convert a U12-type splice site to a U2-type site. The class switch of introns by a single mutation and the bias against G at the -1 position of U12-type 5' splice site support the notion that the identities of nucleotides in exonic regions adjacent to splice sites are fine-tuned to avoid recognition by the U2-type spliceosome. These findings may shed light on the mechanism of selectivity in U12-type intron splicing and the mutations that affect splicing.
Collapse
|
23
|
Bakhtiar D, Vondraskova K, Pengelly RJ, Chivers M, Kralovicova J, Vorechovsky I. Exonic splicing code and coordination of divalent metals in proteins. Nucleic Acids Res 2024; 52:1090-1106. [PMID: 38055834 PMCID: PMC10853796 DOI: 10.1093/nar/gkad1161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 11/15/2023] [Accepted: 11/17/2023] [Indexed: 12/08/2023] Open
Abstract
Exonic sequences contain both protein-coding and RNA splicing information but the interplay of the protein and splicing code is complex and poorly understood. Here, we have studied traditional and auxiliary splicing codes of human exons that encode residues coordinating two essential divalent metals at the opposite ends of the Irving-Williams series, a universal order of relative stabilities of metal-organic complexes. We show that exons encoding Zn2+-coordinating amino acids are supported much less by the auxiliary splicing motifs than exons coordinating Ca2+. The handicap of the former is compensated by stronger splice sites and uridine-richer polypyrimidine tracts, except for position -3 relative to 3' splice junctions. However, both Ca2+ and Zn2+ exons exhibit close-to-constitutive splicing in multiple tissues, consistent with their critical importance for metalloprotein function and a relatively small fraction of expendable, alternatively spliced exons. These results indicate that constraints imposed by metal coordination spheres on RNA splicing have been efficiently overcome by the plasticity of exon-intron architecture to ensure adequate metalloprotein expression.
Collapse
|
24
|
Kawakami R, Hiraide T, Watanabe K, Miyamoto S, Hira K, Komatsu K, Ishigaki H, Sakaguchi K, Maekawa M, Yamashita K, Fukuda T, Miyairi I, Ogata T, Saitsu H. RNA sequencing and target long-read sequencing reveal an intronic transposon insertion causing aberrant splicing. J Hum Genet 2024; 69:91-99. [PMID: 38102195 DOI: 10.1038/s10038-023-01211-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 11/28/2023] [Accepted: 12/01/2023] [Indexed: 12/17/2023]
Abstract
More than half of cases with suspected genetic disorders remain unsolved by genetic analysis using short-read sequencing such as exome sequencing (ES) and genome sequencing (GS). RNA sequencing (RNA-seq) and long-read sequencing (LRS) are useful for interpretation of candidate variants and detection of structural variants containing repeat sequences, respectively. Recently, adaptive sampling on nanopore sequencers enables target LRS more easily. Here, we present a Japanese girl with premature chromatid separation (PCS)/mosaic variegated aneuploidy (MVA) syndrome. ES detected a known pathogenic maternal heterozygous variant (c.1402-5A>G) in intron 10 of BUB1B (NM_001211.6), a known responsive gene for PCS/MVA syndrome with autosomal recessive inheritance. Minigene splicing assay revealed that almost all transcripts from the c.1402-5G allele have mis-splicing with 4-bp insertion. GS could not detect another pathogenic variant, while RNA-seq revealed abnormal reads in intron 2. To extensively explore variants in intron 2, we performed adaptive sampling and identified a paternal 3.0 kb insertion. Consensus sequence of 16 reads spanning the insertion showed that the insertion consists of Alu and SVA elements. Realignment of RNA-seq reads to the new reference sequence containing the insertion revealed that 16 reads have 5' splice site within the insertion and 3' splice site at exon 3, demonstrating causal relationship between the insertion and aberrant splicing. In addition, immunoblotting showed severely diminished BUB1B protein level in patient derived cells. These data suggest that detection of transcriptomic abnormalities by RNA-seq can be a clue for identifying pathogenic variants, and determination of insert sequences is one of merits of LRS.
Collapse
|
25
|
Duan C, Mooney T, Buerer L, Bowers C, Rong S, Kim SW, Fredericks AM, Monaghan SF, Fairbrother WG. The unusual gene architecture of polyubiquitin is created by dual-specific splice sites. Genome Biol 2024; 25:33. [PMID: 38268025 PMCID: PMC10809524 DOI: 10.1186/s13059-023-03157-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 12/21/2023] [Indexed: 01/26/2024] Open
Abstract
BACKGROUND The removal of introns occurs through the splicing of a 5' splice site (5'ss) with a 3' splice site (3'ss). These two elements are recognized by distinct components of the spliceosome. However, introns in higher eukaryotes contain many matches to the 5' and 3' splice-site motifs that are presumed not to be used. RESULTS Here, we find that many of these sites can be used. We also find occurrences of the AGGT motif that can function as either a 5'ss or a 3'ss-previously referred to as dual-specific splice sites (DSSs)-within introns. Analysis of the Sequence Read Archive reveals a 3.1-fold enrichment of DSSs relative to expectation, implying synergy between the ability to function as a 5'ss and 3'ss. Despite this suggested mechanistic advantage, DSSs are 2.7- and 4.7-fold underrepresented in annotated 5' and 3' splice sites. A curious exception is the polyubiquitin gene UBC, which contains a tandem array of DSSs that precisely delimit the boundary of each ubiquitin monomer. The resulting isoforms splice stochastically to include a variable number of ubiquitin monomers. We found no evidence of tissue-specific or feedback regulation but note the 8.4-fold enrichment of DSS-spliced introns in tandem repeat genes suggests a driving role in the evolution of genes like UBC. CONCLUSIONS We find an excess of unannotated splice sites and the utilization of DSSs in tandem repeats supports the role of splicing in gene evolution. These findings enhance our understanding of the diverse and complex nature of the splicing process.
Collapse
|