<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2020.00109</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Expression Changes Confirm Genomic Variants Predicted to Result in Allele-Specific, Alternative mRNA Splicing</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Mucaki</surname><given-names>Eliseos J.</given-names>
</name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/840173"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shirley</surname><given-names>Ben C.</given-names>
</name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Rogan</surname><given-names>Peter K.</given-names>
</name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>*</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/808820"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Biochemistry, University of Western Ontario</institution>, <addr-line>London, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff2"><sup>2</sup><institution>CytoGnomix</institution>, <addr-line>London, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Oncology University of Western Ontario</institution>, <addr-line>London, ON</addr-line>, <country>Canada</country></aff>
<aff id="aff4"><sup>4</sup><institution>Department of Computer Science, University of Western Ontario</institution>, <addr-line>London, ON</addr-line>, <country>Canada</country></aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Emanuele Buratti, International Centre for Genetic Engineering and Biotechnology, Italy</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Stefan Stamm, University of Kentucky, United States; Lucie Grodeck&#xe1;, Center of Cardiovascular and Transplant Surgery, Czechia</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Peter K. Rogan, <email xlink:href="mailto:progan@uwo.ca">progan@uwo.ca</email></p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to RNA, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>03</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>11</volume>
<elocation-id>109</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>09</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>01</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2020 Mucaki, Shirley and Rogan</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>Mucaki, Shirley and Rogan</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#xa0;terms.</p>
</license>
</permissions>
<abstract>
<p>Splice isoform structure and abundance can be affected by either noncoding or masquerading coding variants that alter the structure or abundance of transcripts. When these variants are common in the population, these nonconstitutive transcripts are sufficiently frequent so as to resemble naturally occurring, alternative mRNA splicing. Prediction of the effects of such variants has been shown to be accurate using information theory-based methods. Single nucleotide polymorphisms (SNPs) predicted to significantly alter natural and/or cryptic splice site strength were shown to affect gene expression. Splicing changes for known SNP genotypes were confirmed in HapMap lymphoblastoid cell lines with gene expression microarrays and custom designed q-RT-PCR or TaqMan assays. The majority of these SNPs (15 of 22) as well as an independent set of 24 variants were then subjected to RNAseq analysis using the ValidSpliceMut web beacon (<ext-link ext-link-type="uri" xlink:href="http://validsplicemut.cytognomix.com">http://validsplicemut.cytognomix.com</ext-link>), which is based on data from the Cancer Genome Atlas and International Cancer Genome Consortium. SNPs from different genes analyzed with gene expression microarray and q-RT-PCR exhibited significant changes in affected splice site use. Thirteen SNPs directly affected exon inclusion and 10 altered cryptic site use. Homozygous SNP genotypes resulting in stronger splice sites exhibited higher levels of processed mRNA than alleles associated with weaker sites. Four SNPs exhibited variable expression among individuals with the same genotypes, masking statistically significant expression differences between alleles. Genome-wide information theory and expression analyses (RNAseq) in tumor exomes and genomes confirmed splicing effects for 7 of the HapMap SNP and 14 SNPs identified from tumor genomes. q-RT-PCR resolved rare splice isoforms with read abundance too low for statistical significance in ValidSpliceMut. Nevertheless, the web-beacon provides evidence of unanticipated splicing outcomes, for example, intron retention due to compromised recognition of constitutive splice sites. Thus, ValidSpliceMut and q-RT-PCR represent complementary resources for identification of allele-specific, alternative splicing.</p>
</abstract>
<kwd-group>
<kwd>allele-specific gene expression</kwd>
<kwd>mRNA splicing</kwd>
<kwd>single nucleotide polymorphism</kwd>
<kwd>mutation</kwd>
<kwd>cryptic splicing</kwd>
<kwd>intron retention</kwd>
<kwd>alternative splicing</kwd>
<kwd>information theory</kwd>
</kwd-group>
<contract-num rid="cn001">RGPIN-2015-06290</contract-num>
<contract-sponsor id="cn001">Natural Sciences and Engineering Research Council of Canada<named-content content-type="fundref-id">10.13039/501100000038</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Canada Research Chairs<named-content content-type="fundref-id">10.13039/501100001804</named-content>
</contract-sponsor>
<counts>
<fig-count count="3"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="104"/>
<page-count count="16"/>
<word-count count="10119"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<title>Introduction</title>
<p>Accurate and comprehensive methods are needed for predicting impact of noncoding mutations, in particular, mRNA splicing defects, which are prevalent in genetic disease (<xref ref-type="bibr" rid="B40">Krawczak et al., 1992</xref>; <xref ref-type="bibr" rid="B86">Teraoka et al., 1999</xref>; <xref ref-type="bibr" rid="B1">Ars et al., 2003</xref>; <xref ref-type="bibr" rid="B79">Spielmann and Mundlos, 2016</xref>; <xref ref-type="bibr" rid="B29">Gloss and Dinger, 2018</xref>). This class of mutations may account for as much as 62% of point mutations (<xref ref-type="bibr" rid="B41">L&#xf3;pez-Bigas et al., 2005</xref>). Large transcriptome studies have suggested that a large fraction of genome-wide association studies (GWAS) signals for disease and complex traits are due to single nucleotide polymorphisms (SNPs) affecting mRNA splicing (<xref ref-type="bibr" rid="B63">Park et al., 2018</xref>). ValidSpliceMut (<xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>) presents evidence of altered splicing (<xref ref-type="bibr" rid="B91">Viner et al., 2014</xref>; <xref ref-type="bibr" rid="B22">Dorman et al., 2014</xref>) for 309,848 validated genome splice-variant predictions (<xref ref-type="bibr" rid="B76">Shirley et al., 2013</xref>). The majority of mutations were associated with exon skipping, cryptic site use, or intron retention, and in these cases ValidSpliceMut assigns a molecular phenotype classification to all variants as either aberrant, likely aberrant or inducing alternative isoforms.</p>
<p>While allele-specific alternative splicing can predispose for disease susceptibility (<xref ref-type="bibr" rid="B63">Park et al., 2018</xref>), these genetic variations also are associated with common phenotypic variability in populations (<xref ref-type="bibr" rid="B34">Hull et al., 2007</xref>). <xref ref-type="bibr" rid="B78">Soemedi et al. (2017)</xref> determined that 10% of a set of published disease-causing exonic mutations (N = 4,964) altered splicing. Their analysis of a control set of exonic SNPs common among those without disease phenotypes revealed a smaller proportion (3%) that altered splicing (N = 228). However, we recently showed that splice-altering, common SNPs are considerably more abundant in tumor genomes in the ValidSpliceMut web-beacon (<uri xlink:href="http://validsplicemut.cytognomix.com">http://validsplicemut.cytognomix.com</uri>; <xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>). Variants with higher germline population frequencies which impact splicing are less likely than rare mutations with direct splicing effects to be involved in Mendelian diseases or cancer. The present study analyzes predicted splice-altering polymorphic variants in genotyped lymphoblastoid cell lines by q-RT-PCR, expression microarrays of samples of known SNP genotypes, and high throughput expression data corresponding to sequenced tumor exomes and genomes. The relatively high frequencies of these variants enable comparisons of expressed transcripts in multiple individuals and genotypes. Effects of these SNPs are confirmed by multiple methods, although the supporting evidence from these distinct approaches is often complementary, rather than entirely concordant.</p>
<p>An estimated 90% to 95% of all multiexon genes are alternatively spliced (<xref ref-type="bibr" rid="B62">Pan et al., 2008</xref>; <xref ref-type="bibr" rid="B96">Wang et al., 2008</xref>; <xref ref-type="bibr" rid="B2">Baralle and Giudice, 2017</xref>). The selection of splicing signals involves exon and intron sequences, complementarity with snRNAs, RNA secondary structure, and competition between spliceosomal recognition sites (<xref ref-type="bibr" rid="B54">Moore and Sharp, 1993</xref>; <xref ref-type="bibr" rid="B3">Berget, 1995</xref>; <xref ref-type="bibr" rid="B63">Park et al., 2018</xref>). U1 snRNP interacts with the donor (or 5') splice site (<xref ref-type="bibr" rid="B103">Zhuang and Weiner, 1986</xref>; <xref ref-type="bibr" rid="B74">S&#xe9;raphin et al., 1988</xref>) and U2 (and U6) snRNP with the acceptor and branch sites of pre-mRNA (<xref ref-type="bibr" rid="B64">Parker et al., 1987</xref>; <xref ref-type="bibr" rid="B98">Wu and Manley, 1989</xref>). The majority of human splice donors (5') and acceptors (3') base pair with the U1 and U2 RNAs in spliceosomes, but are generally not precisely complementary to these sequences (<xref ref-type="bibr" rid="B73">Rogan et al., 2003</xref>). Additional exonic and intronic cis-regulatory elements can promote or suppress splice site recognition through recruitment of trans-acting splicing factors. SR proteins are positive trans-acting splicing factors which contain RNA-recognition motifs (RRM) and a carboxy-terminal domain enriched in Arg/Ser dipeptides (SR domain; <xref ref-type="bibr" rid="B5">Birney et al., 1993</xref>). Binding of RRMs in pre-mRNA enhances exon recognition by promoting interactions with spliceosomal and other proteins (<xref ref-type="bibr" rid="B25">Fu and Maniatis, 1992</xref>). SR proteins function in splice site communication by forming an intron bridge needed for exon recognition (<xref ref-type="bibr" rid="B104">Zuo and Maniatis, 1996</xref>). Factors that negatively impact splicing include heterogeneous nuclear ribonucleoproteins (hnRNPs; <xref ref-type="bibr" rid="B51">Martinez-Contreras et al., 2007</xref>).</p>
<p>Splicing mutations affect normal exon recognition by altering the strengths of natural donor or acceptor sites and proximate cryptic sites, either independently or simultaneously. Weakened splice sites reduce of kinetics of mRNA processing, leading to an overall decrease in full length transcripts, increased exon skipping, cryptic splice site activation within exons or within adjacent introns, intron retention, and inclusion of cryptic, pseudo-exons (<xref ref-type="bibr" rid="B85">Talerico and Berget, 1990</xref>; <xref ref-type="bibr" rid="B11">Carothers et al., 1993</xref>; <xref ref-type="bibr" rid="B7">Buratti et al., 2006</xref>; <xref ref-type="bibr" rid="B63">Park et al., 2018</xref>). The kinetics of splicing at weaker cryptic sites is also slower than at natural sites (<xref ref-type="bibr" rid="B21">Domenjoud et al., 1993</xref>). Mutations strengthen cryptic sites either by increasing resemblance to &#x201c;consensus sequences&#x201d; (<xref ref-type="bibr" rid="B58">Nelson and Green, 1990</xref>) or by modulating the levels of SR proteins contributing to splice site recognition (<xref ref-type="bibr" rid="B52">Mayeda and Krainer, 1992</xref>; <xref ref-type="bibr" rid="B8">C&#xe1;ceres et al., 1994</xref>). Mutations affecting splicing regulatory elements (<xref ref-type="bibr" rid="B19">Dietz et al., 1993</xref>; <xref ref-type="bibr" rid="B67">Richard and Beckmann, 1995</xref>) disrupt trans-acting SR protein interactions (<xref ref-type="bibr" rid="B80">Staknis and Reed, 1994</xref>) with distinct exonic and intronic cis-regulatory elements (<xref ref-type="bibr" rid="B6">Black, 2003</xref>).</p>
<p>Information theory-based (IT-based) models of donor and acceptor mRNA splice sites reveal the effects of changes in strengths of individual sites (termed <italic>R<sub>i</sub></italic>; <xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>; <xref ref-type="bibr" rid="B73">Rogan et al., 2003</xref>). This facilitates prediction of phenotypic severity (<xref ref-type="bibr" rid="B71">Rogan and Schneider, 1995</xref>; <xref ref-type="bibr" rid="B93">von Kodolitsch et al., 1999</xref>; <xref ref-type="bibr" rid="B94">von Kodolitsch et al., 2006</xref>). The effects of splicing mutations can be predicted <italic>in silico</italic> by information theory (<xref ref-type="bibr" rid="B71">Rogan and Schneider, 1995</xref>; <xref ref-type="bibr" rid="B36">Kannabiran et al., 1998</xref>; <xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>; <xref ref-type="bibr" rid="B83">Svojanovsky et al., 2000</xref>; <xref ref-type="bibr" rid="B73">Rogan et al., 2003</xref>; <xref ref-type="bibr" rid="B9">Caminsky et al., 2014</xref>; <xref ref-type="bibr" rid="B22">Dorman et al., 2014</xref>; <xref ref-type="bibr" rid="B91">Viner et al., 2014</xref>; <xref ref-type="bibr" rid="B10">Caminsky et al., 2016</xref>; <xref ref-type="bibr" rid="B56">Mucaki et al., 2016</xref>; <xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>), and these predictions can be confirmed by <italic>in vitro</italic> experimental studies (<xref ref-type="bibr" rid="B92">Vockley et al., 2000</xref>; <xref ref-type="bibr" rid="B43">Lamba et al., 2003</xref>; <xref ref-type="bibr" rid="B73">Rogan et al., 2003</xref>; <xref ref-type="bibr" rid="B38">Khan et al., 2004</xref>; <xref ref-type="bibr" rid="B82">Susani et al., 2004</xref>; <xref ref-type="bibr" rid="B32">Hobson et al., 2006</xref>; <xref ref-type="bibr" rid="B13">Caux-Moncoutier et al., 2009</xref>; <xref ref-type="bibr" rid="B61">Olsen et al., 2014</xref>; <xref ref-type="bibr" rid="B90">Vemula et al., 2014</xref>; <xref ref-type="bibr" rid="B65">Peterlongo et al., 2015</xref>). Strengths of one or more splice sites may be altered and, in some instances, concomitant with amino acid changes in coding sequences (<xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>; <xref ref-type="bibr" rid="B65">Peterlongo et al., 2015</xref>). Information analysis has been a successful approach for recognizing nondeleterious, sometimes polymorphic variants (<xref ref-type="bibr" rid="B71">Rogan and Schneider, 1995</xref>; <xref ref-type="bibr" rid="B17">Colombo et al., 2013</xref>), and for distinguishing of milder from severe mutations (<xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>; <xref ref-type="bibr" rid="B93">von Kodolitsch et al., 1999</xref>; <xref ref-type="bibr" rid="B42">Lacroix et al., 2012</xref>).</p>
<p>Predicting the relative abundance of various transcripts by information analysis requires integration of the contributions of all pertinent cis-acting regulatory elements. We have applied quantitative methods to prioritize inferences as to which SNPs impact gene expression levels and transcript structure. Effects of mutations on combinations of splicing signals reveal changes in isoform structure and abundance (<xref ref-type="bibr" rid="B55">Mucaki et al., 2013</xref>; <xref ref-type="bibr" rid="B9">Caminsky et al., 2014</xref>). Multisite information theory-based models have also been used to detect and analyze SNP effects on cis-acting promoter modules that contribute to establishing transcript levels (<xref ref-type="bibr" rid="B4">Bi and Rogan, 2004</xref>; <xref ref-type="bibr" rid="B95">Vyhlidal et al., 2004</xref>; <xref ref-type="bibr" rid="B49">Lu et al., 2017</xref>; <xref ref-type="bibr" rid="B48">Lu and Rogan, 2019</xref>).</p>
<p>The robustness of this approach for predicting rare, deleterious splicing mutations justifies efforts to identify common SNPs that impact mRNA splicing. We previously described SNPs from dbSNP that affect splicing (<xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>; <xref ref-type="bibr" rid="B57">Nalla and Rogan, 2005</xref>). Here, we explicitly predict and validate SNPs that influence mRNA structure and levels of expression of the genes containing them in immortalized lymphoblastoid cell lines and tumors. Since constitutive splicing mutations can arise at other locations within pre-mRNA sequences that elicit cryptic splicing, we examined whether more common genomic polymorphisms might frequently affect the abundance and structure of splice isoforms.</p>
</sec>
<sec id="s2">
<title>Methods</title>
<sec id="s2_1">
<title>Information Analysis</title>
<p>The protein-nucleic acid interactions intrinsic to splicing can be analyzed using information theory, which comprehensively and quantitatively models functional sequence variation based on a thermodynamic framework (<xref ref-type="bibr" rid="B75">Schneider, 1997</xref>). Donor and acceptor splice site strength can be predicted by the use of IT-based weight matrices derived from known functional sites (<xref ref-type="bibr" rid="B73">Rogan et al., 2003</xref>). The Automated Splice Site and Exon Definition server (ASSEDA) is an online resource based on the hg19 coordinate system to determine splice site information changes associated with genetic diseases (<xref ref-type="bibr" rid="B55">Mucaki et al., 2013</xref>). ASSEDA is now part of the MutationForecaster (<uri xlink:href="http://www.mutationforecaster.com">http://www.mutationforecaster.com</uri>) variant interpretation system.</p>
</sec>
<sec id="s2_2">
<title>Creation of Exon Array Database</title>
<p>Exon-level microarrays have been used to compare abnormal expression for different cellular states, which can then be confirmed by q-RT-PCR (<xref ref-type="bibr" rid="B87">Thorsen et al., 2008</xref>). We hypothesized that the predicted effect of SNPs on expression of the proximate exon would correspond to the expression of exon microarray probes of genotyped individuals in the HapMap cohort. We used the dose-dependent expression of the minor allele to qualify SNPs for subsequent information analysis consistent with alterations of mRNA splicing. Additional SNPs predicted by information analysis were also tested for effects on splicing (<xref ref-type="bibr" rid="B57">Nalla and Rogan, 2005</xref>).</p>
<p>Expression data were normalized using the PLIER (Probe Logarithmic Intensity Error) method on Affymetrix Human Exon 1.0 ST microarray data for 176 genotyped HapMap cell lines (<xref ref-type="bibr" rid="B33">Huang et al., 2007</xref>, Gene Expression Omnibus accession no. GSE 7792; <xref ref-type="bibr" rid="B59">Nembaware et al., 2008</xref>). Microarray probes which overlap SNPs, that were subsequently removed, were identified by intersecting dbSNP129 with probe coordinates [obtained from X:MAP (<xref ref-type="bibr" rid="B100">Yates et al., 2008</xref>) using the Galaxy Browser (<xref ref-type="bibr" rid="B28">Giardine et al., 2005</xref>)]. A MySQL database containing the PLIER normalized intensities and CEU (Utah residents with Northern and Western European ancestry) and YRI (Yoruba in Ibadan) genotypes for Phase I+II HapMap SNPs was created. Tables were derived to link SNPs to their nearest like-stranded probeset (to within 500 nt), and to associate probesets to the exons they may overlap (transcript and exon tables from Ensembl version 51). A MySQL query was used to create a table containing the splicing index (SI; intensity of a probeset divided by the overall gene intensity) of each probeset for each HapMap individual.</p>
<p>The database was queried to identify significant SI changes of an exonic probeset based on the genotype of a SNP the probeset was associated with (SNP within natural donor/acceptor region of exon). Probesets displaying a stepwise change in mean SI (where the mean SI of the heterozygous group is in between the mean SI values of the two homozygous groups) were identified using a different program script (criteria: the mean SI of homozygous rare and heterozygous groups are &lt; 90% of the homozygous common group). Splicing Index boxplots were created with R, where the x- and y-axis are genotype and SI, respectively (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1</bold></xref>). These boxplots analyze the effect a SNP has on a particular probeset across all individuals.</p>
<p>SNPs with effects on splicing were validated by q-RT-PCR of lymphoblastoid cell lines. Where available, results were also compared to abnormal splicing patterns present in RNAseq data from tumors carrying these same SNPs (in the ValidSpliceMut database; <xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>). SNPs predicted to exhibit nominal effects on splicing (&#x394;<italic>R<sub>i</sub></italic> &lt; 1 bit) were included to determine minimal detectable changes by q-RT-PCR.</p>
</sec>
<sec id="s2_3">
<title>Cell Culture and RNA Extraction</title>
<p>EBV-transformed lymphoblastoid cell lines of HapMap individuals with our SNPs of interest (homozygous common, heterozygous and homozygous rare when available) were ordered from the Coriell Cell Repositories (CEU: GM07000, GM07019, GM07022, GM07056, GM11992, GM11994, GM11995, GM12872; YRI: GM18855, GM18858, GM18859, GM18860, GM19092, GM19093, GM19094, GM19140, GM19159). Cells were grown in HyClone RPMI-1640 medium [15% FBS (HyClone), 1% L-Glutamine, and 1% Penicillin:streptomycin (Invitrogen); 37&#xb0;C, 5% CO<sub>2</sub>]. RNA was extracted with Trizol LS (Invitrogen) from 10<sup>6</sup> cells and treated with DNAase [20 mM MgCl<sub>2</sub> (Invitrogen), 2 mM DTT (Sigma-Aldrich), 0.4 U/&#x3bc;L RNasin (Promega), 10 &#xb5;g/ml DNase (Worthington Biochemical) in 1x TE buffer] at 37&#xb0;C for 15 min. The reaction was stopped with EDTA (0.05 M; 2.5% v/v), and heated to 65&#xb0;C for 20 min, followed by ethanol precipitation (resuspended in 0.1% v/v DEPC-treated 1x TE buffer). DNA was extracted using a Puregene Tissue Core Kit B (Qiagen).</p>
</sec>
<sec id="s2_4">
<title>Design of Real-Time Expression Assays</title>
<p>Sequences were obtained from UCSC and Ensembl. DNA primers used to amplify a known splice form, or one predicted by information analysis, were designed using Primer Express (ABI). DNA primers (<xref ref-type="supplementary-material" rid="SM2"><bold>Supplementary Table 1</bold></xref>) were obtained from IDT (Coralville, IA, USA), and dissolved to 200 uM. Primers were placed over junctions of interest to amplify a single splice form. T<sub>m</sub> ranged from 58&#xb0;C&#x2013;65&#xb0;C, and amplicon lengths varied from 69&#x2013;136 nt. BLASTn (Refseq_RNA database) was used to reduce possible cross-hybridization. Primers were designed to amplify the wildtype splice form, exon skipping (if a natural site is weakened), and cryptic site splice forms which were either previously reported (UCSC mRNA and EST tracks) or predicted by information analysis (where <italic>R<sub>i</sub></italic> cryptic site &#x2265; <italic>R<sub>i</sub></italic> weakened natural site).</p>
<p>Two types of reference amplicons were used to quantify allele specific splice forms. These consisted of intrinsic products derived from constitutively spliced exons with the same gene and external genes with high uniformity of expression among HapMap cell lines. Reference primers internal to the genes of interest were designed 1&#x2013;4 exons adjacent from the affected exon (exons without any evidence of variation in the UCSC Genome Browser; <xref ref-type="bibr" rid="B37">Kent et al., 2002</xref>), placed upstream of the SNP of interest whenever possible. Two advantages to including an internal reference in the q-RT-PCR experiment include: potential detection of changes in total mRNA levels; and account for inter-individual variation of expression.</p>
<p>External reference genes (excluding the SNP of interest) were chosen based on consistent PLIER intensities with low coefficients of variation in expression among all 176 HapMap individuals. The following external controls were selected: exon 39 of <italic>SI</italic> (PLIER intensity 11.4 &#xb1; 1.7), exon 9 of <italic>FRMPD1</italic> (22 &#xb1; 2.81), exon 46 of <italic>DNAH1</italic> (78.5 &#xb1; 9.54), exon 3 of <italic>CCDC137</italic> (224 &#xb1; 25), and exon 25 of <italic>VPS39</italic> (497 &#xb1; 76). The external reference chosen for an experiment was matched to the intensity of the probeset within the exon of interest. This decreased potential errors in &#x394;&#x394;C<sub>T</sub> values and proved to be accurate and reproducible for most genes.</p>
<p>To control for interindividual variation in expression, we compared expression in HapMap individuals based on their SNP genotypes and familial relatedness. Families with all three possible genotypes were available (homozygous common, rare, and heterozygous) for 12 of these SNPs (rs1805377, rs2243187, rs2070573, rs2835655, rs2835585, rs2072049, rs1893592, rs6003906, rs1018448, rs13076750, rs16802, and rs8130564). For those families in which all genotypes were not represented, samples from the same ethnic background (YRI or CEU populations) were compared for the missing genotype (N = 8; rs17002806, rs2266988, rs1333973, rs743920, rs2285141, rs2838010, rs10190751, rs16994182; individuals with homozygous common and rare genotypes were from the same families for the latter two SNPs). Two SNPs were tested using homozygous individuals from different ethnic backgrounds: rs3747107 (<italic>GUSBP11</italic>) and rs2252576 (<italic>BACE2</italic>). While the splicing impact of rs3747107 was clearly observable by q-RT-PCR, either background or data noise did impact the interpretation of effects of rs2252576.</p>
</sec>
<sec id="s2_5">
<title>PCR and Quantitative RT-PCR</title>
<p>M-MLV reverse transcriptase (Invitrogen) converted 1&#xb5;g of DNase-treated RNA to cDNA with 20 nt Oligo-dT (25&#xb5;g/ml; IDT) and rRNAsin (Promega). Precipitated cDNA was resuspended in water at 20 ng/&#xb5;l of original RNA concentration. All designed primer sets were tested with conventional PCR to ensure a single product at the expected size. PCR reactions were prepared with 1.0 M Betaine (Sigma-Aldrich), and were heated to 80&#xb0;C before adding Taq Polymerase (Invitrogen). Optimal T<sub>m</sub> for each primer set was determined to obtain maximum yield.</p>
<p>Quantitative PCR was performed with an Eppendorf Mastercycler ep Realplex 4, a Bio-Rad CFX96, as well as a Stratagene Mx3005P. SYBR Green assays were performed using the KAPA SYBR FAST qPCR kit (Kapa Biosystems) in 10 &#xb5;l reactions using 200 &#xb5;M of each primer and 24 ng total of cDNA per reaction. For some tests, SsoFast EvaGreen supermix (Bio-Rad) was used with 500 &#xb5;M of each primer instead.</p>
<p>When testing the effect of a SNP, amplification reactions with all primers designed to detect all relevant isoforms (as well as the gene internal reference and external reference) were run simultaneously, in triplicate. C<sub>t</sub> values obtained from these experiments were normalized against the same external reference using the Relative Expression Software Tool (REST; <uri xlink:href="http://www.gene-quantification.de/rest.html">http://www.gene-quantification.de/rest.html</uri>; <xref ref-type="bibr" rid="B66">Pfaffl et al., 2002</xref>).</p>
</sec>
<sec id="s2_6">
<title>Taqman Assay</title>
<p>Two dual-labeled Taqman probes were designed to detect the two splice forms of <italic>XRCC4</italic> (detecting alternative forms of exon 8 either with or without a 6 nt deletion at the 5' end). Probes were placed over the sequence junction of interest where variation would be near the probe middle (<xref ref-type="supplementary-material" rid="SM2"><bold>Supplementary Table 1</bold></xref>). The assay was performed on an ABI StepOne Real-Time PCR system using ABI Genotyping Master Mix. Experiment was run in 25 &#xb5;l reactions (300 nM each primer, 400nM probe [5'-FAM or TET fluorophore with a 3' Black Hole quencher; IDT], and 80 ng cDNA total). Probes were tested in separate reactions.</p>
</sec>
<sec id="s2_7">
<title>RNAseq Analyses</title>
<p>The previous analyses were extended to include 24 additional, common SNPs for their potential influence on splicing. All SNVs present in ICGC (International Cancer Genome Consortium) patients (<xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>) were evaluated by the Shannon Pipeline (SP; <xref ref-type="bibr" rid="B76">Shirley et al., 2013</xref>) to identify those altering splice site strength. Common SNPs (average heterozygosity &gt; 10% in dbSNP 150) predicted to decrease natural splice site strength by SP (where <italic>&#x394;R<sub>i</sub></italic> &lt; &#x2212;1 bit) were selected. ICGC patients carrying these flagged SNPs were identified, and the expression of the corresponding SNP-containing region in RNAseq was visualized with IGV (Integrated Genome Viewer; <uri xlink:href="https://igv.org">https://igv.org</uri>; <xref ref-type="bibr" rid="B69">Robinson et al., 2017</xref>). Similar RNAseq reads were grouped using IGV collapse and sort commands, which caused nonconstitutive spliced reads to cosegregate to the top of the viewing window. IGV images which did not meet our gene expression criteria (exon affected by the SNP must have &#x2265;5 RNAseq reads present) were eliminated. As this generated thousands of images, we report the analysis of two ICGC patients [DO47132 (Renal Cell Cancer) and DO52711 (Chronic Lymphocytic Leukemia)], chosen randomly, preselecting tissues to increase the likelihood of finding expression in these regions. Images were evaluated sequentially (in order of rsID value) and only concluded once the first 24 SNPs meeting these criteria were found. This type of analysis could not reveal a splicing event to be more abundant in these patients when compared to noncarriers. Nevertheless, splicing information changes resulting from SNPs corresponded to observed alternative and/or other novel splice isoforms. We then queried the ValidSpliceMut database for these SNPs, as abnormal splicing was only flagged in the database when the junction-read or read-abundance counts significantly exceeded corresponding evidence type in a large set of normal control samples (<xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>).</p>
</sec>
</sec>
<sec id="s3" sec-type="results">
<title>Results</title>
<sec id="s3_1">
<title>Selection of Candidate SNPs Affecting Splicing</title>
<p>A publicly available exon microarray dataset was initially used to locate exons affected by SNPs altering splice site strength. A change in the mean SI of a diagnostic probeset in individuals of differing genotypes at the same variant can suggest altered splicing. The increase or decrease in SI is related to the expected impact of the SNP on splicing. For example, an exonic probe which detects a normally spliced mRNA will have decreased SI in the event of skipping. Mean SI may be increased when a probe detects the use of an intronic cryptic splice site. SNPs with strong impact on splicing will distinguish mean SI levels of individuals homozygous for the major versus minor alleles (and with heterozygous genotypes).</p>
<p>There were 9,328 HapMap-annotated SNPs within donor/acceptor regions of known exons which contained at least one probeset. Of 987 SNPs that are associated to exonic probesets which differ in mean SI between the homozygous common and rare HapMap individuals, 573 caused a decrease in natural site <italic>R<sub>i</sub></italic> value. Inactivating and leaky splicing variants (reduction in information content where final <italic>R<sub>i</sub></italic> &#x2265; <italic>R<sub>i,minimum</sub></italic> [minimum functional splice site strength]) both exhibit reduced SI values and were similarly abundant. Thus, both severe and moderate splicing mutations with reduced penetrance and milder molecular phenotypes were detected, consistent with Mendelian disorders (<xref ref-type="bibr" rid="B93">von Kodolitsch et al., 1999</xref>; <xref ref-type="bibr" rid="B94">von Kodolitsch et al., 2006</xref>).</p>
<p>Of the SNPs associated with significant changes in <italic>R<sub>i</sub></italic> (termed &#x394;<italic>R<sub>i</sub></italic>), 9,328 occurred within the natural splice sites of exons detectable with microarray probesets. We initially focused on 21 SNPs on chromosome 21 (0.23% total, 18.8% of chr21) and 34 on chromosome 22 (0.36% of total, 14.5% of chr22) associated with stepwise decreases in probeset intensity at each genotype. Seven of the chr21 SNPs and nine of the chr22 SNPs caused information changes with either natural splice site &#x394;<italic>R<sub>i</sub></italic> &#x2265; 0.1 bits, or cryptic site(s) with an <italic>R<sub>i</sub></italic> value comparable to a neighbouring natural site, and in which mRNA or EST data supported use of the cryptic site. These SNPs included: rs2075276 [<italic>MGC16703</italic>], rs2838010 [<italic>FAM3B</italic>], rs3747107 [<italic>GUSBP11</italic>], rs2070573 [<italic>C21orf2</italic>], rs17002806 [<italic>WBP2NL</italic>], rs3950176 [<italic>EMID1</italic>], rs1018448 [<italic>ARFGAP3</italic>], rs6003906 [<italic>DERL3</italic>], rs2266988 [<italic>PRAME</italic>], rs2072049 [<italic>PRAME</italic>], rs2285141 [<italic>CYB5R3</italic>], rs2252576 [<italic>BACE2</italic>], rs16802 [<italic>BCR</italic>], rs17357592 [<italic>COL6A2</italic>], rs16994182 [<italic>CLDN14</italic>], and rs8130564 [<italic>TMPRSS3</italic>].</p>
<p>The minimum information change for detecting a splicing effect by expression microarray is constrained by several factors. Detection of splice isoforms can be limited by genomic probeset coverage, which cannot distinguish alternative splicing events in close proximity (see <xref ref-type="fig" rid="f1"><bold>Figure 1A</bold></xref>). Even where genotype-directed SI changes are very distinct, some individuals with the common allele have equivalent SI values to individuals with the rare allele [rs2070573 (<xref ref-type="fig" rid="f2"><bold>Figure 2</bold></xref>) and rs1333973 (<xref ref-type="fig" rid="f3"><bold>Figure 3</bold></xref>)]. In some cases, the number of individuals with a particular genotype is insufficient for statistical significance (rs2243187; <xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.4</bold></xref>). Although exon microarrays can be used to find potential alternate splicing and give support to our predictions, it became necessary to validate the microarray predictions by q-RT-PCR, TaqMan assays, and with RNAseq data from SNP carriers.</p>
<fig id="f1" position="float">
<label>Figure 1</label>
<caption>
<p>Splicing Impact of rs1805377 (<italic>XRCC4</italic>). The natural acceptor of <italic>XRCC4</italic> exon 8 is abolished by rs1805377 (11.5 &#x2212;&gt; 0.6 bits) while simultaneously strengthening a second exonic cryptic acceptor 6nt downstream (11.4 to 11.8 bits), resulting in a 6nt deletion in the mRNA. <bold>(A)</bold> Both of these acceptor sites have been validated in GenBank mRNAs, i.e., NM_022406 and NM_003401 (UCSC panel derived from <uri xlink:href="http://genome.ucsc.edu">http://genome.ucsc.edu</uri>). <bold>(B)</bold> The relative abundance of the two splice forms was determined by q-RT-PCR. The weaker rs1805377 A/A genotype (0.6 bit acceptor) was used ~47-fold less frequently than the cryptic downstream acceptor (11.8 bits). <bold>(C)</bold> The two splice isoforms cannot be distinguished by the exon microarray as the upstream probeset (ID 2818500) does not overlap the variable region, though the average expression of the rs1805377 A/A genotype is reduced. <bold>(D)</bold> ValidSpliceMut flagged this mutation for intron retention, which can be observed in the RNAseq of heterozygous ICGC patient DO27779 [Box 1]. Use of both acceptor sites is also evident [Box 2]. For more detail for this and all of the other single nucleotide polymorphisms (SNPs) analyzed, refer to <xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1</bold></xref>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fgene-11-00109-g001.tif"/>
</fig>
<fig id="f2" position="float">
<label>Figure 2</label>
<caption>
<p>Splicing Impact of rs2070573 (<italic>C21orf2</italic>). <bold>(A)</bold> The single nucleotide polymorphisms (SNP) rs2070573 is a common polymorphism which alters the first nucleotide of the extended form of <italic>C21orf2</italic> exon 6. <bold>(B)</bold> The donor site is strengthened by the presence of the C-allele (<italic>R<sub>i</sub></italic> 0.4 to 4.0 bits; A &gt; C) and its use extends the exon by 360 nt. Q-RT-PCR found a ~4-9-fold and ~17-23-fold increase in the extended exon 6 splice form in the A/C and C/C cell lines tested, respectively. <bold>(C)</bold> The exon microarray probeset which detects the extension (ID 3934488) shows a stepwise increase in SI with C-allele individuals which supports the q-RT-PCR result<bold>. (D)</bold> The variant was present in ValidSpliceMut, which associated the A-allele with an increase in total intron retention [six patients flagged for total intron retention read abundance; p=0.019 (average over all patients)]. This image displays sequence read distributions in the RNAseq data of TCGA BRCA patient, TCGA-BH-A0H0, who is heterozygous for rs2070573. The IGV panel indicates reads corresponding to total intron 6 retention [Box 1] and which extend beyond the constitutive donor splice site of exon 6 into the adjacent intron [Box 2]. All 4 reads which extend over the exon splice junction are derived from the G-allele (strong binding site; not all visible in panel <bold>D</bold>).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fgene-11-00109-g002.tif"/>
</fig>
<fig id="f3" position="float">
<label>Figure 3</label>
<caption>
<p>Splicing Impact of rs1333973 (<italic>IFI44L</italic>). <bold>(A)</bold> The natural donor of <italic>IFI44L</italic> exon 2 is weakened from 9.1 to 4.6 bits (A &gt; T) by rs1333973, increasing the frequency of exon 2 skipping and other events. <bold>(B)</bold> By q-RT-PCR, skipping was found to be 15.6-fold higher while normal splicing was 15.4-fold lower in A homozygotes (relative to T homozygotes). <bold>(C)</bold> Exon microarray data strongly supports the q-RT-PCR findings. <bold>(D)</bold> ICGC patient DO6354 is homozygous for the T-allele, which resulted in exon 2 skipping [Box 3], failure to recognize the exon 2 donor causing total retention of intron 2 [Box 1], and activation of an upstream exonic 2.4 bit cryptic donor 375 nt from the affected site [Box 2].</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fgene-11-00109-g003.tif"/>
</fig>
<p>We report q-RT-PCR validation studies for 13 of the 16 SNPs (q-RT-PCR primers could not be designed for rs16994182, rs2075276, and rs3950176), along with nine other candidate SNPs from our previous information theory-based analyses (<xref ref-type="bibr" rid="B57">Nalla and Rogan, 2005</xref>): rs1805377 [<italic>XRCC4</italic>], rs2243187 [<italic>IL19</italic>], rs2835585 [<italic>TTC3</italic>], rs2865655 [<italic>TTC3</italic>], rs1893592 [<italic>UBASH3A</italic>], rs743920 [<italic>EMID1</italic>], rs13076750 [<italic>LPP</italic>], rs1333973 [<italic>IFI44L</italic>], and rs10190751 [<italic>CFLAR</italic>].</p>
<p>After amplification of known and predicted splice forms (<xref ref-type="supplementary-material" rid="SM2"><bold>Supplementary Table 1</bold></xref>), 15 SNPs showed measurable changes in splicing consistent with information-theory predictions. Ten increase alternate splice site use (two of which increased strength of cryptic site, eight activated an unaffected pre-existing cryptic site), six affect exon inclusion (five increased exon skipping), three increased activation of an alternative exon, and four decreased overall expression levels. Altered splicing could not be validated for six SNPs, however experimental analyses of three of the five SNPs where &#x394;<italic>R<sub>i</sub></italic> &lt; 1 bit were hampered by high interindividual variability in expression.</p>
<p>Changes in splice site information were used to predict observed differences in splice isoform levels (<xref ref-type="table" rid="T1"><bold>Table 1</bold></xref>). <xref ref-type="fig" rid="f1"><bold>Figures 1</bold></xref>&#x2013;<xref ref-type="fig" rid="f3"><bold>3</bold></xref> and <xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1</bold></xref> indicate the experimentally-determined splicing effects for each SNP, a modified UCSC Genome Browser image of the relevant region, boxplots showing exon microarray expression levels of each allele for the relevant probesets, and an IGV image of the RNAseq results for an individual tumor carrying the SNP. Abundance of the aberrant splice forms measured by q-RT-PCR (relative to an internal gene reference) is indicated in <xref ref-type="table" rid="T2"><bold>Table 2</bold></xref>. Changes in predicted splice site strength were consistent with results measured by q-RT-PCR for 12 out of the 15 SNP (exceptions were rs2070573, rs17002806, and rs2835585). Variants predicted to reduce strength &#x2265; 100-fold were found to reduce expression by 38- to 58-fold, the variance falling within the margin of measurement error. Modest natural splice site affinity changes predicted to be &lt; eightfold (&#x394;R<sub>i</sub> &lt; 3.0) did not consistently result in detectable changes in splicing. In some instances, lower abundance splice forms were observed (i.e. rs2835585 altered exon skipping levels by up to 8.8-fold; nevertheless, the normal splice form predominated).</p>
<table-wrap id="T1" position="float">
<label>Table 1</label>
<caption>
<p>Summary of q-RT-PCR results.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" colspan="5">Summary of q-RT-PCR results</th>
<th valign="top" colspan="6">SNP effects (Increase/Decrease in fold change of homozygotes)</th>
<th valign="top" colspan="2">Additional Expression Evidence</th>
</tr>
<tr>
<th valign="top" align="left">Gene</th>
<th valign="top" align="left">
<xref ref-type="supplementary-material" rid="SM1">Supplementary Image 1 </xref> panel /<break/> rsID / HGVS notation</th>
<th valign="top" align="center">Splice type</th>
<th valign="top" align="center">Information Change (n&#x2013;natural, c&#x2013;cryptic site)</th>
<th valign="top" align="center">Fold change</th>
<th valign="top" align="center">Natural site</th>
<th valign="top" align="center">Cryptic site</th>
<th valign="top" align="center">Exon skipping</th>
<th valign="top" align="center">Alternate exon</th>
<th valign="top" align="center">Total mRNA</th>
<th valign="top" align="center">In-frame<sup>a</sup></th>
<th valign="top" align="center">Exon microarray</th>
<th valign="top" align="center">RNAseq (Valid-SpliceMut)</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" rowspan="2"><italic>XRCC4</italic></td>
<td valign="top" rowspan="2">S1.6/ rs1805377 /<break/>NM_003401:  c.894-7G&gt;A<break/>NM_022406:  c.894-1G&gt;A</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">11.5 (G) -&gt; 0.6 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">1886</td>
<td valign="top" align="center" style="color: #811517">38.4</td>
<td valign="top" align="center" style="color: #3c6945">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #3c6945">11.4 (G) -&gt; 11.8 (A) (n)</td>
<td valign="top" align="center">1.3</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">47.3<sup>c</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IL19</italic></td>
<td valign="top" align="left">S1.4/ rs2243187 / NM_013371: c.364-1G&gt;A</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">4.7 (G) -&gt; -6.2 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">26.3</td>
<td valign="top" align="center" style="color: #811517">1.8<sup>d</sup></td>
<td valign="top" align="center" style="color: #3c6945">1.8<sup>d</sup></td>
<td valign="top" align="center" style="color: #811517">2.1<sup>d</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>C21orf2</italic></td>
<td valign="top" align="left">S1.1/ rs2070573 / NM_004928: c.643-137A&gt;C</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #3c6945">0.4 (A) -&gt; 4.0 (C) (c)</td>
<td valign="top" align="center" style="color: #3c6945">12.1</td>
<td valign="top" align="center">NC</td>
<td valign="top" align="center" style="color: #3c6945">22.6</td>
<td valign="top" align="center"> &#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>TTC3</italic></td>
<td valign="top" align="left">S1.21/ rs2835655 / NM_003316: c.5115G&gt;A</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #811517">11.2 (G) -&gt; 8.2 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">8.1</td>
<td valign="top" align="center" style="color: #811517">1.5</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">1.3<sup>e</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">1.5</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>TTC3<sup>f</sup></italic></td>
<td valign="top" align="left">S1.5/ rs2835585 / NM_003316: c.188-8T&gt;A</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">7.6 (T) -&gt; 5.4 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">4.5</td>
<td valign="top" align="center">NC</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">8.8</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>WBP2NL</italic></td>
<td valign="top" align="left">S1.13/ rs17002806 / NM_152613: c.*86G&gt;A</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #811517">9.2 (G) -&gt; 6.0 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">9.1</td>
<td valign="top" align="center" style="color: #811517">23.8</td>
<td valign="top" align="center" style="color: #3c6945">34+<sup>g</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>GUSBP11</italic></td>
<td valign="top" align="left">S1.8/ rs3747107 / NR_024448: n.2961+6512 C&gt;G</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">4.7 (C) -&gt; -7.0 (G) (n)</td>
<td valign="top" align="center" style="color: #811517">25.9</td>
<td valign="top" align="center" style="color: #811517">98.3</td>
<td valign="top" align="center" style="color: #3c6945">31/42.8<sup>h</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">1.6</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y/N<sup>i</sup></td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PRAME</italic><sup>j</sup></td>
<td valign="top" align="left">S1.9/ rs2266988<sup>k</sup> / NM_006115: c.19G&gt;A</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #811517">7.8 (G) -&gt; 6.2 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">3.0</td>
<td valign="top" align="center">NC</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">8.8</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PRAME</italic><sup>j</sup></td>
<td valign="top" align="left">S1.14/ rs2072049 / NM_006115: c.954-16G&gt;T</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">8.2 (G) -&gt; 7.1 (T) (n)</td>
<td valign="top" align="center" style="color: #811517">2.2</td>
<td valign="top" align="center" style="color: #811517">2.6<sup>d</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">3.1<sup>d</sup></td>
<td valign="top" align="center">-</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>UBASH3A</italic></td>
<td valign="top" align="left">S1.10/ rs1893592 /<break/>NM_018961: c.1393+3A&gt;C</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #811517">8.7 (A) -&gt; 4.2 (C) (n)</td>
<td valign="top" align="center" style="color: #811517">22.9</td>
<td valign="top" align="center" style="color: #811517">3.0</td>
<td valign="top" align="center" style="color: #811517">2.0/1.7<sup>l</sup></td>
<td valign="top" align="center">N/A<sup>m</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">1.4</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>DERL3</italic></td>
<td valign="top" align="left">S1.15/ rs6003906 / NM_198440: c.328-8A&gt;T</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">&#x2013;2.1 (A) -&gt; -4.3 (T) (n)</td>
<td valign="top" align="center" style="color: #811517">4.6</td>
<td valign="top" align="center">NC</td>
<td valign="top" align="center" style="color: #3c6945">2.0</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ARFGAP3</italic></td>
<td valign="top" align="left">S1.3/ rs1018448 / NM_014570: c.1065C&gt;A</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">10.5 (C) -&gt; 8.3 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">4.5</td>
<td valign="top" align="center" style="color: #811517">2.2</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">1.4</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">2.0</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CFLAR</italic></td>
<td valign="top" align="left">S1.2/ rs10190751 / NM_003879: c.606+934G&gt;A NM_001127184:c.607-1G&gt;A</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">16.1 (G) -&gt; 5.2 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">1885</td>
<td valign="top" align="center" style="color: #811517">10000+</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">2.1</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IFI44L</italic></td>
<td valign="top" align="left">S1.11/ rs1333973 / NM_006820: c.478+3A&gt;T</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #811517">9.1 (A) -&gt; 4.6 (T) (n)</td>
<td valign="top" align="center" style="color: #811517">22.6</td>
<td valign="top" align="center" style="color: #811517">15.4</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">15.6</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">3.9</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>LPP</italic></td>
<td valign="top" align="left">S1.12/ rs13076750 / NM_001167671:c.-66-8G&gt;A</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">9.3 (G) -&gt; -1.6 (A) (n)</td>
<td valign="top" align="center" style="color: #811517">625</td>
<td valign="top" align="center" style="color: #811517">5000+</td>
<td valign="top" align="center" style="color: #3c6945">15.8</td>
<td valign="top" align="center" style="color: #3c6945">26.6</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">&#x2013;</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">Y</td>
</tr>
<tr>
<td valign="top" align="left"><italic>EMID1</italic></td>
<td valign="top" align="left">S1.7/ rs743920 / NM_133455: c.326C&gt;G</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #3c6945">6.0 (C) -&gt; 7.9 (G) (c)</td>
<td valign="top" align="center" style="color: #3c6945">3.6</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center" style="color: #3c6945">5.8</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CLDN14</italic></td>
<td valign="top" align="left">S1.22/ rs16994182 / NM_001146078:c.-82+4C&gt;G</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #811517">7.4 (C) -&gt; 6.8 (G) (n)</td>
<td valign="top" align="center" style="color: #811517">1.6</td>
<td valign="top" align="center" style="color: #811517">5.9</td>
<td valign="top" align="center" style="color: #3c6945">&#x2013;</td>
<td valign="top" align="center" style="color: #811517">2.1</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N/A</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>BCR</italic></td>
<td valign="top" align="left">S1.17/ rs16802 / NM_004327:c.2708-13A&gt;G</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #3c6945">5.6 (A) -&gt; 5.8 (G) (n)</td>
<td valign="top" align="center" style="color: #3c6945">1.1</td>
<td valign="top" align="center">NC<sup>n</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>TMPRSS3</italic></td>
<td valign="top" align="left">S1.19 / rs8130564 / NM_032404: c.66-13T&gt;C</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #3c6945">4.2 (T) -&gt; 4.4 (C) (n)</td>
<td valign="top" align="center" style="color: #3c6945">1.1</td>
<td valign="top" align="center">NC<sup>n</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>BACE2</italic></td>
<td valign="top" align="left">S1.18/ rs2252576 / NM_012105: c.748-10C&gt;T</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #3c6945">7.2 (C) -&gt; 8.0 (T) (n)</td>
<td valign="top" align="center" style="color: #3c6945">1.5</td>
<td valign="top" align="center">NC<sup>n</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CYB5R3</italic></td>
<td valign="top" align="left">S1.20/ rs2285141 / NM_007326: c.-48-18G&gt;T</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center" style="color: #811517">2.0 (G) -&gt; 1.2 (T) (n)</td>
<td valign="top" align="center" style="color: #811517">1.7</td>
<td valign="top" align="center">NC</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center" style="color: #3c6945">1.8</td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
<tr>
<td valign="top" align="left"><italic>FAM3B</italic></td>
<td valign="top" align="left">S1.16/ rs2838010 / NM_058186: c.20-217A&gt;T</td>
<td valign="top" align="center">D</td>
<td valign="top" align="center" style="color: #3c6945">&#x2013;10.8 (A) -&gt; 7.8 (T) (c)</td>
<td valign="top" align="center" style="color: #3c6945">228</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">N/A<sup>i</sup></td>
<td valign="top" align="center">n/a<sup>b</sup></td>
<td valign="top" align="center">Y</td>
<td valign="top" align="center">N</td>
<td valign="top" align="center">N</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Red text indicates a decrease in the abundance of a particular splice form, while green text indicates an increase in abundance. A &#x2013; Acceptor Splice Site Affected; D &#x2013; Donor Splice Site Affected; NC - Not detectable (abolished). <sup>a</sup> Splicing events which alter reading frame may induce nonsense-mediated decay; <sup>b</sup> No allele specific difference in expression and splicing; <sup>c</sup> complete discrimination of both isoforms using a custom designed TaqMan probe; <sup>d</sup> Values from comparing heterozygote with homozygote common; <sup>e</sup> Change in splicing likely related to change in RNA level; <sup>f</sup> Intron 2-3 retention of <italic>TTC3</italic> amplified by PCR, but no allele specific change detected; <sup>g</sup> This splice form not at detectable levels in homozygote; <sup>h</sup> Cryptic acceptor 114nt upstream of affected site / cryptic acceptor 118nt upstream of affected site; <sup>i</sup> mRNA in-frame when alternate exon is used, and out of frame due to cryptic site use; <sup>j</sup> <italic>PRAME</italic> is a special case where two SNPs affect splicing of two separate exons; <sup>k</sup> rs2266988 and rs1129172 are identical SNPs on opposite strands; <sup>l</sup> Cryptic donor 555nt downstream of affected site / cryptic donor 29nt downstream of affected site;  <sup>m</sup> Splice form not detected by PCR; <sup>n</sup> High variability between individuals with the same genotype by q-RT-PCR.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap id="T2" position="float">
<label>Table 2</label>
<caption>
<p>Abundance of mRNA splice forms relative to internal gene reference.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Gene</th>
<th valign="top" align="center">rsID</th>
<th valign="top" align="left">mRNA Splice Form</th>
<th valign="top" align="center">Homozygotes strong <break/>allele (%) <break/>[# Patients Tested]</th>
<th valign="top" align="center">Homozygotes weak <break/>allele (%) <break/>[# Patients Tested]</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>C21orf2</italic></td>
<td valign="top" align="center">rs2070573</td>
<td valign="top" align="left">Extended Exon 6 Splice Form</td>
<td valign="top" align="center">45.3 &#xb1; 16.9 [2]<sup>1</sup></td>
<td valign="top" align="center">1.5 &#xb1; 0.2 [2]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>TTC3</italic></td>
<td valign="top" align="center">rs2835585</td>
<td valign="top" align="left">Exon 3 Skipping</td>
<td valign="top" align="center">&lt;0.1 &#xb1; 0.0 [2]</td>
<td valign="top" align="center">0.3 &#xb1; 0.1 [2]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IL19<sup>2</sup></italic></td>
<td valign="top" align="center">rs2243187</td>
<td valign="top" align="left">3 nt Inclusion of Exon 5</td>
<td valign="top" align="center">~100 [1]</td>
<td valign="top" align="center">53.2 &#xb1; 3.0 [1; Het.]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IL19<sup>2</sup></italic></td>
<td valign="top" align="center">rs2243187</td>
<td valign="top" align="left">3 nt Exclusion of Exon 5</td>
<td valign="top" align="center">61.0 &#xb1; 3.4 [1]</td>
<td valign="top" align="center">~100 [1; Het.]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IL19<sup>2</sup></italic></td>
<td valign="top" align="center">rs2243187</td>
<td valign="top" align="left">Exon 5 Skipping</td>
<td valign="top" align="center">14.1 &#xb1; 1.4 [1]</td>
<td valign="top" align="center">6.8 &#xb1; 1.1 [1; Het.]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>UBASH3A</italic></td>
<td valign="top" align="center">rs1893592</td>
<td valign="top" align="left">29 nt Retention of Intron 10</td>
<td valign="top" align="center">8.6 &#xb1; 4.6 [1]</td>
<td valign="top" align="center">4.2 &#xb1; 5.8 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>XRCC4</italic></td>
<td valign="top" align="center">rs1805377</td>
<td valign="top" align="left">6 nt Inclusion of Exon 8</td>
<td valign="top" align="center">~100 [1]</td>
<td valign="top" align="center">3.1 &#xb1; 0.5 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>XRCC4</italic></td>
<td valign="top" align="center">rs1805377</td>
<td valign="top" align="left">6 nt Exclusion of Exon 8</td>
<td valign="top" align="center">2.6 &#xb1; 1.1 [1]</td>
<td valign="top" align="center">61.4 &#xb1; 26.3 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PRAME</italic></td>
<td valign="top" align="center">rs2266988</td>
<td valign="top" align="left">Normal Exon 3 Splicing</td>
<td valign="top" align="center">32.4 &#xb1; 3.6 [1]</td>
<td valign="top" align="center">20.1 &#xb1; 3.9 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PRAME</italic></td>
<td valign="top" align="center">rs2266988</td>
<td valign="top" align="left">Exon 3 Skipping</td>
<td valign="top" align="center">1.2 &#xb1; 1.2 [1]</td>
<td valign="top" align="center">3.5 &#xb1; 3.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>GUSBP11</italic></td>
<td valign="top" align="center">rs3747107</td>
<td valign="top" align="left">Exon 8 Splicing</td>
<td valign="top" align="center">34.0 &#xb1; 0.6 [1]</td>
<td valign="top" align="center">0.3 &#xb1; 0.0 [2]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>GUSBP11</italic></td>
<td valign="top" align="center">rs3747107</td>
<td valign="top" align="left">Alternative Exon 8</td>
<td valign="top" align="center">37.3 &#xb1; 7.3 [1]</td>
<td valign="top" align="center">39.8 &#xb1; 3.7 [2]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>GUSBP11</italic></td>
<td valign="top" align="center">rs3747107</td>
<td valign="top" align="left">114 nt Retention of Intron 7</td>
<td valign="top" align="center">0.1 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">1.4 &#xb1; 0.1 [2]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>GUSBP11</italic></td>
<td valign="top" align="center">rs3747107</td>
<td valign="top" align="left">118 nt Retention of Intron 7</td>
<td valign="top" align="center">0.2 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">8.0 &#xb1; 0.5 [2]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>DERL3</italic></td>
<td valign="top" align="center">rs6003906</td>
<td valign="top" align="left">Normal Exon 5 Splicing</td>
<td valign="top" align="center">~100 [1]</td>
<td valign="top" align="center">55.8 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>DERL3</italic></td>
<td valign="top" align="center">rs6003906</td>
<td valign="top" align="left">Extended Exon 4; Short Exon 5</td>
<td valign="top" align="center">3.2 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">4.0 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ARFGAP3</italic></td>
<td valign="top" align="center">rs1018448</td>
<td valign="top" align="left">Exon 12 Skipping</td>
<td valign="top" align="center">12.3 &#xb1; 5.7 [3]</td>
<td valign="top" align="center">23.1 &#xb1; 10.1 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IFI44L</italic></td>
<td valign="top" align="center">rs1333973</td>
<td valign="top" align="left">Normal Exon 2 Splicing</td>
<td valign="top" align="center">57.0 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">14.7 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>IFI44L</italic></td>
<td valign="top" align="center">rs1333973</td>
<td valign="top" align="left">Exon 2 Skipping</td>
<td valign="top" align="center">0.8 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">48.0 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CFLAR</italic></td>
<td valign="top" align="center">rs10190751</td>
<td valign="top" align="left">Upstream Exon 7 Use</td>
<td valign="top" align="center">~100 [1]</td>
<td valign="top" align="center">&lt;0.1 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CFLAR</italic></td>
<td valign="top" align="center">rs10190751</td>
<td valign="top" align="left">Downstream Exon 7 Use</td>
<td valign="top" align="center">39.0 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">87.1 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>WBP2NL</italic></td>
<td valign="top" align="center">rs17002806</td>
<td valign="top" align="left">25 nt Intron 6 Retention</td>
<td valign="top" align="center">N.D.<sup>2</sup></td>
<td valign="top" align="center">2.3 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CYB5R3</italic></td>
<td valign="top" align="center">rs2285141</td>
<td valign="top" align="left">Alternate Exon 2 Use</td>
<td valign="top" align="center">&lt;0.1 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">&lt;0.1 &#xb1; 0.0 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>EMID1</italic></td>
<td valign="top" align="center">rs743920</td>
<td valign="top" align="left">6 nt deletion of Exon 4</td>
<td valign="top" align="center">69.0 &#xb1; 9.7 [1]</td>
<td valign="top" align="center">9.2 &#xb1; 1.4 [1]</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CLDN14</italic></td>
<td valign="top" align="center">rs16994182</td>
<td valign="top" align="left">Exon 2 Skipping</td>
<td valign="top" align="center">1.9 &#xb1; 0.0 [1]</td>
<td valign="top" align="center">5.3 &#xb1; 0.0 [1]</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><sup>1</sup>Average expression was computed by comparing qPCR C<sub>t</sub> values across multiple experimental runs and normalized against C<sub>t</sub> of internal gene reference. SNPs tested in multiple experiments with one individual of each genotype will have a standard deviation of 0.0. <sup>2</sup>Heterozygote; Individuals who are homozygous for IL19 SNP rs2243187 were not available for testing. N.D., Not detected. C<sub>t</sub> values were not available for LPP rs13076750.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s3_2">
<title>SNPs Affecting Cryptic Site Strength and Activity</title>
<p>Increased cryptic site use coinciding with a decrease in natural site strength (<xref ref-type="table" rid="T1"><bold>Table 1</bold></xref>) was validated for: rs1805377 (<xref ref-type="fig" rid="f1"><bold>Figure 1</bold></xref>); rs2243187 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.4</bold></xref>); rs3747107 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.8</bold></xref>); rs17002806 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.13</bold></xref>); rs6003906 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.15</bold></xref>); and rs13076750 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.12</bold></xref>). rs2070573 (<xref ref-type="fig" rid="f2"><bold>Figure 2</bold></xref>) and rs743920 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.7</bold></xref>) strengthened cryptic splice sites resulting in increased use of these sites. Despite the difference in strength between the natural and cryptic sites affected by rs743920, the upstream 2.4 bit site was used more frequently (<xref ref-type="table" rid="T2"><bold>Table 2</bold></xref>). Both <italic>IL19</italic> and <italic>XRCC4</italic> regions tested showed preference to the upstream acceptor as well, which is consistent with the processive mechanism documented to recognize acceptor splice sites (<xref ref-type="bibr" rid="B68">Robberson et al., 1990</xref>).</p>
</sec>
<sec id="s3_3">
<title>SNPs Affecting Exon Inclusion</title>
<p>SNPs that reduced natural site strength (&#x394;<italic>R<sub>i</sub></italic> from 1.6 to 10.9 bits) increased exon skipping from 2- to 27-fold for homozygotes of differing genotypes of rs2835585 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.21</bold></xref>), rs1018448 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.3</bold></xref>), rs1333973 (<xref ref-type="fig" rid="f3"><bold>Figure 3</bold></xref>), rs2266988 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.9</bold></xref>), and rs13076750 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.12</bold></xref>). The exon microarray probesets for rs1018448 and rs1333973 detect decreased expression by genotype, which is consistent with increased exon skipping. Changes of average SI values did not correspond as well to specific genotypes for rs2835585 (<italic>TTC3)</italic>, rs2266988 (<italic>PRAME)</italic>, and rs13076750 (<italic>LPP)</italic>, possibly due to increased cryptic site use (<italic>LPP</italic>) or large differences in the abundance of constitutive and skipped isoforms (<italic>PRAME</italic>, <italic>TTC3</italic>).</p>
</sec>
<sec id="s3_4">
<title>SNPs Promoting Alternate Exon Use</title>
<p>SNP-related decreases in natural splice site strength may promote the use of alternative exons up or downstream of the affected exon. rs10190751 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.2</bold></xref>) is known to modulate the presence of the shorter c-FLIP(S) splice form of <italic>CFLAR</italic> (<xref ref-type="bibr" rid="B89">Ueffing et al., 2009</xref>). The use of this exon differed by 2<sup>17</sup>-fold between the strong and weak homozygotes tested, which was reflected by the expression microarray result. By q-RT-PCR, the <italic>CFLAR</italic> (L) form using an alternate downstream exon was found to be 2.1-fold more abundant in the homozygote with the weaker splice site. rs3747107 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.8</bold></xref>) and rs2285141 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.20</bold></xref>) exhibit evidence of an increased preference by q-RT-PCR for activation of an alternate exon, though the microarray results for the corresponding genotypes for both SNPs were not significantly different.</p>
</sec>
<sec id="s3_5">
<title>SNP-Directed Effects on mRNA Levels</title>
<p>A change in the strength of a natural site of an exon can affect the quantity of the processed mRNA (<xref ref-type="bibr" rid="B9">Caminsky et al., 2014</xref>). This decrease in mRNA could be caused by nonsense mediated decay (NMD), which degrades aberrant transcripts that would result in premature protein truncation (<xref ref-type="bibr" rid="B12">Cartegni et al., 2002</xref>). Of the 22 SNPs tested, 2 showed a direct correlation between a decrease in natural splice site strength, reduced amplification of the internal reference by q-RT-PCR (of multiple individuals) and a decreasing trend in expression by genotype by microarray: rs2072049 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.9</bold></xref>) and rs1018448 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.3</bold></xref>), although these differences do not meet statistical significance.</p>
</sec>
<sec id="s3_6">
<title>SNPs With Pertinent Splicing Effects Detected by RNAseq</title>
<p>Evidence for impact on splicing of the previously described SNPs was also assessed in TCGA and ICGC tumors by high throughput expression analyses. Splicing effects of these variants detected by q-RT-PCR and RNAseq were concordant in 80% of cases (N = 16 of 20 SNPs), while impacts of 10% of SNPs (N = 2) were partially concordant as a result of inconsistent activation of cryptic splice sites (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Images 1.8D</bold></xref> and <xref ref-type="supplementary-material" rid="SM1"><bold>1.10D</bold></xref>). Several isoforms predicted by information analysis of these SNPs were present in complete transcriptomes, but were undetectable by q-RT-PCR or expression microarrays. Examples include a 4.9 bit cryptic site activated by rs3747107 located 2 nucleotides from the natural splice site (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.8D</bold></xref>), exon skipping by rs1893592 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.10D</bold></xref>), and a cryptic exon activated by rs2838010 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.16C</bold></xref>). Processed mRNAs that were not detected by q-RT-PCR may have arisen as a result of a lack of sensitivity of the assay, to NMD (which could mask detection of mis-splicing), to a deficiency of an undefined trans-acting splicing factor, or to design limitations in the experimental design. Another possibility is that the discordant splicing patterns of these two SNPs based could potentially be related to differences in tissue origin, since only the RNAseq findings were tumor-derived, whereas results obtained by the other approaches were generated from RNA extracted from lymphoblastoid cell lines. Cell culture conditions such as cell density and phosphorylation status can affect alternative splicing patterns (<xref ref-type="bibr" rid="B45">Li et al., 2006</xref>; <xref ref-type="bibr" rid="B84">Szafranski et al., 2014</xref>). These conditions, however, have not been studied in cases of allele-specific, sequence differences at splice sites or cis-acting regulatory sites that impact splice site selection. Considering the high level of concordance of splicing effects for the same SNPs in uncultured and cultured cells, it seems unlikely that culture conditions significantly impacts the majority of allele-specific, alternatively spliced isoforms. Our information theory-based analyses show that the dominant effect of SNP genotypes is to dictate common changes in splice site strength regardless of cell origin.</p>
<p>The results obtained from q-RT-PCR and RNAseq data for rs2070573, rs10190751, rs13076750, rs2072049, rs2835585, rs1893592, and rs1805377 were complementary to findings based on RNAseq (<xref ref-type="supplementary-material" rid="SM3"><bold>Supplementary Table 2</bold></xref>). RNAseq data can reveal potential allele-specific alternate splicing events that were not considered at the primer design phase of the study, while q-RT-PCR is more sensitive and can reveal less abundant alternative splice forms. A weak 0.4 bit splice site associated with rs2070573 was less abundant than the extended isoform (<xref ref-type="fig" rid="f2"><bold>Figure 2</bold></xref>) by both q-RT-PCR and exon microarray, however ValidSpliceMut also revealed increased total <italic>C21orf2</italic> intron 6 retention in five tumors with this allele. Similarly, rs10190751 was flagged for intron retention in 29 tumors, which was not evident by the other approaches. The long form of this transcript (c-FLIP[L]) in homozygous carriers of this SNP was twice as abundant by q-RT-PCR than the shorter allele, associated with the weak splice site. rs13076750 activates an alternate acceptor site for a rare exon that extends the original exon length by seven nucleotides. The exon boundary can also extend into an adjacent exon, based on RNAseq of eight tumors carrying this SNP. Expression was decreased in the presence of a 6.2 bit splice site derived from a rs2072049 allele that weakens the natural acceptor site of the terminal exon of <italic>PRAME</italic>. The actual cause of diminished expression is likely to have been related to NMD from intron retention. ValidSpliceMut showed intron retention to be increased in rs2835585, whereas increased exon skipping for the allele with the weaker splice site was demonstrated by q-RT-PCR. rs1893592 caused significant intron retention in all tumors (N = 9), with exon skipping present in 3 diffuse large B-cell lymphoma patients, which was not detected by q-RT-PCR. Finally, rs1805377 was associated with the significant abundance of read sequences indicating <italic>XRCC4</italic> intron 7 retention by RNAseq (N = 32), however this isoform could not be distinguished by the primers designed for q-RT-PCR and by TaqMan assay.</p>
<p>Alternative splicing events detected by RNAseq that were not evident in either q-RT-PCR or microarray studies included exon skipping induced by rs743920 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.7D</bold></xref>), activation of a preexisting cryptic splice site by rs1333973 (<xref ref-type="fig" rid="f3"><bold>Figure 3</bold></xref>), and intron retention by rs6003906 (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.15D</bold></xref>). rs743920 creates an exonic hnRNP A1 site (<italic>R<sub>i</sub></italic> = 2.8 bits) distant from the natural site which may compromise exon definition (<xref ref-type="bibr" rid="B55">Mucaki et al., 2013</xref>; <xref ref-type="bibr" rid="B65">Peterlongo et al., 2015</xref>) and may explain the SNP-associated increase in exon skipping. Exon definition analyses of total exon information (<italic>R<sub>i,total</sub></italic>) also predicted the cryptic isoform arising from rs1333973 to be the most abundant (<italic>R<sub>i,total</sub></italic> = 9.4 bits).</p>
</sec>
<sec id="s3_7">
<title>Allele-Specific mRNA Splicing for Other SNPs Identified Through RNAseq</title>
<p>A distinct set of 24 high population frequency SNPs were also evaluated for their potential impact on mRNA splicing by RNAseq analysis of ICGC patients. Those resulting in significantly decreased natural splice site strength (&#x394;<italic>R<sub>i</sub></italic> &lt; &#x2212;1 bit) were analyzed for SNP-derived alternative splicing events. SNPs fulfilling these criteria expressed at sufficient levels over the region of interest were: rs6467, rs36135, rs154290, rs166062, rs171632, rs232790, rs246391, rs324137, rs324726, rs448580, rs469074, rs518928, rs624105, rs653667, rs694180, rs722442, rs748767, rs751128, rs751552, rs752262, rs832567, rs909958, rs933208, and rs1018342 (<xref ref-type="table" rid="T3"><bold>Table 3</bold></xref>). Splicing was predicted to be leaky for all natural splice sites affected by these SNPs (<xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>; <italic>R<sub>i,final</sub></italic> &#x2265; 1.6 bits), where reduction in <italic>R<sub>i</sub></italic> values ranged from 1.1 to 3.3 bits.</p>
<table-wrap id="T3" position="float">
<label>Table 3</label>
<caption>
<p>RNAseq analysis of natural splice sites weakened by common single nucleotide polymorphisms (SNPs).</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Gene</th>
<th valign="top" align="center">rsID<sup>1</sup></th>
<th valign="top" align="center">HGVS Notation<break/>(HG19)<sup>2</sup></th>
<th valign="top" align="center"><italic>R<sub>i</sub> initial</italic></th>
<th valign="top" align="center"><italic>R<sub>i</sub> final</italic></th>
<th valign="top" align="center"><italic>&#x394;R<sub>i</sub></italic></th>
<th valign="top" align="center">Alternative Splicing Observed</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>CYP21A2</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs6467">rs6467</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=447403+447404+819744&amp;referenceName=6&amp;alternateBases=A&amp;start=32006858&amp;referenceBases=C&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">6:32006858C &gt; A</uri></td>
<td valign="top" align="center">6.1</td>
<td valign="top" align="center">4.5</td>
<td valign="top" align="center">&#x2212;1.6</td>
<td valign="top" align="left">Intron Retention; Cryptic Site Use</td>
</tr>
<tr>
<td valign="top" align="left"><italic>TRIM23</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs36135">rs36135</uri></td>
<td valign="top" align="center">5:64890479A &gt; C</td>
<td valign="top" align="center">10.0</td>
<td valign="top" align="center">7.5</td>
<td valign="top" align="center">&#x2212;2.5</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ZFYVE16</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs166062">rs166062</uri></td>
<td valign="top" align="center">5:79773028T &gt; G</td>
<td valign="top" align="center">14.1</td>
<td valign="top" align="center">11.8</td>
<td valign="top" align="center">&#x2212;2.4</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>APBB3</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs171632">rs171632</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=447388&amp;referenceName=5&amp;alternateBases=G&amp;start=139941318&amp;referenceBases=A&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">5:139941318A &gt; G</uri></td>
<td valign="top" align="center">2.5</td>
<td valign="top" align="center">1.4</td>
<td valign="top" align="center">&#x2212;1.1</td>
<td valign="top" align="left">Intron Retention; Cryptic Site Use</td>
</tr>
<tr>
<td valign="top" align="left"><italic>SMIM8</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs448580">rs448580</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=325307+684696+684697&amp;referenceName=6&amp;alternateBases=G&amp;start=88040399&amp;referenceBases=T&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">6:88040399T &gt; G</uri></td>
<td valign="top" align="center">6.9</td>
<td valign="top" align="center">4.4</td>
<td valign="top" align="center">&#x2212;2.5</td>
<td valign="top" align="left">Exon Skipping</td>
</tr>
<tr>
<td valign="top" align="left"><italic>FCHSD1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs469074">rs469074</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=428442+428443+428444+428445+428446+428447+428448+812575+812576+812577+812578+812579+812580+812581+812582+812583&amp;referenceName=5&amp;alternateBases=G&amp;start=141024136&amp;referenceBases=T&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">5:141024136T &gt; G</uri></td>
<td valign="top" align="center">10.4</td>
<td valign="top" align="center">7.1</td>
<td valign="top" align="center">&#x2212;3.3</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>KIFAP3</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs518928">rs518928</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=447481&amp;referenceName=1&amp;alternateBases=G&amp;start=169890933&amp;referenceBases=A&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">1:169890933A &gt; G</uri></td>
<td valign="top" align="center">15.6</td>
<td valign="top" align="center">14.5</td>
<td valign="top" align="center">&#x2212;1.1</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CEPT1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs694180">rs694180</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=447444">1:111726213A &gt; G</uri></td>
<td valign="top" align="center">9.6</td>
<td valign="top" align="center">7.0</td>
<td valign="top" align="center">&#x2212;2.6</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ADCY10P1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs722442">rs722442</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=411458+814669+863502&amp;referenceName=6&amp;alternateBases=G&amp;start=41089681&amp;referenceBases=A&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">6:41089681A &gt; G</uri></td>
<td valign="top" align="center">7.5</td>
<td valign="top" align="center">4.9</td>
<td valign="top" align="center">&#x2212;2.5</td>
<td valign="top" align="left">Exon Skipping</td>
</tr>
<tr>
<td valign="top" align="left"><italic>MICAL1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs752262">rs752262</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=427495+427496+427497+427498+427499+427500+427501+427502+427503+427504+427505+447412+792832&amp;referenceName=6&amp;alternateBases=C&amp;start=109770999&amp;referenceBases=G&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">6:109770999G &gt; C</uri></td>
<td valign="top" align="center">4.0</td>
<td valign="top" align="center">2.7</td>
<td valign="top" align="center">&#x2212;1.4</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>MAP3K1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs832567">rs832567</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=82130+432785+432786+432787+432788+432789&amp;referenceName=5&amp;alternateBases=A&amp;start=56152416&amp;referenceBases=C&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">5:56152416C &gt; A</uri></td>
<td valign="top" align="center">6.8</td>
<td valign="top" align="center">5.0</td>
<td valign="top" align="center">&#x2212;1.7</td>
<td valign="top" align="left">Intron Retention; Exon Skipping</td>
</tr>
<tr>
<td valign="top" align="left"><italic>METTL13</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs909958">rs909958</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=80782+83375+447482+447483&amp;referenceName=1&amp;alternateBases=A&amp;start=171763522&amp;referenceBases=C&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">1:171763522C &gt; A</uri></td>
<td valign="top" align="center">11.4</td>
<td valign="top" align="center">9.9</td>
<td valign="top" align="center">&#x2212;1.4</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>DDX39B</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs933208">rs933208</uri></td>
<td valign="top" align="center">6:31506648G &gt; T</td>
<td valign="top" align="center">4.8</td>
<td valign="top" align="center">3.7</td>
<td valign="top" align="center">&#x2212;1.1</td>
<td valign="top" align="left">Intron Retention; Cryptic Site Use</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CCT7</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs1018342">rs1018342</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=447970+447971&amp;referenceName=2&amp;alternateBases=G&amp;start=73471653&amp;referenceBases=T&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">2:73471653T &gt; G</uri></td>
<td valign="top" align="center">5.9</td>
<td valign="top" align="center">4.4</td>
<td valign="top" align="center">&#x2212;1.5</td>
<td valign="top" align="left">Intron Retention</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PPIP5K2</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs154290">rs154290</uri></td>
<td valign="top" align="center">5:102537200T &gt; G</td>
<td valign="top" align="center">12.5</td>
<td valign="top" align="center">11.3</td>
<td valign="top" align="center">&#x2212;1.3</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>MYSM1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs232790">rs232790</uri></td>
<td valign="top" align="center">1:59131311G &gt; T</td>
<td valign="top" align="center">10.4</td>
<td valign="top" align="center">8.7</td>
<td valign="top" align="center">&#x2212;1.7</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PDGFRB</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs246391">rs246391</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=78727+921594+921595+921596+921597+921599+921600&amp;referenceName=5&amp;alternateBases=C&amp;start=149497177&amp;referenceBases=T&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">5:149497177T &gt; C</uri></td>
<td valign="top" align="center">6.2</td>
<td valign="top" align="center">3.6</td>
<td valign="top" align="center">&#x2212;2.6</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>AARS2</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs324137">rs324137</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=447409">6:44273546A &gt; C</uri></td>
<td valign="top" align="center">10.2</td>
<td valign="top" align="center">8.9</td>
<td valign="top" align="center">&#x2212;1.3</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>USO1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs324726">rs324726</uri></td>
<td valign="top" align="center">4:76722353G &gt; A</td>
<td valign="top" align="center">11.8</td>
<td valign="top" align="center">8.8</td>
<td valign="top" align="center">&#x2212;3.0</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>KIF13A</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs624105">rs624105</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=429720&amp;referenceName=6&amp;alternateBases=C&amp;start=17855864&amp;referenceBases=G&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">6:17855864G &gt; C</uri></td>
<td valign="top" align="center">14.1</td>
<td valign="top" align="center">13.0</td>
<td valign="top" align="center">&#x2212;1.1</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>TNFRSF1B</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs653667">rs653667</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=921006+921007+921008+921009+921010&amp;referenceName=1&amp;alternateBases=G&amp;start=12251808&amp;referenceBases=T&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">1:12251808T &gt; G</uri></td>
<td valign="top" align="center">3.7</td>
<td valign="top" align="center">2.4</td>
<td valign="top" align="center">&#x2212;1.3</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CCDC93</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs748767">rs748767</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=78766+447505+447506&amp;referenceName=2&amp;alternateBases=A&amp;start=118731573&amp;referenceBases=G&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">2:118731573G &gt; A</uri></td>
<td valign="top" align="center">4.6</td>
<td valign="top" align="center">3.4</td>
<td valign="top" align="center">&#x2212;1.1</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>CAPN2</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs751128">rs751128</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=80842+80843+80844+447491&amp;referenceName=1&amp;alternateBases=C&amp;start=223951841&amp;referenceBases=T&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">1:223951841T &gt; C</uri></td>
<td valign="top" align="center">5.3</td>
<td valign="top" align="center">4.2</td>
<td valign="top" align="center">&#x2212;1.1</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
<tr>
<td valign="top" align="left"><italic>FYCO1</italic></td>
<td valign="top" align="center">
<uri xlink:href="https://www.ncbi.nlm.nih.gov/snp/rs751552">rs751552</uri></td>
<td valign="top" align="center">
<uri xlink:href="https://validsplicemut.cytognomix.com/view.php?targets=78853+447520+447521&amp;referenceName=3&amp;alternateBases=T&amp;start=46016851&amp;referenceBases=A&amp;assemblyId=GRCh37&amp;includeDatasetResponses=HIT">3:46016851A &gt; T</uri></td>
<td valign="top" align="center">6.9</td>
<td valign="top" align="center">4.7</td>
<td valign="top" align="center">&#x2212;2.2</td>
<td valign="top" align="left">Wildtype Only</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><sup>1</sup>rsIDs are hyperlinked to their associated dbSNP page; <sup>2</sup>If present, variant coordinates are hyperlinked to the ValidSpliceMut database; Thick bars separate SNP-affected exons with and without RNAseq-observed alternate splicing events.</p>
</table-wrap-foot>
</table-wrap>
<p>Alternative mRNA splicing was observed in 14 SNPs: rs6467, rs36135, rs166062, rs171632, rs448580, rs469074, rs518928, rs694180, rs722442, rs752262, rs832567, rs909958, rs933208, rs1018342; <xref ref-type="table" rid="T3"><bold>Table 3</bold></xref>). Reads spanning these regions revealed intron retention (N = 12), activation of cryptic splicing (N&#xa0;=&#xa0;4), and complete exon skipping (N = 3). Eleven of these SNPs (79%) exhibited splicing patterns that significantly differed from the control alleles, and were therefore present in ValidSpliceMut. Interestingly, ValidSpliceMut contained entries for 7 of 10 SNPs where alternative splicing had not been found in the two patients reported in <xref ref-type="table" rid="T3"><bold>Table 3</bold></xref> (rs246391, rs324137, rs624105, rs653667, rs748767, rs751128, rs751552). The observed significant splicing differences for these SNPs occurred in distinct tumor types, consistent with tissue-specific effects of these SNPs on splicing.</p>
</sec>
<sec id="s3_8">
<title>Instances of Limited Corroboration of SNP-Related Predictions</title>
<p>Anticipated effects of the SNPs on splicing were not always confirmed by expression studies. Aside from incomplete or incorrect predictions, both design and execution of these studies as well as uncharacterized tissue specific effects could provide an explanation for these discrepancies. Furthermore, these undetected splicing events may have been targeted for NMD, however expression was not compared with mRNA levels from cells cultured with an inhibitor of protein translation. Stronger preexisting cryptic sites were, in some instances, not recognized nor was isoform abundance changed. These include: rs1893592 (6.4 and 5.2 bit cryptic donor sites 29 and 555 nt downstream of the affected donor); rs17002806 (a 5.7 bit site 67 nt downstream of the natural site); rs3747107 [creates a 4.9 bit cryptic site 2 nt downstream (observed by RNAseq; <xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.8 D</bold></xref>)]; and rs2835585 (5.8 and 5.9 bit cryptic sites 60 and 87 nt upstream of the natural site). SNPs with modestly decreased natural site strength <italic>(</italic>0.2 to 4.5 bits) did not consistently result in exon skipping (for example, rs1893592, rs17002806, and rs2835655).</p>
<p>Six SNPs predicted to disrupt natural splice sites could not be confirmed. Splicing effects were not identified in the 4 SNPs where the information change was &lt; 1 bit (&lt;twofold). Genetic variability masked potential splicing effects of three of these SNPs, including rs16802, rs2252576, and rs8130564 (<xref ref-type="table" rid="T1"><bold>Table 1</bold></xref>). PCR primer sets designed for <italic>COL6A2</italic> exon 21 (affected by rs17357592) did not produce the expected amplicon. Interpreting the results for rs16994182 (<italic>CLDN14</italic>) was complicated by the lack of a suitable internal reference. As <italic>CLDN14</italic> consists of three exons, any internal reference covering the affected second exon cannot parse whether differences in exon 2 expression were caused by the SNP or by general expression changes.</p>
<p>The T-allele rs2838010 was predicted to activate a donor splice site of a rare exon in IVS1 of <italic>FAM3B</italic> (GenBank Accession AJ409094). The cryptic pseudoexon was neither detected by RT-PCR nor expression microarray (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.16</bold></xref>). Interestingly, this exon is expressed in a malignant lymphoma patient who is a carrier for this genotype [ICGC ID: DO27769; (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.16C</bold></xref>)]. Although the T-allele is probably required to activate the pseudoexon, additional unknown splicing-related factors appear to be necessary.</p>
</sec>
</sec>
<sec id="s4" sec-type="discussion">
<title>Discussion</title>
<p>Predicted SNP alleles that alter constitutive mRNA splicing are confirmed by expression data, and appear to be a common cause of alternative splicing. The preponderance of leaky splicing mutations and cryptic splice sites, which often produce both normal and mutant transcripts, is consistent with balancing selection (<xref ref-type="bibr" rid="B60">Nuzhdin et al., 2004</xref>) or possibly with mutant loci that contribute to multifactorial disease. Minor SNP alleles are often found in &gt; 1% of populations (<xref ref-type="bibr" rid="B35">Janos&#xed;kov&#xe1; et al., 2005</xref>). This would be consistent with a bias against finding mutations that abolish splice site recognition in dbSNP. Such mutations are more typical in rare Mendelian disorders (<xref ref-type="bibr" rid="B72">Rogan et al., 1998</xref>).</p>
<p>Exon-based expression microarrays and q-RT-PCR techniques were initially used to confirm the predicted impact of common and rare SNPs on splicing. Results were subsequently confirmed using RNAseq data for some of these SNPs (<xref ref-type="bibr" rid="B22">Dorman et al., 2014</xref>; <xref ref-type="bibr" rid="B91">Viner et al., 2014</xref>; <xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>). However, exon skipping due to rs1893592 was not consistently seen in all carriers. Although detected only in one type of tumor, this event may not be tissue specific, since five patients with the same genotype did not exhibit this isoform. Nevertheless, exon skipping was also observed in malignant lymphoma (<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Image 1.10D</bold></xref>). Intron retention in rs1805377 carriers was evident in only 22% of tumors. Increased total intron retention may be due to failure to recognize exons due to overlapping strong splice sites (<xref ref-type="bibr" rid="B92">Vockley et al., 2000</xref>; <xref ref-type="bibr" rid="B73">Rogan et al., 2003</xref>).</p>
<p>The splicing impacts of several of these SNPs have also been implicated in other studies. rs10190751 modulates the FLICE-inhibitory protein (c-FLIP) from its S-form to its R-form, with the latter having been linked to increased lymphoma risk (<xref ref-type="bibr" rid="B89">Ueffing et al., 2009</xref>). We observed the R-form to be twice as abundant for one of the rs10190751 alleles. Increased exon skipping attributed to rs1333973 has been reported in RNAseq analysis of <italic>IFI44L</italic> (<xref ref-type="bibr" rid="B101">Zhao et al., 2013a</xref>), which has been implicated in reduced antibody response to measles vaccine (<xref ref-type="bibr" rid="B30">Haralambieva et al., 2017</xref>). The splicing impact of <italic>XRCC4</italic> rs1805377 has been noted previously (<xref ref-type="bibr" rid="B57">Nalla and Rogan, 2005</xref>). This SNP has been implicated with an increased risk of gastric cancer (<xref ref-type="bibr" rid="B15">Chiu et al., 2008</xref>), pancreatic cancer (<xref ref-type="bibr" rid="B20">Ding and Li, 2015</xref>) and glioma (<xref ref-type="bibr" rid="B102">Zhao et al., 2013b</xref>). Similarly, the potential impact of rs1893592 in <italic>UBASH3A</italic> has been recognized (<xref ref-type="bibr" rid="B39">Kim et al., 2015</xref>) and is associated with arthritis (<xref ref-type="bibr" rid="B47">Liu et al., 2017</xref>) and type 1 diabetes (<xref ref-type="bibr" rid="B26">Ge and Concannon, 2018</xref>). <xref ref-type="bibr" rid="B31">Hiller et al. (2006)</xref> described the 3nt deletion caused by rs2243187 in <italic>IL19</italic> but did not report increased exon skipping. rs743920 was associated with change in <italic>EMID1</italic> expression (<xref ref-type="bibr" rid="B27">Ge et al., 2005</xref>), however its splicing impact was not recognized. Conversely, studies linking <italic>TMPRSS3</italic> variants to hearing loss did not report rs8130564 to be significant (<xref ref-type="bibr" rid="B44">Lee et al., 2013</xref>; <xref ref-type="bibr" rid="B16">Chung et al., 2014</xref>). Interestingly, rs2252576 (in which we did not find a splicing alteration) has been associated to Alzheimer&#x2019;s dementia in Down syndrome (<xref ref-type="bibr" rid="B53">Mok et al., 2014</xref>).</p>
<p>rs2835585 significantly increased exon skipping in <italic>TTC3</italic>, however normal expression levels at the affected exon junction were not significantly altered. This was most likely due to the large difference in abundance between the constitutive and skipped splice isoforms (<xref ref-type="table" rid="T2"><bold>Table 2</bold></xref>). The skipped isoform does not disrupt the reading frame and the affected coding region has not been assigned to any known protein domain (<xref ref-type="bibr" rid="B88">Tsukahara et al., 1996</xref>; <xref ref-type="bibr" rid="B81">Suizu et al., 2009</xref>). It is unclear whether allele-specific, exon skipping in this instance would impact TTC3 protein function or activity.</p>
<p>Why are so few natural splice sites strengthened by SNP-induced information changes? Most such changes would be thought to be neutral mutations, which are ultimately lost by chance (<xref ref-type="bibr" rid="B24">Fisher, 1930</xref>). Those variants which are retained are more likely to confer a selective advantage (<xref ref-type="bibr" rid="B46">Li, 1967</xref>). Indeed, the minor allele in rs2266988, which strengthens a donor splice site by 1.6 bits at the 5' end of the open reading frame in <italic>PRAME</italic> and occurs in 25% of the overall population (~50% in Europeans). Several instances of modest changes in splice site strength that would be expected to have little or no impact, in fact, alter the degree of exon skipping.</p>
<p>Allele frequency can significantly vary across different populations, which can be indicative of gene flow and migration of a population (<xref ref-type="bibr" rid="B14">Cavalli-Sforza and Bodmer, 1971</xref>) as well as, in the case of splicing variants, genetic load and fitness in a population (<xref ref-type="bibr" rid="B70">Rogan and Mucaki, 2011</xref>). The frequencies of several of the variants presented here are significantly different between ethnic and geographically defined populations (<xref ref-type="table" rid="T1"><bold>Tables 1</bold></xref> and <xref ref-type="table" rid="T3"><bold>3</bold></xref>). We examined allele frequencies of these variants in sub-populations in both HapMap and dbSNP version 153. For example, representation of the alleles of the <italic>XRCC4</italic> variant rs1805377 (where its A-allele leads to a 6 nt deletion of the gene's terminal exon) differs between Caucasians and Asians (for the G- and A-alleles, respectively). Different linkage disequilibrium patterns of this variant occur in Han Chinese (CHB) and Utah residents with Northern and Western European ancestry (CEU) populations (<xref ref-type="bibr" rid="B102">Zhao et al., 2013b</xref>). Similar differences in SNP population frequency include <italic>EMID1</italic> rs743920 (in HapMap: G-allele frequency is 47% in CHB, but only 7% in CEU and 10% in northern Swedish cohorts). This is consistent with dbSNP (version 153) where it is present in 72% in Vietnamese, but only 16% of a northern Sweden cohort). In <italic>BACE2</italic>, rs2252576, the T-allele is most prevalent at 84% in Yoruba in Ibadan, Nigeria populations (YRI), but only 8% in CHB. In <italic>FCHSD1</italic>, rs469074 the frequency of the G-allele is 37% in YRI and &lt;1% in CHB. Some SNPs were exclusively present in a single population in the HapMap cohort (e.g. only the YRI population is polymorphic for <italic>IL19</italic> rs2243187, <italic>WBP2NL</italic> rs17002806, <italic>DERL3</italic> rs6003906, and <italic>CLDN14</italic> rs16994182).</p>
<p>Because of their effects on mRNA splicing, these differences in allele frequency would be expected to alter the relative abundance of certain protein isoforms in these populations. We speculate about whether isoform-specific representation among populations influences disease predisposition, other common phenotypic differences, or whether they are neutral. We suggest that SNPs decreasing constitutive splicing while increasing mRNA isoforms which alter the reading frame would be more likely to result in a distinct phenotype. q-RT-PCR experiments confirmed five SNPs which increased the fraction of mRNA splice forms causing a frameshift (<xref ref-type="table" rid="T1"><bold>Table 1</bold></xref>), three of which simultaneously decrease constitutive splicing by &#x2265; 10-fold (<italic>WBP2NL</italic> rs17002806; <italic>GUSBP11</italic> rs3747107; and <italic>IFI44L</italic> rs1333973). rs3747107 and rs17002806 are much more common in YRI populations in HapMap (rs3747107 G-allele is present in 64% in YRI but only 23% in CEU; rs17002806 A-allele was not identified in any CHB or CEU individuals), while the A-allele of rs1333973 is much more common in CHB (76%, compared to 31% and 35% in CEU and YRI populations, respectively). These common variants are likely to change the function of these proteins and may influence individual phenotypes. A somewhat comprehensive catalog of DNA polymorphisms with splicing effects&#x2014;confined or with increased prevalence in specific ethnic or geographically identifiable groups&#x2014;could be derived from combining ValidSpliceMut with population-specific SNP databases. Aside from those phenotypes described earlier, genes implicated by GWAS or other analyses for specific disorders represent reasonable candidates for further detailed or replication studies aimed at identification of the risk alleles in these cohorts.</p>
<p>The extent to which SNP-related sequence variation accounts for the heterogeneity in mRNA transcript structures has been somewhat unappreciated, given the relatively high proportion of genes that exhibit tissue-specific alternative splicing (<xref ref-type="bibr" rid="B62">Pan et al., 2008</xref>; <xref ref-type="bibr" rid="B96">Wang et al., 2008</xref>; <xref ref-type="bibr" rid="B2">Baralle and Giudice, 2017</xref>). This and our previous study (<xref ref-type="bibr" rid="B77">Shirley et al., 2019</xref>) raise questions regarding the degree to which apparent alternative splicing is the result of genomic polymorphism rather than splicing regulation alone. Because much of the information required for splice site recognition resides within neighboring introns, it would be prudent to consider contributions from intronic and exonic polymorphism that produce structural mRNA variation, since these changes might be associated with disease or predisposition.</p>
<p>Individual information corresponds to a continuous molecular phenotypic measure that is well suited to the analysis of contributions of multiple, incompletely penetrant SNPs in different genes, as typically seen in genetically complex diseases (<xref ref-type="bibr" rid="B18">Cooper et al., 2013</xref>). Our protocol identifies low or nonpenetrant allele-specific alternative splicing events through bioinformatic analysis, and either q-RT-PCR, exon microarrays or RNAseq data analysis. Allele-specific splicing can also be determined by full-length alternative isoform analysis of RNA [or FLAIR (<xref ref-type="bibr" rid="B97">Workman et al., 2019</xref>)]. Differentiated splice forms are associated with specific alleles in heterozygotes with exonic SNPs. However, combining genome-based information with FLAIR may enable identification of intronic SNPs influencing splicing and low abundance alternative splice forms, which might otherwise be missed by FLAIR.</p>
<p>Targeted splicing analysis generally reproduces the results of our multi- genome-wide surveys of sequence variations affecting mRNA splicing. As splicing mutations and their effects were often observed in multiple tumor types, the impact of these mutations may be pleiotropic. Some events were only detected in q-RT-PCR data and not by RNAseq (and vice versa), highlighting the complementarity of these techniques for splicing mutation analyses. Results of this study increase confidence that the publicly (<uri xlink:href="https://ValidSpliceMut.cytognomix.com">https://ValidSpliceMut.cytognomix.com</uri>) and commercially (<uri xlink:href="https://MutationForecaster.com">https://MutationForecaster.com</uri>) available resources for information-theory based variant analysis and validation can distinguish mutations contributing to aberrant molecular phenotypes from allele-specific alternative splicing.</p>
</sec>
<sec id="s5">
<title>Data Availability Statement</title>
<p>All datasets generated for this study are included in the article/<xref ref-type="supplementary-material" rid="SM1"><bold>Supplementary Material</bold></xref>.</p>
</sec>
<sec id="s6">
<title>Ethics Statement</title>
<p>Controlled-access TCGA and ICGC sequence data was approved by NCBI at the US National Institutes of Health (dbGaP Project #988: &#x201c;Predicting common genetic variants that alter the splicing of human gene transcripts&#x201d;; Approval Number #13930-11; PI: PK Rogan) and by the International Cancer Genome Consortium (ICGC Project #DACO-1056047; &#x201c;Validation of mutations that alter gene expression&#x201d;).</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>EM designed and performed all q-RT-PCR experiments, processed and analyzed publicly exon microarray data, and performed all formal analysis. BS performed data curation and software development. PR conceptualized the project and was the project administrator. EM and PR prepared the original draft of the manuscript, while EM, BS, and PR reviewed and edited the document.</p>
</sec>
<sec id="s8">
<title>Conflict of Interest</title>
<p>PR founded and BS is an employee of CytoGnomix. The company holds intellectual property related to information theory-based mutation analysis and validation.</p>
<p>EM declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgments</title>
<p>PR acknowledges support from the Natural Sciences and Engineering Research Council of Canada (NSERC) [371758-09 and RGPIN-2015-06290], Canadian Foundation for Innovation, Canada Research Chairs, and CytoGnomix. Cell lines were obtained from the NIGMS Human Genetic Cell Repository under a Materials Transfer Agreement with the Coriel Institute (Camden, NJ). An earlier version of this article is available from bioRxiv: <uri xlink:href="https://www.biorxiv.org/content/10.1101/549089v2">https://www.biorxiv.org/content/10.1101/549089v2</uri> (Mucaki and Rogan, 2019).</p>
</ack>
<sec sec-type="supplementary-material" id="s9">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2020.00109/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2020.00109/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Image_1.pdf" id="SM1" mimetype="application/pdf"/>
<supplementary-material xlink:href="Table_1.docx" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
<supplementary-material xlink:href="Table_2.docx" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ars</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Kruyer</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Morell</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Pros</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Serra</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Ravella</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2003</year>). <article-title>Recurrent mutations in the NF1 gene are common among neurofibromatosis type 1 patients</article-title>. <source>J. Med. Genet.</source> <volume>40</volume> (<issue>6</issue>), <fpage>e82</fpage>. doi: <pub-id pub-id-type="doi">10.1136/jmg.40.6.e82</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baralle</surname> <given-names>F. E.</given-names>
</name>
<name>
<surname>Giudice</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Alternative splicing as a regulator of development and tissue identity</article-title>. <source>Nat. Rev. Mol. Cell Biol.</source> <volume>18</volume>, <fpage>437</fpage>&#x2013;<lpage>451</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nrm.2017.27</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berget</surname> <given-names>S. M.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Exon recognition in vertebrate splicing</article-title>. <source>J. Biol. Chem.</source> <volume>270</volume> (<issue>6</issue>), <fpage>2411</fpage>&#x2013;<lpage>2414</lpage>. doi: <pub-id pub-id-type="doi">10.1074/jbc.270.6.2411</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bi</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Bipartite pattern discovery by entropy minimization-based multiple local alignment</article-title>. <source>Nucleic Acids Res.</source> <volume>32</volume> (<issue>17</issue>), <fpage>4979</fpage>&#x2013;<lpage>4991</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkh825</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Birney</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Kumar</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Krainer</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Analysis of the RNA-recognition motif and RS and RGG domains: conservation in metazoan pre-mRNA splicing factors</article-title>. <source>Nucleic Acids Res.</source> <volume>21</volume> (<issue>25</issue>), <fpage>5803</fpage>&#x2013;<lpage>5816</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/21.25.5803</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Black</surname> <given-names>D. L.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Mechanisms of alternative pre-messenger RNA splicing</article-title>. <source>Annu. Rev. Biochem.</source> <volume>72</volume>, <fpage>291</fpage>&#x2013;<lpage>336</lpage>. doi: <pub-id pub-id-type="doi">10.1146/annurev.biochem.72.121801.161720</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buratti</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Baralle</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Baralle</surname> <given-names>F. E.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Defective splicing, disease and therapy: searching for master checkpoints in exon definition</article-title>. <source>Nucleic Acids Res.</source> <volume>34</volume> (<issue>12</issue>), <fpage>3494</fpage>&#x2013;<lpage>3510</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkl498</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>C&#xe1;ceres</surname> <given-names>J. F.</given-names>
</name>
<name>
<surname>Stamm</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Helfman</surname> <given-names>D. M.</given-names>
</name>
<name>
<surname>Krainer</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Regulation of alternative splicing <italic>in vivo</italic> by overexpression of antagonistic splicing factors</article-title>. <source>Science</source> <volume>265</volume> (<issue>5179</issue>), <fpage>1706</fpage>&#x2013;<lpage>1709</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.8085156</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Caminsky</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Interpretation of mRNA splicing mutations in genetic disease: review of the literature and guidelines for information-theoretical analysis</article-title>. <source>F1000Res</source> <volume>3</volume>, <fpage>282</fpage>. doi: <pub-id pub-id-type="doi">10.12688/f1000research.5654.1</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Caminsky</surname> <given-names>N. G.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Perri</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Knoll</surname> <given-names>J. H.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Prioritizing variants in complete hereditary breast and ovarian cancer genes in patients lacking known BRCA mutations</article-title>. <source>Hum. Mutat.</source> <volume>37</volume> (<issue>7</issue>), <fpage>640</fpage>&#x2013;<lpage>652</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.22972</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carothers</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Urlaub</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Grunberger</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Chasin</surname> <given-names>L. A.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Splicing mutants and their second-site suppressors at the dihydrofolate reductase locus in Chinese hamster ovary cells</article-title>. <source>Mol. Cell Biol.</source> <volume>13</volume> (<issue>8</issue>), <fpage>5085</fpage>&#x2013;<lpage>5098</lpage>. doi: <pub-id pub-id-type="doi">10.1128/MCB.13.8.5085</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cartegni</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Chew</surname> <given-names>S. L.</given-names>
</name>
<name>
<surname>Krainer</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Listening to silence and understanding nonsense: exonic mutations that affect splicing</article-title>. <source>Nat. Rev. Genet.</source> <volume>3</volume> (<issue>4</issue>), <fpage>285</fpage>&#x2013;<lpage>298</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nrg775</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Caux-Moncoutier</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Pag&#xe8;s-Berhouet</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Michaux</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Asselain</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Cast&#xe9;ra</surname> <given-names>L.</given-names>
</name>
<name>
<surname>De Pauw</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>Impact of BRCA1 and BRCA2 variants on splicing: clues from an allelic imbalance study</article-title>. <source>Eur. J. Hum. Genet.</source> <volume>17</volume> (<issue>11</issue>), <fpage>1471</fpage>&#x2013;<lpage>1480</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ejhg.2009.89</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Cavalli-Sforza</surname> <given-names>L. L.</given-names>
</name>
<name>
<surname>Bodmer</surname> <given-names>W. F.</given-names>
</name>
</person-group> (<year>1971</year>). <source>The Genetics of Human Populations</source> (<publisher-loc>San Francisco</publisher-loc>: <publisher-name>W.H. Freeman and Company</publisher-name>).
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chiu</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>C. H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>C. L.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>C. C.</given-names>
</name>
<name>
<surname>Hsu</surname> <given-names>N. Y.</given-names>
</name>
<name>
<surname>Weng</surname> <given-names>J. R.</given-names>
</name>
<etal/>
</person-group>. (<year>2008</year>). <article-title>A novel single nucleotide polymorphism in XRCC4 gene is associated with gastric cancer susceptibility in Taiwan</article-title>. <source>Ann. Surg. Oncol.</source> <volume>15</volume> (<issue>2</issue>), <fpage>514</fpage>&#x2013;<lpage>518</lpage>. doi: <pub-id pub-id-type="doi">10.1245/s10434-007-9674-3</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chung</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Park</surname> <given-names>S. M.</given-names>
</name>
<name>
<surname>Chang</surname> <given-names>S. O.</given-names>
</name>
<name>
<surname>Chung</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>K. Y.</given-names>
</name>
<name>
<surname>Kim</surname> <given-names>A. R.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>A novel mutation of TMPRSS3 related to milder auditory phenotype in Korean postlingual deafness: a possible future implication for a personalized auditory rehabilitation</article-title>. <source>J. Mol. Med. (Berl)</source> <volume>92</volume> (<issue>6</issue>), <fpage>651</fpage>&#x2013;<lpage>663</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s00109-014-1128-3</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Colombo</surname> <given-names>M.</given-names>
</name>
<name>
<surname>De Vecchi</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Caleca</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Foglia</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Ripamonti</surname> <given-names>C. B.</given-names>
</name>
<name>
<surname>Ficarazzi</surname> <given-names>F.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Comparative <italic>in vitro</italic> and in silico analyses of variants in splicing regions of BRCA1 and BRCA2 genes and characterization of novel pathogenic mutations</article-title>. <source>PloS One</source> <volume>8</volume> (<issue>2</issue>), <elocation-id>e57173</elocation-id>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0057173</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cooper</surname> <given-names>D. N.</given-names>
</name>
<name>
<surname>Krawczak</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Polychronakos</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Tyler-Smith</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Kehrer-Sawatzki</surname> <given-names>H</given-names>
</name>
</person-group>. (<year>2013</year>). <article-title>Where genotype is not predictive of phenotype: towards an understanding of the molecular basis of reduced penetrance in human inherited disease</article-title>. <source>Hum Genet.</source> <volume>132</volume>(<issue>10</issue>), <fpage>1077</fpage>&#x2013;<lpage>1130</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s00439-013-1331-2</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dietz</surname> <given-names>H. C.</given-names>
</name>
<name>
<surname>Valle</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Francomano</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Kendzior</surname> <given-names>R. J.</given-names> <suffix>Jr</suffix>
</name>
<name>
<surname>Pyeritz</surname> <given-names>R. E.</given-names>
</name>
<name>
<surname>Cutting</surname> <given-names>G. R.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>The skipping of constitutive exons <italic>in vivo</italic> induced by nonsense mutations</article-title>. <source>Science</source> <volume>259</volume> (<issue>5095</issue>), <fpage>680</fpage>&#x2013;<lpage>683</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.8430317</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ding</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>L. N.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Association between single nucleotide polymorphisms of X-ray repair cross-complementing protein 4 gene and development of pancreatic cancer</article-title>. <source>Genet. Mol. Res.</source> <volume>14</volume> (<issue>3</issue>), <fpage>9626</fpage>&#x2013;<lpage>9632</lpage>. doi: <pub-id pub-id-type="doi">10.4238/2015.August.14.25</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Domenjoud</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Kister</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Gallinaro</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Jacob</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Selection between a natural and a cryptic 5' splice site: a kinetic study of the effect of upstream exon sequences</article-title>. <source>Gene Expr.</source> <volume>3</volume> (<issue>1</issue>), <fpage>83</fpage>&#x2013;<lpage>94</lpage>.
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dorman</surname> <given-names>S. N.</given-names>
</name>
<name>
<surname>Viner</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Splicing mutation analysis reveals previously unrecognized pathways in lymph node-invasive breast cancer</article-title>. <source>Sci. Rep.</source> <volume>4</volume>, <fpage>7063</fpage>. doi: <pub-id pub-id-type="doi">10.1038/srep07063</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="other">
<person-group person-group-type="author">
<name>
<surname>Eliseos</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Shirley</surname> <given-names>B. S.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Expression changes confirm predicted single nucleotide variants affecting mRNA splicing</article-title>. <source>bioRxiv</source>, <elocation-id>549089</elocation-id>. doi: <pub-id pub-id-type="doi">10.1101/549089</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Fisher</surname> <given-names>R. A.</given-names>
</name>
</person-group> (<year>1930</year>). <source>The genetical theory of natural selection</source> (<publisher-loc>Oxford New York</publisher-loc>: <publisher-name>Clarendon Press</publisher-name>).
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fu</surname> <given-names>X. D.</given-names>
</name>
<name>
<surname>Maniatis</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>The 35-kDa mammalian splicing factor SC35 mediates specific interactions between U1 and U2 small nuclear ribonucleoprotein particles at the 3' splice site</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>89</volume> (<issue>5</issue>), <fpage>1725</fpage>&#x2013;<lpage>1729</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.89.5.1725</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ge</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Concannon</surname> <given-names>P.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Molecular-genetic characterization of common, noncoding UBASH3A variants associated with type 1 diabetes</article-title>. <source>Eur. J. Hum. Genet.</source> <volume>26</volume> (<issue>7</issue>), <fpage>1060</fpage>&#x2013;<lpage>1064</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41431-018-0123-5</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ge</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Gurd</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Gaudin</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Dore</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Lepage</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Harmsen</surname> <given-names>E.</given-names>
</name>
<etal/>
</person-group>. (<year>2005</year>). <article-title>Survey of allelic expression using EST mining</article-title>. <source>Genome Res.</source> <volume>15</volume> (<issue>11</issue>), <fpage>1584</fpage>&#x2013;<lpage>1591</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.4023805</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Giardine</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Riemer</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Hardison</surname> <given-names>R. C.</given-names>
</name>
<name>
<surname>Burhans</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Elnitski</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Shah</surname> <given-names>P.</given-names>
</name>
<etal/>
</person-group>. (<year>2005</year>). <article-title>Galaxy: a platform for interactive large-scale genome analysis</article-title>. <source>Genome Res.</source> <volume>15</volume> (<issue>10</issue>), <fpage>1451</fpage>&#x2013;<lpage>1455</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.4086505</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gloss</surname> <given-names>B. S.</given-names>
</name>
<name>
<surname>Dinger</surname> <given-names>M. E.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Realizing the significance of noncoding functionality in clinical genomics</article-title>. <source>Exp. Mol. Med.</source> <volume>50</volume> (<issue>8</issue>), <fpage>97</fpage>. doi: <pub-id pub-id-type="doi">10.1038/s12276-018-0087-0</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haralambieva</surname> <given-names>I. H.</given-names>
</name>
<name>
<surname>Ovsyannikova</surname> <given-names>I. G.</given-names>
</name>
<name>
<surname>Kennedy</surname> <given-names>R. B.</given-names>
</name>
<name>
<surname>Larrabee</surname> <given-names>B. R.</given-names>
</name>
<name>
<surname>Zimmermann</surname> <given-names>M. T.</given-names>
</name>
<name>
<surname>Grill</surname> <given-names>D. E.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Genome-wide associations of CD46 and IFI44L genetic variants with neutralizing antibody response to measles vaccine</article-title>. <source>Hum. Genet.</source> <volume>136</volume> (<issue>4</issue>), <fpage>421</fpage>&#x2013;<lpage>435</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s00439-017-1768-9</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hiller</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Huse</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Szafranski</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Jahn</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Hampe</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Schreiber</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2006</year>). <article-title>Single-nucleotide polymorphisms in NAGNAG acceptors are highly predictive for variations of alternative splicing</article-title>. <source>Am. J. Hum. Genet.</source> <volume>78</volume> (<issue>2</issue>), <fpage>291</fpage>&#x2013;<lpage>302</lpage>. doi: <pub-id pub-id-type="doi">10.1086/500151</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hobson</surname> <given-names>G. M.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Sperle</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Sistermans</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Garbern</surname> <given-names>J. Y.</given-names>
</name>
<etal/>
</person-group>. (<year>2006</year>). <article-title>Splice-site contribution in alternative splicing of PLP1 and DM20: molecular studies in oligodendrocytes</article-title>. <source>Hum. Mutat.</source> <volume>27</volume> (<issue>1</issue>), <fpage>69</fpage>&#x2013;<lpage>77</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.20276</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname> <given-names>R. S.</given-names>
</name>
<name>
<surname>Duan</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Bleibel</surname> <given-names>W. K.</given-names>
</name>
<name>
<surname>Kistner</surname> <given-names>E. O.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Clark</surname> <given-names>T. A.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>A genome-wide approach to identify genetic variants that contribute to etoposide-induced cytotoxicity</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>104</volume> (<issue>23</issue>), <fpage>9758</fpage>&#x2013;<lpage>9763</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.0703736104</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hull</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Campino</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Rowlands</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Chan</surname> <given-names>M. S.</given-names>
</name>
<name>
<surname>Copley</surname> <given-names>R. R.</given-names>
</name>
<name>
<surname>Taylor</surname> <given-names>M. S.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>Identification of common genetic variation that modulates alternative splicing</article-title>. <source>PloS Genet.</source> <volume>3</volume> (<issue>6</issue>), <fpage>e99</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pgen.0030099</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Janos&#xed;kov&#xe1;</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zavad&#xe1;kov&#xe1;</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Kozich</surname> <given-names>V.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Single-nucleotide polymorphisms in genes relating to homocysteine metabolism: how applicable are public SNP databases to a typical European population</article-title>? <source>Eur. J. Hum. Genet.</source> <volume>13</volume> (<issue>1</issue>), <fpage>86</fpage>&#x2013;<lpage>95</lpage>. doi: <pub-id pub-id-type="doi">10.1038/sj.ejhg.5201282</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kannabiran</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Olmos</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Basti</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Rao</surname> <given-names>G. N.</given-names>
</name>
<name>
<surname>Kaiser-Kupfer</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>1998</year>). <article-title>Autosomal dominant zonular cataract with sutural opacities is associated with a splice mutation in the betaA3/A1-crystallin gene</article-title>. <source>Mol. Vis.</source> <volume>4</volume>, <fpage>21</fpage>.
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kent</surname> <given-names>W. J.</given-names>
</name>
<name>
<surname>Sugnet</surname> <given-names>C. W.</given-names>
</name>
<name>
<surname>Furey</surname> <given-names>T. S.</given-names>
</name>
<name>
<surname>Roskin</surname> <given-names>K. M.</given-names>
</name>
<name>
<surname>Pringle</surname> <given-names>T. H.</given-names>
</name>
<name>
<surname>Zahler</surname> <given-names>A. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2002</year>). <article-title>The human genome browser at UCSC</article-title>. <source>Genome Res.</source> <volume>12</volume>(<issue>6</issue>), <fpage>996</fpage>&#x2013;<lpage>1006</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.229102</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Khan</surname> <given-names>S. G.</given-names>
</name>
<name>
<surname>Metin</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Gozukara</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Inui</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Shahlavi</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Muniz-Medina</surname> <given-names>V.</given-names>
</name>
<etal/>
</person-group>. (<year>2004</year>). <article-title>Two essential splice lariat branchpoint sequences in one intron in a xeroderma pigmentosum DNA repair gene: mutations result in reduced XPC mRNA levels that correlate with cancer risk</article-title>. <source>Hum. Mol. Genet.</source> <volume>13</volume> (<issue>3</issue>), <fpage>343</fpage>&#x2013;<lpage>352</lpage>. doi: <pub-id pub-id-type="doi">10.1093/hmg/ddh026</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Bang</surname> <given-names>S. Y.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>H. S.</given-names>
</name>
<name>
<surname>Cho</surname> <given-names>S. K.</given-names>
</name>
<name>
<surname>Choi</surname> <given-names>C. B.</given-names>
</name>
<name>
<surname>Sung</surname> <given-names>Y. K.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>High-density genotyping of immune loci in Koreans and Europeans identifies eight new rheumatoid arthritis risk loci</article-title>. <source>Ann. Rheumatol. Dis.</source> <volume>74</volume> (<issue>3</issue>), <fpage>e13</fpage>. doi: <pub-id pub-id-type="doi">10.1136/annrheumdis-2013-204749</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krawczak</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Reiss</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Cooper</surname> <given-names>D. N.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>The mutational spectrum of single base-pair substitutions in mRNA splice junctions of human genes: causes and consequences</article-title>. <source>Hum. Genet.</source> <volume>90</volume> (<issue>1-2</issue>), <fpage>41</fpage>&#x2013;<lpage>54</lpage>. doi: <pub-id pub-id-type="doi">10.1007/BF00210743</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>L&#xf3;pez-Bigas</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Audit</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Ouzounis</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Parra</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Guig&#xf3;</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Are splicing mutations the most frequent cause of hereditary disease</article-title>? <source>FEBS Lett.</source> <volume>579</volume> (<issue>9</issue>), <fpage>1900</fpage>&#x2013;<lpage>1903</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.febslet.2005.02.047</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lacroix</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Lacaze-Buzy</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Furio</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Tron</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Valari</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Van der Wier</surname> <given-names>G.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Clinical expression and new SPINK5 splicing defects in Netherton syndrome: unmasking a frequent founder synonymous mutation and unconventional intronic mutations</article-title>. <source>J. Invest. Dermatol.</source> <volume>132</volume> (<issue>3 Pt 1</issue>), <fpage>575</fpage>&#x2013;<lpage>582</lpage>. doi: <pub-id pub-id-type="doi">10.1038/jid.2011.366</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lamba</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Lamba</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Yasuda</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Strom</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Davila</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Hancock</surname> <given-names>M. L.</given-names>
</name>
<etal/>
</person-group>. (<year>2003</year>). <article-title>Hepatic CYP2B6 expression: gender and ethnic differences and relationship to CYP2B6 genotype and CAR (constitutive androstane receptor) expression</article-title>. <source>J. Pharmacol. Exp. Ther.</source> <volume>307</volume> (<issue>3</issue>), <fpage>906</fpage>&#x2013;<lpage>922</lpage>. doi: <pub-id pub-id-type="doi">10.1124/jpet.103.054866</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Baek</surname> <given-names>J. I.</given-names>
</name>
<name>
<surname>Choi</surname> <given-names>J. Y.</given-names>
</name>
<name>
<surname>Kim</surname> <given-names>U. K.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>S. H.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>K. Y.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Genetic analysis of TMPRSS3 gene in the Korean population with autosomal recessive nonsyndromic hearing loss</article-title>. <source>Gene</source> <volume>532</volume> (<issue>2</issue>), <fpage>276</fpage>&#x2013;<lpage>280</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.gene.2013.07.108</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Kato</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Shiue</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Shively</surname> <given-names>J. E.</given-names>
</name>
<name>
<surname>Ares</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>R. J.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Cell type and culture condition-dependent alternative splicing in human breast cancer cells revealed by splicing-sensitive microarrays</article-title>. <source>Cancer Res.</source> <volume>66</volume> (<issue>4</issue>), <fpage>1990</fpage>&#x2013;<lpage>1999</lpage>. doi: <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-05-2593</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>C. C.</given-names>
</name>
</person-group> (<year>1967</year>). <article-title>Genetic equilibrium under selection</article-title>. <source>Biometrics</source> <volume>23</volume>, <fpage>397</fpage>&#x2013;<lpage>484</lpage>. doi: <pub-id pub-id-type="doi">10.2307/2528009</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Cui</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Cao</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Evaluation of the association of UBASH3A and SYNGR1 with rheumatoid arthritis and disease activity and severity in Han Chinese</article-title>. <source>Oncotarget</source> <volume>8</volume> (<issue>61</issue>), <fpage>103385</fpage>&#x2013;<lpage>103392</lpage>. doi: <pub-id pub-id-type="doi">10.18632/oncotarget.21875</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Transcription factor binding site clusters identify target genes with similar tissue-wide expression and buffer against mutations [version 2; peer review: 2 approved]</article-title>. <source>F1000Research</source> <volume>7</volume>, <fpage>1933</fpage>. doi: <pub-id pub-id-type="doi">10.12688/f1000research.17363.2</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Discovery and validation of information theory-based transcription factor and cofactor binding site motifs</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume> (<issue>5</issue>), <fpage>e27</fpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkw1036</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Margulis</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Wood</surname> <given-names>C. G.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Genetic susceptibility to renal cell carcinoma: the role of DNA double-strand break repair pathway</article-title>. <source>Cancer Epidemiol. Biomarkers Prev.</source> <volume>17</volume> (<issue>9</issue>), <fpage>2366</fpage>&#x2013;<lpage>2373</lpage>. doi: <pub-id pub-id-type="doi">10.1158/1055-9965.EPI-08-0259</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martinez-Contreras</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Cloutier</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Shkreta</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Fisette</surname> <given-names>J. F.</given-names>
</name>
<name>
<surname>Revil</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Chabot</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>hnRNP proteins and splicing control</article-title>. <source>Adv. Exp. Med. Biol.</source> <volume>623</volume>, <fpage>123</fpage>&#x2013;<lpage>147</lpage>. doi: <pub-id pub-id-type="doi">10.1007/978-0-387-77374-2_8</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mayeda</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Krainer</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Regulation of alternative pre-mRNA splicing by hnRNP A1 and splicing factor SF2</article-title>. <source>Cell</source> <volume>68</volume> (<issue>2</issue>), <fpage>365</fpage>&#x2013;<lpage>375</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0092-8674(92)90477-T</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mok</surname> <given-names>K. Y.</given-names>
</name>
<name>
<surname>Jones</surname> <given-names>E. L.</given-names>
</name>
<name>
<surname>Hanney</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Harold</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Sims</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Williams</surname> <given-names>J.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>Polymorphisms in BACE2 may affect the age of onset Alzheimer's dementia in Down syndrome</article-title>. <source>Neurobiol. Aging</source> <volume>35</volume> (<issue>6</issue>), <page-range>1513.e1&#x2013;5</page-range>. doi: <pub-id pub-id-type="doi">10.1016/j.neurobiolaging.2013.12.022</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moore</surname> <given-names>M. J.</given-names>
</name>
<name>
<surname>Sharp</surname> <given-names>P. A.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Evidence for two active sites in the spliceosome provided by stereochemistry of pre-mRNA splicing</article-title>. <source>Nature</source> <volume>365</volume> (<issue>6444</issue>), <fpage>364</fpage>&#x2013;<lpage>368</lpage>. doi: <pub-id pub-id-type="doi">10.1038/365364a0</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Shirley</surname> <given-names>B. C.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Prediction of mutant mRNA splice isoforms by information theory-based exon definition</article-title>. <source>Hum. Mutat.</source> <volume>34</volume> (<issue>4</issue>), <fpage>557</fpage>&#x2013;<lpage>565</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.22277</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Caminsky</surname> <given-names>N. G.</given-names>
</name>
<name>
<surname>Perri</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Laederach</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Halvorsen</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer</article-title>. <source>BMC Med. Genomics</source> <volume>9</volume>, <fpage>19</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12920-016-0178-5</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nalla</surname> <given-names>V. K.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Automated splicing mutation analysis by information theory</article-title>. <source>Hum. Mutat.</source> <volume>25</volume> (<issue>4</issue>), <fpage>334</fpage>&#x2013;<lpage>342</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.20151</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nelson</surname> <given-names>K. K.</given-names>
</name>
<name>
<surname>Green</surname> <given-names>M. R.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Mechanism for cryptic splice site activation during pre-mRNA splicing</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>87</volume> (<issue>16</issue>), <fpage>6253</fpage>&#x2013;<lpage>6257</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.87.16.6253</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nembaware</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Lupindo</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Schouest</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Spillane</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Scheffler</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Seoighe</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Genome-wide survey of allele-specific splicing in humans</article-title>. <source>BMC Genomics</source> <volume>9</volume>, <fpage>265</fpage>. doi: <pub-id pub-id-type="doi">10.1186/1471-2164-9-265</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nuzhdin</surname> <given-names>S. V.</given-names>
</name>
<name>
<surname>Wayne</surname> <given-names>M. L.</given-names>
</name>
<name>
<surname>Harmon</surname> <given-names>K. L.</given-names>
</name>
<name>
<surname>McIntyre</surname> <given-names>L. M.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Common pattern of evolution of gene expression level and protein sequence in Drosophila</article-title>. <source>Mol. Biol. Evol.</source> <volume>21</volume> (<issue>7</issue>), <fpage>1308</fpage>&#x2013;<lpage>1317</lpage>. doi: <pub-id pub-id-type="doi">10.1093/molbev/msh128</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Olsen</surname> <given-names>R. K.</given-names>
</name> <name>
<surname>Br&#xf8;ner</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Sabaratnam</surname> <given-names>R.</given-names>
</name> <name>
<surname>Doktor</surname> <given-names>T. K.</given-names>
</name> <name>
<surname>Andersen</surname> <given-names>H. S.</given-names>
</name>
<name>
<surname>Bruun</surname> <given-names>G. H.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>The ETFDH c.158A&gt;G variation disrupts the balanced interplay of ESE- and ESS-binding proteins thereby causing missplicing and multiple Acyl-CoA dehydrogenation deficiency</article-title>. <source>Hum. Mutat.</source> <volume>35</volume> (<issue>1</issue>), <fpage>86</fpage>&#x2013;<lpage>95</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.22455 </pub-id>
</citation>
</ref> <ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pan</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Shai</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>L. J.</given-names>
</name>
<name>
<surname>Frey</surname> <given-names>B. J.</given-names>
</name>
<name>
<surname>Blencowe</surname> <given-names>B. J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing</article-title>. <source>Nat. Genet.</source> <volume>40</volume>, <fpage>1413</fpage>&#x2013;<lpage>1415</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ng.259</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Park</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Pan</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Xing</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>The expanding landscape of alternative splicing variation in human populations</article-title>. <source>Am. J. Hum. Genet.</source> <volume>102</volume> (<issue>1</issue>), <fpage>11</fpage>&#x2013;<lpage>26</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ajhg.2017.11.002</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Parker</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Siliciano</surname> <given-names>P. G.</given-names>
</name>
<name>
<surname>Guthrie</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>Recognition of the TACTAAC box during mRNA splicing in yeast involves base pairing to the U2-like snRNA</article-title>. <source>Cell</source> <volume>49</volume> (<issue>2</issue>), <fpage>229</fpage>&#x2013;<lpage>239</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0092-8674(87)90564-2</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peterlongo</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Catucci</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Colombo</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Caleca</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Bogliolo</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>FANCM c.5791C &gt; T nonsense mutation (rs144567652) induces exon skipping, affects DNA repair activity and is a familial breast cancer risk factor</article-title>. <source>Hum. Mol. Genet.</source> <volume>24</volume> (<issue>18</issue>), <fpage>5345</fpage>&#x2013;<lpage>5355</lpage>. doi: <pub-id pub-id-type="doi">10.1093/hmg/ddv251</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pfaffl</surname> <given-names>M. W.</given-names>
</name>
<name>
<surname>Horgan</surname> <given-names>G. W.</given-names>
</name>
<name>
<surname>Dempfle</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Relative expression software tool (REST) for groupwise comparison and statistical analysis of relative expression results in real-time PCR</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume> (<issue>9</issue>), <fpage>e36</fpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/30.9.e36</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Richard</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Beckmann</surname> <given-names>J. S.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>How neutral are synonymous codon mutations</article-title>? <source>Nat. Genet.</source> <volume>10</volume> (<issue>3</issue>), <fpage>259</fpage>. doi: <pub-id pub-id-type="doi">10.1038/ng0795-259</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robberson</surname> <given-names>B. L.</given-names>
</name>
<name>
<surname>Cote</surname> <given-names>G. J.</given-names>
</name>
<name>
<surname>Berget</surname> <given-names>S. M.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Exon definition may facilitate splice site selection in RNAs with multiple exons</article-title>. <source>Mol. Cell Biol.</source> <volume>10</volume> (<issue>1</issue>), <fpage>84</fpage>&#x2013;<lpage>94</lpage>. doi: <pub-id pub-id-type="doi">10.1128/MCB.10.1.84</pub-id>
</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robinson</surname> <given-names>J. T.</given-names>
</name>
<name>
<surname>Thorvaldsd&#xf3;ttir</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wenger</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Zehir</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Mesirov</surname> <given-names>J. P.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Variant Review with the Integrative Genomics Viewer</article-title>. <source>Cancer Res.</source> <volume>77</volume>(<issue>21</issue>), <fpage>e31</fpage>&#x2013;<lpage>e34</lpage>. doi: <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-17-0337</pub-id>
</citation>
</ref>
<ref id="B70">
<citation citation-type="other">
<person-group person-group-type="author">
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Population fitness and genetic load of single nucleotide polymorphisms affecting mRNA splicing</article-title>. arXiv:1107.0716 [q-bio.GN].
</citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Schneider</surname> <given-names>T. D.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites</article-title>. <source>Hum. Mutat.</source> <volume>6</volume> (<issue>1</issue>), <fpage>74</fpage>&#x2013;<lpage>76</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.1380060114</pub-id>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Faux</surname> <given-names>B. M.</given-names>
</name>
<name>
<surname>Schneider</surname> <given-names>T. D.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Information analysis of human splice site mutations</article-title>. <source>Hum. Mutat.</source> <volume>12</volume>, <fpage>153</fpage>&#x2013;<lpage>171</lpage>. doi: <pub-id pub-id-type="doi">10.1002/(SICI)1098-1004(1998)12:3&lt;153::AID-HUMU3&gt;3.0.CO;2-I</pub-id>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Svojanovsky</surname> <given-names>S. R.</given-names>
</name>
<name>
<surname>Leeder</surname> <given-names>J. S.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Information theory-based analysis of CYP219, CYP2D6 and CYP3A5 splicing mutations</article-title>. <source>Pharmacogenetics</source> <volume>13</volume> (<issue>4</issue>), <fpage>207</fpage>&#x2013;<lpage>218</lpage>. doi: <pub-id pub-id-type="doi">10.1097/00008571-200304000-00005</pub-id>
</citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>S&#xe9;raphin</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Kretzner</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Rosbash</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>1988</year>). <article-title>A U1 snRNA:pre-mRNA base pairing interaction is required early in yeast spliceosome assembly but does not uniquely define the 5' cleavage site</article-title>. <source>EMBO J.</source> <volume>7</volume> (<issue>8</issue>), <fpage>2533</fpage>&#x2013;<lpage>2538</lpage>. doi: <pub-id pub-id-type="doi">10.1002/j.1460-2075.1988.tb03101.x</pub-id>
</citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schneider</surname> <given-names>T. D.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Sequence walkers: a graphical method to display how binding proteins interact with DNA or RNA sequences</article-title>. <source>Nucleic Acids Res.</source> <volume>25</volume>, <fpage>4408</fpage>&#x2013;<lpage>4415</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/25.21.4408</pub-id>
</citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shirley</surname> <given-names>B. C.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Whitehead</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Costea</surname> <given-names>P. I.</given-names>
</name>
<name>
<surname>Akan</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Interpretation, stratification and evidence for sequence variants affecting mRNA splicing in complete human genome sequences</article-title>. <source>Genomics Proteomics Bioinf.</source> <volume>11</volume> (<issue>2</issue>), <fpage>77</fpage>&#x2013;<lpage>85</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.gpb.2013.01.008</pub-id>
</citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shirley</surname> <given-names>B. C.</given-names>
</name>
<name>
<surname>Mucaki</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Pan-cancer repository of validated natural and cryptic mRNA splicing mutations [version 3; peer review: 1 approved, 2 approved with reservations]</article-title>. <source>F1000Research</source> <volume>7</volume>, <fpage>1908</fpage>. doi: <pub-id pub-id-type="doi">10.12688/f1000research.17204.3</pub-id>
</citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Soemedi</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Cygan</surname> <given-names>K. J.</given-names>
</name>
<name>
<surname>Rhine</surname> <given-names>C. L.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bulacan</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>J.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Pathogenic variants that alter protein code often disrupt splicing</article-title>. <source>Nat. Genet.</source> <volume>49</volume> (<issue>6</issue>), <fpage>848</fpage>&#x2013;<lpage>855</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ng.3837</pub-id>
</citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Spielmann</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Mundlos</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Looking beyond the genes: the role of non-coding variants in human disease</article-title>. <source>Hum. Mol. Genet.</source> <volume>25</volume> (<issue>R2</issue>), <fpage>R157</fpage>&#x2013;<lpage>R165</lpage>. doi: <pub-id pub-id-type="doi">10.1093/hmg/ddw205</pub-id>
</citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Staknis</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Reed</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>SR proteins promote the first specific recognition of Pre-mRNA and are present together with the U1 small nuclear ribonucleoprotein particle in a general splicing enhancer complex</article-title>. <source>Mol. Cell Biol.</source> <volume>14</volume> (<issue>11</issue>), <fpage>7670</fpage>&#x2013;<lpage>7682</lpage>. doi: <pub-id pub-id-type="doi">10.1128/MCB.14.11.7670</pub-id>
</citation>
</ref>
<ref id="B81">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Suizu</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Hiramuki</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Okumura</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Matsuda</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Okumura</surname> <given-names>A. J.</given-names>
</name>
<name>
<surname>Hirata</surname> <given-names>N.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>The E3 ligase TTC3 facilitates ubiquitination and degradation of phosphorylated Akt</article-title>. <source>Dev. Cell.</source> <volume>17</volume> (<issue>6</issue>), <fpage>800</fpage>&#x2013;<lpage>810</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.devcel.2009.09.007</pub-id>
</citation>
</ref>
<ref id="B82">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Susani</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Pangrazio</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sobacchi</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Taranta</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Mortier</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Savarirayan</surname> <given-names>R.</given-names>
</name>
<etal/>
</person-group>. (<year>2004</year>). <article-title>TCIRG1-dependent recessive osteopetrosis: mutation analysis, functional identification of the splicing defects, and <italic>in vitro</italic> rescue by U1 snRNA</article-title>. <source>Hum. Mutat.</source> <volume>24</volume> (<issue>3</issue>), <fpage>225</fpage>&#x2013;<lpage>235</lpage>. doi: <pub-id pub-id-type="doi">10.1002/humu.20076</pub-id>
</citation>
</ref>
<ref id="B83">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Svojanovsky</surname> <given-names>S. R.</given-names>
</name>
<name>
<surname>Schneider</surname> <given-names>T. D.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Redundant designations of BRCA1 intron 11 splicing mutation; c. 4216-2A &gt; G; IVS11-2A &gt; G; L78833, 37698, A &gt; G</article-title>. <source>Hum. Mutat.</source> <volume>16</volume> (<issue>3</issue>), <fpage>264</fpage>. doi: <pub-id pub-id-type="doi">10.1002/1098-1004(200009)16:3&lt;264::AID-HUMU9&gt;3.0.CO;2-1</pub-id>
</citation>
</ref>
<ref id="B84">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szafranski</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Fritsch</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Schumann</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Siebel</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Sinha</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Hampe</surname> <given-names>J.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>Physiological state co-regulates thousands of mammalian mRNA splicing events at tandem splice sites and alternative exons</article-title>. <source>Nucleic Acids Res.</source> <volume>42</volume> (<issue>14</issue>), <fpage>8895</fpage>&#x2013;<lpage>8904</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gku532</pub-id>
</citation>
</ref>
<ref id="B85">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Talerico</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Berget</surname> <given-names>S. M.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Effect of 5' splice site mutations on splicing of the preceding intron</article-title>. <source>Mol. Cell Biol.</source> <volume>10</volume> (<issue>12</issue>), <fpage>6299</fpage>&#x2013;<lpage>6305</lpage>. doi: <pub-id pub-id-type="doi">10.1128/MCB.10.12.6299</pub-id>
</citation>
</ref>
<ref id="B86">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Teraoka</surname> <given-names>S. N.</given-names>
</name>
<name>
<surname>Telatar</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Becker-Catania</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Oneng&#xfc;t</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Tolun</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>1999</year>). <article-title>Splicing defects in the ataxia-telangiectasia gene, ATM: underlying mutations and consequences</article-title>. <source>Am. J. Hum. Genet.</source> <volume>64</volume> (<issue>6</issue>), <fpage>1617</fpage>&#x2013;<lpage>1631</lpage>. doi: <pub-id pub-id-type="doi">10.1086/302418</pub-id>
</citation>
</ref>
<ref id="B87">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thorsen</surname> <given-names>K.</given-names>
</name>
<name>
<surname>S&#xf8;rensen</surname> <given-names>K. D.</given-names>
</name>
<name>
<surname>Brems-Eskildsen</surname> <given-names>A. S.</given-names>
</name>
<name>
<surname>Modin</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Gaustadnes</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Hein</surname> <given-names>A. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2008</year>). <article-title>Alternative splicing in colon, bladder, and prostate cancer identified by exon array analysis</article-title>. <source>Mol. Cell Proteomics</source> <volume>7</volume> (<issue>7</issue>), <fpage>1214</fpage>&#x2013;<lpage>1224</lpage>. doi: <pub-id pub-id-type="doi">10.1074/mcp.M700590-MCP200</pub-id>
</citation>
</ref>
<ref id="B88">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tsukahara</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Hattori</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Muraki</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Sakaki</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>Identification and cloning of a novel cDNA belonging to tetratricopeptide repeat gene family from Down syndrome-critical region 21q22.2</article-title>. <source>J. Biochem.</source> <volume>120</volume> (<issue>4</issue>), <fpage>820</fpage>&#x2013;<lpage>827</lpage>. doi: <pub-id pub-id-type="doi">10.1093/oxfordjournals.jbchem.a021485</pub-id>
</citation>
</ref>
<ref id="B89">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ueffing</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Singh</surname> <given-names>K. K.</given-names>
</name>
<name>
<surname>Christians</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Thorns</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Feller</surname> <given-names>A. C.</given-names>
</name>
<name>
<surname>Nagl</surname> <given-names>F.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>A single nucleotide polymorphism determines protein isoform production of the human c-FLIP protein</article-title>. <source>Blood</source> <volume>114</volume> (<issue>3</issue>), <fpage>572</fpage>&#x2013;<lpage>579</lpage>. doi: <pub-id pub-id-type="doi">10.1182/blood-2009-02-204230</pub-id>
</citation>
</ref>
<ref id="B90">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vemula</surname> <given-names>S. R.</given-names>
</name>
<name>
<surname>Xiao</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Bastian</surname> <given-names>R. W.</given-names>
</name>
<name>
<surname>Perlmutter</surname> <given-names>J. S.</given-names>
</name>
<name>
<surname>Racette</surname> <given-names>B. A.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>A rare sequence variant in intron 1 of THAP1 is associated with primary dystonia</article-title>. <source>Mol. Genet. Genom. Med.</source> <volume>2</volume> (<issue>3</issue>), <fpage>261</fpage>&#x2013;<lpage>272</lpage>. doi: <pub-id pub-id-type="doi">10.1002/mgg3.67</pub-id>
</citation>
</ref>
<ref id="B91">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Viner</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Dorman</surname> <given-names>S. N.</given-names>
</name>
<name>
<surname>Shirley</surname> <given-names>B. C.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Validation of predicted mRNA splicing mutations using high-throughput transcriptome data [version 2; referees: 4 approved]</article-title>. <source>F1000Res</source> <volume>3</volume>, <fpage>8</fpage>. doi: <pub-id pub-id-type="doi">10.12688/f1000research.3-8.v2</pub-id>
</citation>
</ref>
<ref id="B92">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vockley</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Anderson</surname> <given-names>B. D.</given-names>
</name>
<name>
<surname>Willard</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Seelan</surname> <given-names>R. S.</given-names>
</name>
<name>
<surname>Smith</surname> <given-names>D. I.</given-names>
</name>
<etal/>
</person-group>. (<year>2000</year>). <article-title>Exon skipping in IVD RNA processing in isovaleric acidemia caused by point mutations in the coding region of the IVD gene</article-title>. <source>Am. J. Hum. Genet.</source> <volume>66</volume> (<issue>2</issue>), <fpage>356</fpage>&#x2013;<lpage>367</lpage>. doi: <pub-id pub-id-type="doi">10.1086/302751</pub-id>
</citation>
</ref>
<ref id="B93">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>von Kodolitsch</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Pyeritz</surname> <given-names>R. E.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Splice-site mutations in atherosclerosis candidate genes: relating individual information to phenotype</article-title>. <source>Circulation</source> <volume>100</volume> (<issue>7</issue>), <fpage>693</fpage>&#x2013;<lpage>699</lpage>. doi: <pub-id pub-id-type="doi">10.1161/01.CIR.100.7.693</pub-id>
</citation>
</ref>
<ref id="B94">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>von Kodolitsch</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Berger</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Predicting severity of haemophilia A and B splicing mutations by information analysis</article-title>. <source>Haemophilia</source> <volume>12</volume> (<issue>3</issue>), <fpage>258</fpage>&#x2013;<lpage>262</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2516.2006.01216.x</pub-id>
</citation>
</ref>
<ref id="B95">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vyhlidal</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Rogan</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Leeder</surname> <given-names>J. S.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Development and refinement of pregnane X receptor (PXR) DNA binding site model using information theory: insights into PXR-mediated gene regulation</article-title>. <source>J. Biol. Chem.</source> <volume>279</volume> (<issue>45</issue>), <fpage>46779</fpage>&#x2013;<lpage>46786</lpage>. doi: <pub-id pub-id-type="doi">10.1074/jbc.M408395200</pub-id>
</citation>
</ref>
<ref id="B96">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>E. T.</given-names>
</name>
<name>
<surname>Sandberg</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Luo</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Khrebtukova</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Mayr</surname> <given-names>C.</given-names>
</name>
<etal/>
</person-group>. (<year>2008</year>). <article-title>Alternative isoform regulation in human tissue transcriptomes</article-title>. <source>Nature</source> <volume>456</volume> (<issue>7221</issue>), <fpage>470</fpage>&#x2013;<lpage>476</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nature07509</pub-id>
</citation>
</ref>
<ref id="B97">
<citation citation-type="other">
<person-group person-group-type="author">
<name>
<surname>Workman</surname> <given-names>R. E.</given-names>
</name>
<name>
<surname>Tang</surname> <given-names>A. D.</given-names>
</name>
<name>
<surname>Tang</surname> <given-names>P. S.</given-names>
</name>
<name>
<surname>Jain</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Tyson</surname> <given-names>J. R.</given-names>
</name>
<name>
<surname>Razaghi</surname> <given-names>R.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Nanopore native RNA sequencing of a human poly(A) transcriptome</article-title>. <source>Nat. Methods</source>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41592-019-0617-2</pub-id>
</citation>
</ref>
<ref id="B98">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Manley</surname> <given-names>J. L.</given-names>
</name>
</person-group> (<year>1989</year>). <article-title>Mammalian pre-mRNA branch site selection by U2 snRNP involves base pairing</article-title>. <source>Genes Dev.</source> <volume>3</volume> (<issue>10</issue>), <fpage>1553</fpage>&#x2013;<lpage>1561</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gad.3.10.1553</pub-id>
</citation>
</ref>
<ref id="B99">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Gu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Grossman</surname> <given-names>H. B.</given-names>
</name>
<name>
<surname>Amos</surname> <given-names>C. I.</given-names>
</name>
<name>
<surname>Etzel</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2006</year>). <article-title>Bladder cancer predisposition: a multigenic approach to DNA-repair and cell-cycle-control genes</article-title>. <source>Am. J. Hum. Genet.</source> <volume>78</volume> (<issue>3</issue>), <fpage>464</fpage>&#x2013;<lpage>479</lpage>. doi: <pub-id pub-id-type="doi">10.1086/500848</pub-id>
</citation>
</ref>
<ref id="B100">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yates</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Okoniewski</surname> <given-names>M. J.</given-names>
</name>
<name>
<surname>Miller</surname> <given-names>C. J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>X:Map: annotation and visualization of genome structure for Affymetrix exon array analysis</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume> (<issue>Database issue</issue>), <fpage>D780</fpage>&#x2013;<lpage>D786</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkm779</pub-id>
</citation>
</ref>
<ref id="B101">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Park</surname> <given-names>J. W.</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Xing</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2013</year>a). <article-title>GLiMMPS: robust statistical model for regulatory variation of alternative splicing using RNA-seq data</article-title>. <source>Genome Biol.</source> <volume>14</volume> (<issue>7</issue>), <fpage>R74</fpage>. doi: <pub-id pub-id-type="doi">10.1186/gb-2013-14-7-r74</pub-id>
</citation>
</ref>
<ref id="B102">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Zou</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Kang</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Jiang</surname> <given-names>T.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>b). <article-title>Genetic polymorphisms of DNA double-strand break repair pathway genes and glioma susceptibility</article-title>. <source>BMC Cancer</source> <volume>13</volume>, <fpage>234</fpage>. doi: <pub-id pub-id-type="doi">10.1186/1471-2407-13-234</pub-id>
</citation>
</ref>
<ref id="B103">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhuang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Weiner</surname> <given-names>A. M.</given-names>
</name>
</person-group> (<year>1986</year>). <article-title>A compensatory base change in U1 snRNA suppresses a 5' splice site mutation</article-title>. <source>Cell</source> <volume>46</volume> (<issue>6</issue>), <fpage>827</fpage>&#x2013;<lpage>835</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0092-8674(86)90064-4</pub-id>
</citation>
</ref>
<ref id="B104">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zuo</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Maniatis</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>The splicing factor U2AF35 mediates critical protein-protein interactions in constitutive and enhancer-dependent splicing</article-title>. <source>Genes Dev.</source> <volume>10</volume> (<issue>11</issue>), <fpage>1356</fpage>&#x2013;<lpage>1368</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gad.10.11.1356</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>