<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioeng. Biotechnol.</journal-id>
<journal-title>Frontiers in Bioengineering and Biotechnology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioeng. Biotechnol.</abbrev-journal-title>
<issn pub-type="epub">2296-4185</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fbioe.2020.00268</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioengineering and Biotechnology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Identification of Pan-Cancer Prognostic Biomarkers Through Integration of Multi-Omics Data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zhao</surname> <given-names>Ning</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/581315/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Guo</surname> <given-names>Maozu</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname> <given-names>Kuanquan</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/643170/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhang</surname> <given-names>Chunlong</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/879583/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Liu</surname> <given-names>Xiaoyan</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/936833/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Life Sciences and Technology, Harbin Institute of Technology</institution>, <addr-line>Harbin</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing University of Civil Engineering and Architecture</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<aff id="aff4"><sup>4</sup><institution>School of Computer Science and Technology, Harbin Institute of Technology</institution>, <addr-line>Harbin</addr-line>, <country>China</country></aff>
<aff id="aff5"><sup>5</sup><institution>College of Bioinformatics Science and Technology, Harbin Medical University</institution>, <addr-line>Harbin</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Meng Zhou, Wenzhou Medical University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Richard R. Rodrigues, Oregon State University, United States; Alfred Grant Schissler, University of Nevada, Reno, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Maozu Guo, <email>guomaozu@bucea.edu.cn</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Bioengineering and Biotechnology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>02</day>
<month>04</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>8</volume>
<elocation-id>268</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>01</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>13</day>
<month>03</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2020 Zhao, Guo, Wang, Zhang and Liu.</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>Zhao, Guo, Wang, Zhang and Liu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Prognostic biomarkers dedicating to treat cancer are very difficult to identify. Although high-throughput sequencing technology allows us to mine prognostic biomarkers much deeper by analyzing omics data, there is lack of effective methods to comprehensively utilize multi-omics data. In this work, we integrated multi-omics data [DNA methylation (DM), gene expression (GE), somatic copy number alternation, and microRNA expression (ME)] and proposed a method to rank genes by desiring a &#x201C;Score.&#x201D; Applying the method, cancer-specific prognostic biomarkers for 13 cancers were obtained. The prognostic powers of the biomarkers were further assessed by C-indexes (ranged from 0.76 to 0.96). Moreover, by comparing the 13 survival-related gene lists, seven genes (<italic>SLK</italic>, <italic>API5</italic>, <italic>BTBD2</italic>, <italic>PTAR1</italic>, <italic>VPS37A</italic>, <italic>EIF2B1</italic>, and <italic>ZRANB1</italic>) were found to be associated with prognosis in a variety of cancers. In particular, <italic>SLK</italic> was more likely to be cancer-related due to its high missense mutation rate and associated with cell adhesion. Furthermore, after network analysis, <italic>EPRS</italic>, <italic>HNRNPA2B1</italic>, <italic>BPTF</italic>, <italic>LRRK1</italic>, and <italic>PUM1</italic> were demonstrated to have a broad correlation with cancers. In summary, our method has a better integration of multi-omics data that can be extended to the researches of other diseases. And the prognostic biomarkers had a better prognostic power than previous methods. Our results could provide a reference for translational medicine researchers and clinicians.</p>
</abstract>
<kwd-group>
<kwd>multi-omics</kwd>
<kwd>pan-cancer</kwd>
<kwd>survival</kwd>
<kwd>biomarker</kwd>
<kwd>prognosis</kwd>
</kwd-group>
<contract-num rid="cn001">61532014</contract-num>
<contract-num rid="cn001">61671189</contract-num>
<contract-sponsor id="cn001">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content></contract-sponsor>
<counts>
<fig-count count="11"/>
<table-count count="3"/>
<equation-count count="7"/>
<ref-count count="52"/>
<page-count count="15"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Cancer is a major public health problem worldwide (<xref ref-type="bibr" rid="B38">Siegel et al., 2020</xref>) and the occurrence of cancer is caused by many factors. It is not only controlled by genetics and epigenetics, but also influenced by many other regulatory factors, such as miRNAs. A variety of regulatory factors contribute to the heterogeneity of cancer (<xref ref-type="bibr" rid="B30">Marusyk et al., 2012</xref>; <xref ref-type="bibr" rid="B40">Swanton, 2012</xref>; <xref ref-type="bibr" rid="B8">Burrell et al., 2013</xref>), which leads to a low cure rate and poor prognosis. Survival prediction provided a crucial evidence for the process of cancer diagnosis and treatment. Prognostic biomarkers are used to predict likelihood of recurrence or progression in patients with cancer (<xref ref-type="bibr" rid="B9">Cagney et al., 2018</xref>). However, it is still hard to identify the prognostic biomarkers of cancer accurately.</p>
<p>Omics data play a key role in predicting prognostic biomarkers. At present, many researchers have identified prognostic biomarkers based on differential analysis of DNA methylation (DM) or other omics data, involving gene expression (GE), somatic copy-number alteration (SCNA) and microRNA expression (ME). <xref ref-type="bibr" rid="B17">Dalerba et al. (2016)</xref> found that <italic>CDX2</italic> was a prognostic biomarker in stage II and stage III colon cancer by analyzing GE data. <xref ref-type="bibr" rid="B51">Zhao et al. (2017)</xref> identified eight differentially methylated CpGs as new prognostic biomarkers for prostate cancer by analyzing DM data. <xref ref-type="bibr" rid="B34">Morikawa et al. (2018)</xref> discovered that SCNAs in 8p11.21-22, 12p13.31, 20q13.2, 3q26.1, 4q13.2, and 22q11.23 were critical for the development and survival of ovarian clear cell carcinoma. <xref ref-type="bibr" rid="B28">Lindahl et al. (2018)</xref> developed a prognostic 3-miRNA classifier (miR-106b-5p, miR-148a-3p, and miR-338-3p) in early-stage mycosis fungoides. The advantage of omics data for identifying cancer-related prognostic biomarkers can be clearly seen in the studies mentioned above. However, each of these studies used only one type of omics data, which did not make full use of omics data.</p>
<p>The regulation of GE is a complex process. Generally, the DNA hypermethylation in promoter region of genes could cause transcriptional silencing (<xref ref-type="bibr" rid="B5">Baylin, 2005</xref>) and DNA hypomethylation was associated with the activation of GE (<xref ref-type="bibr" rid="B6">Berdasco and Esteller, 2010</xref>). Besides, the copy number correlated positively with expression levels for genes (<xref ref-type="bibr" rid="B18">Fehrmann et al., 2015</xref>). Moreover, miRNAs complementary bound to messenger RNAs (mRNAs) and formed RNA-induced silencing complex (RISC) to downregulate GE levels (<xref ref-type="bibr" rid="B4">Bartel, 2004</xref>). The researches of cancer focusing on one-dimensional omics data may only provide limited information for the etiology of oncogenesis and tumor progression. In the past few years, more and more researches applied multi-omics data. <xref ref-type="bibr" rid="B47">Xu et al. (2019)</xref> proposed a method, named high-order path elucidated similarity (HOPES), to identify cancer subtypes by simultaneous interrogation multi-omics data. They utilized their method on GE, DM, and ME data of five TCGA cancers to identify subtypes and further validated reliability and clinical role of them. <xref ref-type="bibr" rid="B45">Vasaikar et al. (2018)</xref> developed a powerful database, named LinkedOmics, for analysis of omics data in cancer. LinkedOmics contained multi-omics data of 32 cancer types and allowed for flexible exploration and comparison of associations between multiple types of attributes within and across tumor types. The positive results of these researches confirmed the feasibility of integrating multi-omics data. Both of these work used multi-omics data for cancer research. However, they did not focus on prognostic markers, so we cannot further compare them numerically with our method.</p>
<p>Similarly, integrating omics data indicated the potential benefits for discovering underlying prognostic markers in cancer (<xref ref-type="bibr" rid="B23">Huang et al., 2017</xref>). Using multi-omics data acquiring from the same set of samples has the potential capacity to expose more accurate biomarkers for patients&#x2019; survival than examining by one single-omics data (<xref ref-type="bibr" rid="B36">Rappoport and Shamir, 2018</xref>). <xref ref-type="bibr" rid="B49">Yuan et al. (2014)</xref> used somatic copy-number alteration (SCNA), DM, GE, ME, and protein expression data to predict survival status of patients. They found that incorporating molecular data with clinical variables improved the accuracy of survival prediction for cancers. This work provided a starting point and resources for the subsequent researches. <xref ref-type="bibr" rid="B50">Zhang et al. (2016)</xref> utilized GE, SCNA, ME, and DM data to uncover protein&#x2013;protein subnetworks associated with prognosis. This work built a multi-dimensional subnetwork atlas for cancer prognosis to investigate the potential impact of multiple genetics and epigenetics better. <xref ref-type="bibr" rid="B14">Chaudhary et al. (2018)</xref> presented a deep learning based model on liver hepatocellular carcinoma (LIHC) that robustly differentiates survival subpopulations of patients using GE, DM, and ME data. They validated this multi-omics model on five external datasets of various omics types and all have good performance. <xref ref-type="bibr" rid="B52">Zhu et al. (2017)</xref> presented a kernel machine learning method to systematically quantify the prognostic values of clinical information, GE, SCNA, DM, and ME across 14 cancer types. This study aimed to compare the advantages and disadvantages of using different omics data to evaluate patients&#x2019; survival. Based on their result, GE and ME data were demonstrated to be the best data for the prognosis of cancers. <xref ref-type="bibr" rid="B33">Mishra et al. (2019)</xref> used DM, GE, ME, and long non-coding RNA (lncRNA) expression data to identify potential prognostic markers of pancreatic ductal adenocarcinoma. They identified several genes, miRNA, lncRNA, and CpG sites as probable prognostic biomarkers. All methods mentioned above used multi-omics data to perform prediction of patients&#x2019; survival. However, most of them did not integrate multi-omics data comprehensively but only utilized multi-omics data to explore mechanism of cancer separately. Moreover most of them they did not provide specifically prognostic biomarkers for other clinical researches or just aimed at limited kinds of cancers.</p>
<p>The Cancer Genome Atlas (TCGA) provides multiple omics data for different cancers (<xref ref-type="bibr" rid="B10">Cancer Genome Atlas Research Network, 2011</xref>, <xref ref-type="bibr" rid="B11">2012</xref>, <xref ref-type="bibr" rid="B12">2013</xref>; <xref ref-type="bibr" rid="B13">Cancer Genome Atlas Research Network et al., 2016</xref>), which allows for analyzing multi-omics data coming from the same samples. So far, there already exist a variety of methods for predicting patients&#x2019; survival status using TCGA omics data.</p>
<p>In this work, we put forward our own method to identify prognostic biomarkers and identified prognostic gene lists for 13 types of cancers. This work provided theoretical foundation and reliably prognostic biomarkers for other researches focusing on diagnosis, prognosis, and treatment of cancers.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Data</title>
<p>Multi-omics data were downloaded from TCGA. The scale and platform of each cancer data are shown in <xref ref-type="table" rid="T1">Table 1</xref>. We selected the cancers which had HM450K DM data, RNA-seq data (GE), miRNA-seq data (ME), and SNP 6.0 copy number data (SCNA) simultaneously and whose sample size was greater than 200. Samples with sample type codes of &#x201C;01&#x201D; were retained, which represented &#x201C;Primary Solid Tumor.&#x201D; After being filtered, there were 13 types of cancers available. For SCNA data, a matrix was obtained after being processed by Gistic 2.0 (<xref ref-type="bibr" rid="B32">Mermel et al., 2011</xref>). Next, all omics data matrixes except ME were converted into gene matrixes based on the annotation information from TCGA. Genes with missing values in &#x003E; 5% of the samples were removed in each matrix. Moreover, for GE and ME data, we retained the genes or miRNAs with values greater than 0 in &#x003E; 50% of the samples and with values greater than 1 in &#x003E; 10% of the samples, respectively. After converting if one gene had multiple signals in one sample, we calculated the average of the values as the final signal. For ME, miRNAs were specifically bound to mRNAs by complementary base pairing, therefore the corresponding relationships between miRNAs and genes were obtained through the miRNA&#x2013;mRNA interactions which were downloaded from the Starbase database (<xref ref-type="bibr" rid="B48">Yang et al., 2011</xref>). Interactions with no less than five supporting experiments and anti-correlation in no less than one cancer type were selected. Since multiple miRNAs were bound to the same gene, the average value of the miRNAs was assigned to the gene.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>The sample size of 13 types of cancers.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Cancer</td>
<td valign="top" align="center">Clinical</td>
<td valign="top" align="center">DM (450K)</td>
<td valign="top" align="center">SCNA (nocnv)</td>
<td valign="top" align="center">GE (FPKM-UQ)</td>
<td valign="top" align="center">ME (isoform)</td>
<td valign="top" align="center">Total size</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Bladder urothelial carcinoma [BLCA]</td>
<td valign="top" align="center">291</td>
<td valign="top" align="center">412</td>
<td valign="top" align="center">410</td>
<td valign="top" align="center">408</td>
<td valign="top" align="center">409</td>
<td valign="top" align="center">283</td>
</tr>
<tr>
<td valign="top" align="left">Breast invasive carcinoma [BRCA]</td>
<td valign="top" align="center">735</td>
<td valign="top" align="center">783</td>
<td valign="top" align="center">1092</td>
<td valign="top" align="center">1091</td>
<td valign="top" align="center">1078</td>
<td valign="top" align="center">497</td>
</tr>
<tr>
<td valign="top" align="left">Cervical squamous cell carcinoma and endocervical adenocarcinoma [CESC]</td>
<td valign="top" align="center">177</td>
<td valign="top" align="center">307</td>
<td valign="top" align="center">295</td>
<td valign="top" align="center">304</td>
<td valign="top" align="center">307</td>
<td valign="top" align="center">166</td>
</tr>
<tr>
<td valign="top" align="left">Colon adenocarcinoma [COAD]</td>
<td valign="top" align="center">186</td>
<td valign="top" align="center">296</td>
<td valign="top" align="center">452</td>
<td valign="top" align="center">456</td>
<td valign="top" align="center">444</td>
<td valign="top" align="center">164</td>
</tr>
<tr>
<td valign="top" align="left">Head and neck squamous cell carcinoma [HNSC]</td>
<td valign="top" align="center">416</td>
<td valign="top" align="center">528</td>
<td valign="top" align="center">522</td>
<td valign="top" align="center">500</td>
<td valign="top" align="center">523</td>
<td valign="top" align="center">380</td>
</tr>
<tr>
<td valign="top" align="left">Kidney renal clear cell carcinoma [KIRC]</td>
<td valign="top" align="center">452</td>
<td valign="top" align="center">319</td>
<td valign="top" align="center">532</td>
<td valign="top" align="center">530</td>
<td valign="top" align="center">516</td>
<td valign="top" align="center">251</td>
</tr>
<tr>
<td valign="top" align="left">Kidney renal papillary cell carcinoma [KIRP]</td>
<td valign="top" align="center">183</td>
<td valign="top" align="center">275</td>
<td valign="top" align="center">289</td>
<td valign="top" align="center">288</td>
<td valign="top" align="center">291</td>
<td valign="top" align="center">172</td>
</tr>
<tr>
<td valign="top" align="left">Brain lower grade glioma [LGG]</td>
<td valign="top" align="center">347</td>
<td valign="top" align="center">516</td>
<td valign="top" align="center">515</td>
<td valign="top" align="center">511</td>
<td valign="top" align="center">512</td>
<td valign="top" align="center">340</td>
</tr>
<tr>
<td valign="top" align="left">Liver hepatocellular carcinoma [LIHC]</td>
<td valign="top" align="center">259</td>
<td valign="top" align="center">377</td>
<td valign="top" align="center">375</td>
<td valign="top" align="center">371</td>
<td valign="top" align="center">372</td>
<td valign="top" align="center">250</td>
</tr>
<tr>
<td valign="top" align="left">Lung adenocarcinoma [LUAD]</td>
<td valign="top" align="center">292</td>
<td valign="top" align="center">460</td>
<td valign="top" align="center">520</td>
<td valign="top" align="center">515</td>
<td valign="top" align="center">515</td>
<td valign="top" align="center">229</td>
</tr>
<tr>
<td valign="top" align="left">Lung squamous cell carcinoma [LUSC]</td>
<td valign="top" align="center">299</td>
<td valign="top" align="center">370</td>
<td valign="top" align="center">503</td>
<td valign="top" align="center">501</td>
<td valign="top" align="center">478</td>
<td valign="top" align="center">190</td>
</tr>
<tr>
<td valign="top" align="left">Sarcoma [SARC]</td>
<td valign="top" align="center">204</td>
<td valign="top" align="center">261</td>
<td valign="top" align="center">260</td>
<td valign="top" align="center">259</td>
<td valign="top" align="center">259</td>
<td valign="top" align="center">200</td>
</tr>
<tr>
<td valign="top" align="left">Stomach adenocarcinoma [STAD]</td>
<td valign="top" align="center">201</td>
<td valign="top" align="center">395</td>
<td valign="top" align="center">441</td>
<td valign="top" align="center">375</td>
<td valign="top" align="center">436</td>
<td valign="top" align="center">157</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>The sample size of clinical, DNA methylation (DM), somatic copy-number alteration (SCNA), gene expression (GE), and microRNA expression(ME) of 13 types of cancers. The platforms for each data were written in the parentheses below. Abbreviations for cancer names were written in square brackets. The total size of each cancer was the number of samples which have all the five types of data simultaneously.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Because of different scales for the omics data, the data were normalized based on the following rules. First, each omics data were organized into a matrix of the same genes and samples, separately. Second, the method z-score was used to transform a matrix into standardized one with the mean and standard deviation of 0 and 1, respectively. Finally, we uniformly kept the fourth decimal place for better integration of the standardized data.</p>
</sec>
<sec id="S2.SS2">
<title>Screening of Candidate Survival-Related Genes</title>
<p>Univariate Cox proportional hazards regression model (<xref ref-type="bibr" rid="B16">Cox, 1986</xref>) was used to identify candidate survival-related genes from each omics data through the formula:</p>
<disp-formula id="S2.E1">
<label>(1)</label>
<mml:math id="M1">
<mml:mrow>
<mml:mrow>
<mml:mtext>h</mml:mtext>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>X</mml:mtext>
<mml:mo>,</mml:mo>
<mml:mtext>t</mml:mtext>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mtext>h</mml:mtext>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>t</mml:mtext>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mtext>exp</mml:mtext>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mtext>X</mml:mtext>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where the explanatory variable X was the omics data (DM, GE, copy number variation, or miRNA expression) of a gene, and the response variable t was the survival time (<xref ref-type="bibr" rid="B1">Aalen, 1989</xref>). The proportional hazards regression model was calculated through the R package &#x201C;survival.&#x201D; &#x03B2; greater than zero meant the gene was a risk factor base on the corresponding omics data. Then using voting strategy, if a gene had a <italic>p</italic>-value of likelihood ratio test less than 0.05 (<xref ref-type="bibr" rid="B49">Yuan et al., 2014</xref>), the gene was denoted as &#x201C;1&#x201D;. Otherwise, it was denoted as &#x201C;0.&#x201D; Finally, a gene defined as a candidate survival-related gene should be marked as &#x201C;1&#x201D; in no less than two of the four omics data types (<xref ref-type="fig" rid="F1">Figure 1A</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>The workflow of survival-related genes identification. <bold>(A)</bold> Candidate survival-related gene screening. DNA methylation, gene expression, somatic copy-number alteration, and microRNA (miRNA) expression profiles of TCGA for the same samples were extracted. miRNA expression data were corresponding to genes according to miRNA&#x2013;mRNA interactions. Then, we got four types of data in the same samples and the same genes. On each omics data, univariate Cox proportional hazards model was utilized to identify survival-related genes. Only the genes associated with survival in more than two types of data were considered to be candidate genes. <bold>(B)</bold> Prognostic biomarker identifying. For the selected candidate genes, the multivariate Cox proportional hazards model was then applied to get risk scores (RS). Further, scores for ranking genes were obtained by calculating GS scores. In which, A, B, C, and D were binary variables indicating whether the gene was survival-related at the four omics data or not (&#x201C;1&#x201D; for related and &#x201C;0&#x201D; for not), respectively. The high ranked genes were identified survival-related.</p></caption>
<graphic xlink:href="fbioe-08-00268-g001.tif"/>
</fig>
</sec>
<sec id="S2.SS3">
<title>Identification of Prognostic Biomarkers</title>
<p>As shown in <xref ref-type="fig" rid="F1">Figure 1B</xref>, prognostic biomarkers were further identified in the set of candidate survival-related genes. For each gene, a matrix M = [<italic>O</italic><italic>m</italic><italic>i</italic><italic>c</italic><italic>s</italic><sub><italic>G</italic><italic>E</italic></sub>,<italic>O</italic><italic>m</italic><italic>i</italic><italic>c</italic><italic>s</italic><sub><italic>S</italic><italic>C</italic><italic>N</italic><italic>A</italic></sub>,<italic>O</italic><italic>m</italic><italic>i</italic><italic>c</italic><italic>s</italic><sub><italic>D</italic><italic>M</italic></sub>,<italic>O</italic><italic>m</italic><italic>i</italic><italic>c</italic><italic>s</italic><sub><italic>M</italic><italic>E</italic></sub>] merged by the vectors of the four omics data of the gene was obtained. Then, the multivariate Cox proportional hazards model was applied on it. Briefly, the model assumed that a patient with covariate values has a cumulative hazard rate related to an unspecified baseline hazard rate seen in the equation:</p>
<disp-formula id="S2.Ex1">
<label>(2)</label>
<mml:math id="M2">
<mml:mrow>
<mml:mtext>h</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtext>t</mml:mtext>
<mml:mo>,</mml:mo>
<mml:mtext>M</mml:mtext>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>t</mml:mtext>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>C</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>A</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x03B2;</mml:mi>
<mml:mn>4</mml:mn>
</mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where h(t, M) was the patient&#x2019;s hazard of death at time t, <italic>h</italic><sub>0</sub>(<italic>t</italic>) was the baseline hazard rate, and <italic>B</italic> = [&#x03B2;<sub>1</sub>,&#x03B2;<sub>2</sub>,&#x03B2;<sub>3</sub>,&#x03B2;<sub>4</sub>] was a regression coefficient that gives the effect of each M covariate on the hazard rate (<xref ref-type="bibr" rid="B3">Alamartine et al., 1991</xref>). Each &#x03B2; could be interpreted as a risk coefficient (<xref ref-type="bibr" rid="B15">Collett, 2015</xref>). If the <italic>p</italic>-values of Cox fitting in all three overall tests (likelihood, Wald, and log-rank) were less than 0.05, the model was thought to be significant (<xref ref-type="bibr" rid="B37">Rodriguez-Martin et al., 2020</xref>). Therefore, we only kept genes whose all three <italic>p</italic>-values were less than 0.05.</p>
<p>For the retained genes, each gene had a vector including the value of four types of omics data in each sample <italic>V</italic> = [<italic>v</italic><sub>1</sub>,<italic>v</italic><sub>2</sub>,<italic>v</italic><sub>3</sub>,<italic>v</italic><sub>4</sub>]. The risk score (RS) for the gene in each sample was then calculated:</p>
<disp-formula id="S2.E3">
<label>(3)</label>
<mml:math id="M4">
<mml:mrow>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mo>&#x22C5;</mml:mo>
<mml:mi>V</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>The RS score could be used to predict the patients&#x2019; risk.</p>
<p>Thereafter, RS scores of the genes were used to calculate each gene&#x2019;s score (GS):</p>
<disp-formula id="S2.E4">
<label>(4)</label>
<mml:math id="M5">
<mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>m</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mi>m</mml:mi>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>m</italic> was the number of samples. At last, the scores of univariate and multivariate Cox proportional hazards model were combined to calculate the survival-related score of each gene (Score):</p>
<disp-formula id="S2.E5">
<label>(5)</label>
<mml:math id="M6">
<mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>B</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>C</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>D</mml:mi>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>A</italic>, <italic>B</italic>, <italic>C</italic>, and <italic>D</italic> represented whether the gene was survival-related at the GE level, copy number level, DM level, and miRNA level, respectively (&#x201C;1&#x201D; meant related and &#x201C;0&#x201D; meant not). The higher the score, the more relevant between the gene and patients&#x2019; survival. Therefore, high score genes were identified as prognostic biomarkers.</p>
</sec>
<sec id="S2.SS4">
<title>Functional Analysis</title>
<p>Cumulative hypergeometric inspection was applied to enrichment analysis of Gene Ontology (GO) functions (<xref ref-type="bibr" rid="B20">Gene Ontology, 2015</xref>) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (<xref ref-type="bibr" rid="B24">Kanehisa and Goto, 2000</xref>):</p>
<disp-formula id="S2.E6">
<label>(6)</label>
<mml:math id="M7">
<mml:mrow>
<mml:mtext>P</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo largeop="true" movablelimits="false" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munderover>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi>M</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi>i</mml:mi>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>-</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi>N</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi>n</mml:mi>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>N</italic> was the whole number of genes. <italic>M</italic> was the number of genes on a term or a pathway. <italic>n</italic> was the intersection of interested gene set and <italic>N</italic>. <italic>i</italic> was the intersection of <italic>M</italic> and <italic>n</italic>. Significant threshold of hypergeometric test was set to <italic>P</italic> &#x003C; 0.05 (<xref ref-type="bibr" rid="B29">Liu et al., 2018</xref>). For the enrichment analysis of GO, we only investigated the biological process (BP) terms.</p>
</sec>
<sec id="S2.SS5">
<title>Different Expression</title>
<p>In order to identify differentially expressed genes, the corresponding normal samples of LUSC and KIRC were downloaded from TCGA. There were 49 LUSC normal samples and 72 KIRC normal samples. After organized into the same gene set, the differentially expressed genes between tumor and normal samples were identified using the R package &#x201C;samr&#x201D; with the threshold value <italic>q</italic>-value &#x003C; 0.05 and |<italic>l</italic><italic>o</italic><italic>g</italic><sub>2</sub>(<italic>f</italic><italic>o</italic><italic>l</italic><italic>d</italic><italic>c</italic><italic>h</italic><italic>a</italic><italic>n</italic><italic>g</italic><italic>e</italic>)| &#x003E; 1 (<xref ref-type="bibr" rid="B21">Group et al., 2020</xref>). The package was based on significance analysis of microarrays (SAM). SAM was developed based on <italic>t</italic>-test and adjusted the <italic>p</italic>-value to assess the statistically significant changes for genes (<xref ref-type="bibr" rid="B43">Tusher et al., 2001</xref>).</p>
</sec>
<sec id="S2.SS6">
<title>Predictive Model Validation</title>
<p>For each cancer type, in order to evaluate the prognostic power of the biomarker fairly and accurately, the concordance index (C-index) (<xref ref-type="bibr" rid="B22">Harrell et al., 1996</xref>) was applied to assess the prognostic power of the classifier. The C-index was a non-parametric measure to quantify the discriminatory power of a predictive model with the value ranging from 0.5 to 1. A C-index of 1 represented perfect prediction accuracy, while C-index of 0.5 indicated a bad prediction like a random guess.</p>
<p>First, we randomly selected 90% of the samples. Second, the Cox regression model was used to calculate the RS score for each sample by multi-omics data of the identified biomarker genes. Based on the RS score, samples were classified into high and low risk groups. Patients in the high risk group were more likely to have poor prognosis while patients in the low risk group were more likely to have good prognosis. Finally, the predicted outcomes for patients were compared with the real status to calculate the C-indexes.</p>
<p>The procedure above was repeated 100 times to generate 100 C-indexes. If the median value of C-index was significantly higher than 0.5, indicating that the model had substantially prognostic power.</p>
</sec>
<sec id="S2.SS7">
<title>Decision Curve Analysis</title>
<p>Decision curve analysis was performed through the multi-omics data and every single omics data, respectively. The method was based on the principle that the relative harms of false positives (e.g., unnecessary biopsy) and false negatives (e.g., missed cancer) could be expressed in terms of a probability threshold (<xref ref-type="bibr" rid="B46">Vickers et al., 2008</xref>). Therefore, this threshold probability could be used to determine both whether a patient was defined as test-positive or negative and to model the clinical consequences of true and false positives using a clinical net benefit function:</p>
<disp-formula id="S2.E7">
<label>(7)</label>
<mml:math id="M8">
<mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+2.8pt">
<mml:mi>t</mml:mi>
</mml:mpadded>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+2.8pt">
<mml:mi>e</mml:mi>
</mml:mpadded>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>v</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>s</mml:mi>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+2.8pt">
<mml:mi>e</mml:mi>
</mml:mpadded>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>v</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>s</mml:mi>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mfrac>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>t</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>n</italic> was the total number of patients in the study and <italic>p</italic><sub><italic>t</italic></sub> was the threshold probability. Net benefit was weighted by the relative harm of forgoing treatment compared with the negative consequences of an unnecessary treatment. In the decision curve, the thin oblique line represented the assumption that all patients have been treated. The black line represented the assumption that no patients have been treated.</p>
</sec>
</sec>
<sec id="S3">
<title>Results</title>
<sec id="S3.SS1">
<title>Pan-Cancer Prognostic Biomarker Identification</title>
<p>We integrated GE, SCNA, DM, and miRNA expression data of 13 cancers from TCGA: bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), colon adenocarcinoma (COAD), head and neck squamous cell carcinoma (HNSC), kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP), brain lower grade glioma (LGG), LIHC, lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), sarcoma (SARC), and stomach adenocarcinoma (STAD). After data preprocessing, samples with all the four omics data were kept. Whereupon, we collected the DM, GE, copy number, and miRNA expression of 3279 samples (<xref ref-type="fig" rid="F2">Figure 2A</xref>). The percentage of each cancer is shown in <xref ref-type="fig" rid="F2">Figure 2B</xref>. We then summarized the clinical characteristics of the 3279 samples. As shown in <xref ref-type="fig" rid="F2">Figure 2C</xref>, the majority of these patients were 60&#x2013;79 years old. And the number of men and women was basically equal. Hence, the sample set could be used to study cancer without gender and age bias. In addition, most of the patients were white people. The complete clinical information for each sample is provided in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table S1</xref>.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>The information of pan-cancer samples. <bold>(A)</bold> The sample set intersections of the multi-omics data. Only the intersecting samples were chosen. We selected 3279 samples in this study. <bold>(B)</bold> The proportion of each cancer. <bold>(C)</bold> The clinical features distribution of the 3279 samples.</p></caption>
<graphic xlink:href="fbioe-08-00268-g002.tif"/>
</fig>
<p>The survival-related gene list of each cancer is shown in <xref ref-type="supplementary-material" rid="TS2">Supplementary Table S2</xref>. We took top-10 genes as a prognostic biomarker of each cancer (<xref ref-type="table" rid="T2">Table 2</xref>) to draw Kaplan&#x2013;Meier (KM) curves and calculated their log-rank <italic>p</italic>-values. As shown in <xref ref-type="fig" rid="F3">Figure 3</xref>, the prognostic markers for each cancer significantly distinguished the high and low risk groups, except for SARC.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>The prognostic biomarkers of each cancer.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Cancers</td>
<td valign="top" align="left">Genes</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">BLCA</td>
<td valign="top" align="left"><italic>NCBP1</italic>, <italic>RP9</italic>, <italic>UBP1</italic>, <italic>AURKB</italic>, <italic>ARL6IP5</italic>, <italic>LEMD3</italic>, <italic>HSPBP1</italic>, <italic>TMEM214</italic>, <italic>MCMBP</italic>, <italic>FAM107B</italic></td>
</tr>
<tr>
<td valign="top" align="left">BRCA</td>
<td valign="top" align="left"><italic>LIMS1</italic>, <italic>NDUFA3</italic>, <italic>NKTR</italic>, <italic>SRP68</italic>, <italic>ARPC3</italic>, <italic>TMEM138</italic>, <italic>DDIT4</italic>, <italic>OCIAD1</italic>, <italic>MAF1</italic>, <italic>DPY19L4</italic></td>
</tr>
<tr>
<td valign="top" align="left">CESC</td>
<td valign="top" align="left"><italic>HNRNPA2B1</italic>, <italic>MYO9B</italic>, <italic>EIF3B</italic>, <italic>MTX2</italic>, <italic>MON1B</italic>, <italic>SUN1</italic>, <italic>SSH1</italic>, <italic>SLC35E3</italic>, <italic>MAP7D1</italic>, <italic>PGAM5</italic></td>
</tr>
<tr>
<td valign="top" align="left">COAD</td>
<td valign="top" align="left"><italic>SRP72</italic>, <italic>TAF10</italic>, <italic>USP1</italic>, <italic>USP8</italic>, <italic>JKAMP</italic>, <italic>YTHDF1</italic>, <italic>BRIX1</italic>, <italic>ATG101</italic>, <italic>VPS37A</italic>, <italic>TMED4</italic></td>
</tr>
<tr>
<td valign="top" align="left">HNSC</td>
<td valign="top" align="left"><italic>DAGLA</italic>, <italic>DNAJA3</italic>, <italic>PTER</italic>, <italic>NUDT5</italic>, <italic>FAM168A</italic>, <italic>GTPBP4</italic>, <italic>PTCD3</italic>, <italic>MINDY3</italic>, <italic>NCALD</italic>, <italic>STEAP2</italic></td>
</tr>
<tr>
<td valign="top" align="left">KIRC</td>
<td valign="top" align="left"><italic>BPTF</italic>, <italic>IMPA1</italic>, <italic>RPS6KA4</italic>, <italic>TSFM</italic>, <italic>CEPT1</italic>, <italic>PRKD3</italic>, <italic>PPME1</italic>, <italic>SPIRE1</italic>, <italic>NUDCD1</italic>, <italic>NHLRC2</italic></td>
</tr>
<tr>
<td valign="top" align="left">KIRP</td>
<td valign="top" align="left"><italic>DAPK3</italic>, <italic>EPRS</italic>, <italic>IARS</italic>, <italic>SYMPK</italic>, <italic>PCSK7</italic>, <italic>APBB3</italic>, <italic>MSANTD2</italic>, <italic>TBC1D17</italic>, <italic>DOHH</italic>, <italic>CMSS1</italic></td>
</tr>
<tr>
<td valign="top" align="left">LGG</td>
<td valign="top" align="left"><italic>FDXR</italic>, <italic>VPS4B</italic>, <italic>WASHC5</italic>, <italic>CITED2</italic>, <italic>BRD8</italic>, <italic>MON2</italic>, <italic>TSPAN13</italic>, <italic>MIOS</italic>, <italic>OGFOD3</italic>, <italic>PIGO</italic></td>
</tr>
<tr>
<td valign="top" align="left">LIHC</td>
<td valign="top" align="left"><italic>BCL2L1</italic>, <italic>SELENOW</italic>, <italic>HIST1H2BN</italic>, <italic>MMADHC</italic>, <italic>PNPO</italic>, <italic>ZDHHC11</italic>, <italic>ULBP2</italic>, <italic>CSRNP2</italic>, <italic>SPC24</italic>, <italic>RPL7L1</italic></td>
</tr>
<tr>
<td valign="top" align="left">LUAD</td>
<td valign="top" align="left"><italic>FGFR3</italic>, <italic>LTBP3</italic>, <italic>SLC6A4</italic>, <italic>PUM1</italic>, <italic>ARHGAP44</italic>, <italic>SLC39A1</italic>, <italic>NAGPA</italic>, <italic>BTBD2</italic>, <italic>LRRK1</italic>, <italic>ZFC3H1</italic></td>
</tr>
<tr>
<td valign="top" align="left">LUSC</td>
<td valign="top" align="left"><italic>HSF2</italic>, <italic>BCLAF1</italic>, <italic>UHRF1BP1L</italic>, <italic>CHORDC1</italic>, <italic>CREBZF</italic>, <italic>FBXO30</italic>, <italic>PCGF6</italic>, <italic>PLCD3</italic>, <italic>HINT3</italic>, <italic>SLC35E2B</italic></td>
</tr>
<tr>
<td valign="top" align="left">SARC</td>
<td valign="top" align="left"><italic>BMP1</italic>, <italic>NCAM2</italic>, <italic>PBX1</italic>, <italic>RAD17</italic>, <italic>ARHGEF10</italic>, <italic>PSD3</italic>, <italic>MRPL17</italic>, <italic>FAM160B2</italic>, <italic>CHMP7</italic>, <italic>VPS37A</italic></td>
</tr>
<tr>
<td valign="top" align="left">STAD</td>
<td valign="top" align="left"><italic>ANK3</italic>, <italic>GNAI2</italic>, <italic>MARCKS</italic>, <italic>NEDD4</italic>, <italic>PRKAA1</italic>, <italic>UGP2</italic>, <italic>TAF1C</italic>, <italic>INO80D</italic>, <italic>USP37</italic>, <italic>FAM126B</italic></td>
</tr>
</tbody>
</table></table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>The Kaplan&#x2013;Meier curves of top-10 survival-related genes for each cancer. The green lines represented the low risk groups and the red lines represented the high risk groups. &#x201C; + &#x201D; indicated the censored follow-ups. <bold>(A)</bold> BLCA. <bold>(B)</bold> BRCA. <bold>(C)</bold> CESC. <bold>(D)</bold> COAD. <bold>(E)</bold> HNSC. <bold>(F)</bold> KIRC. <bold>(G)</bold> KIRP. <bold>(H)</bold> LGG. <bold>(I)</bold> LIHC. <bold>(J)</bold> LUAD. <bold>(K)</bold> LUSC. <bold>(L)</bold> SARC. <bold>(M)</bold> STAD.</p></caption>
<graphic xlink:href="fbioe-08-00268-g003.tif"/>
</fig>
<p>For each cancer type, we calculated the C-index which was a non-parametric measure to quantify the discriminatory power of a predictive model. <xref ref-type="fig" rid="F4">Figure 4</xref> shows the C-indexes of each cancer. All of the cancers had a C-index significantly higher than 0.5. BRCA had the highest C-index (0.96) while LUSC had the lowest (0.76).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>The C-index comparison of the prognostic power of our prognostic biomarkers in 13 cancers.</p></caption>
<graphic xlink:href="fbioe-08-00268-g004.tif"/>
</fig>
<p>In order to discover the relationship among different cancers based on function, we used the prognostic biomarker genes to perform functional enrichment analysis of GO and KEGG (<xref ref-type="supplementary-material" rid="TS3">Supplementary Table S3</xref>). The most significantly enriched functions and pathways of each cancer are displayed in <xref ref-type="fig" rid="F5">Figure 5A</xref>. Among them, COAD, LGG, and SARC were enriched in &#x201C;endocytosis.&#x201D; BLCA was enriched in &#x201C;RNA splicing&#x201D; and CESC was enriched in &#x201C;mitophagy.&#x201D; The prognostic biomarker genes were enriched in closely cancer-related functions. Then, we calculated the counts of each function enriched by cancers. As shown in <xref ref-type="fig" rid="F5">Figure 5B</xref>, &#x201C;Mitophagy&#x201D; was enriched by the most cancers. Mitophagy was a tumor suppression mechanism (<xref ref-type="bibr" rid="B7">Bernardini et al., 2017</xref>). Besides, we had some interesting findings. First, the most significantly enriched functions of each cancer were their specific functions, while the common functions of cancers were not highly significant generally. For example, &#x201C;cytoskeleton-dependent cytokinesis&#x201D; was the common enriched function of STAD, KIRC, COAD, and BLCA, and they had <italic>p</italic>-values about 0.03 which was less significant than their most significantly enriched functions (<italic>p</italic>-values &#x003C; 0.003). And their most significant functions were all their specific functions. Second, even if different cancers were enriched in a same function, the enrichment of function in different cancers was caused by different gene sets. For instance, &#x201C;Mitophagy&#x201D; was the common function of LIHC, LGG, KIRP, COAD, and CESC, but the function was hit by different genes (<italic>BCL2L1</italic> of LIHC, <italic>CITED2</italic> of LGG, <italic>TBC1D17</italic> of KIRP, <italic>USP8</italic> of COAD, and <italic>PGAM5</italic> of CESC). Whereafter, we sought the intersection of associated functions for the 13 cancers (<xref ref-type="fig" rid="F6">Figure 6A</xref> top right corner). The result showed that the intersection of LGG and SARC was the largest, followed by BLCA and CESC.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Pan-cancer functional comparison of survival-related genes. <bold>(A)</bold> The representative KEGG pathways and GO functions enriched by the top-10 prognostic genes of each cancer. <bold>(B)</bold> The distribution of cancers enriched to each function. The size of the dots represented the number of enriched genes. The color of the dots represented the <italic>p</italic>-values.</p></caption>
<graphic xlink:href="fbioe-08-00268-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>The intersection of pan-cancer genes. <bold>(A)</bold> The intersections of the lists of survival-related genes (left bottom) and the intersection of associated functions (top right corner) of cancers. The total numbers of genes associated with survival of each cancer were on the left, and the total associated functions were on the right. The color blocks represented the number of intersecting samples of each two cancers. The darker the color, the greater the intersection was. <bold>(B)</bold> The pan-cancer survival-related genes. The red blocks indicated that the gene was survival-associated with the cancer.</p></caption>
<graphic xlink:href="fbioe-08-00268-g006.tif"/>
</fig>
<p>In order to discover the relationship among different cancers based on survival-related genes, we first compared the intersection of the survival-related gene lists between each two cancers. We found there was always an overlap between each two gene lists (<xref ref-type="fig" rid="F6">Figure 6A</xref> left bottom). The intersection of KIRP and KIRC was the largest. All of the intersections among KIRC, KIRP, and BLCA were large, which might be due to the reason that the three cancers had the largest number of genes. The intersections with other cancers were roughly proportional to the size of the gene list. Second, we compared the gene lists among the 13 types of cancers. We found that seven genes were associated with survival in three kinds of cancers (<xref ref-type="fig" rid="F6">Figure 6B</xref>). Subsequently, we downloaded the list of cancer-related genes from the Candidate Cancer Gene Database (CCGD) (<xref ref-type="bibr" rid="B2">Abbott et al., 2015</xref>), and retained the human genes that appeared in at least one of the COSMIC and CGC (<xref ref-type="bibr" rid="B39">Sondka et al., 2018</xref>). A total of 9265 genes were retained. All of the seven pan-cancer survival-related genes were in the list, and have been verified cancer-related in no less than one literature (<xref ref-type="table" rid="T3">Table 3</xref>). In addition, we investigated the functions of the seven genes (<xref ref-type="supplementary-material" rid="TS4">Supplementary Table S4</xref>) and conducted UCSC Genome Browser (<xref ref-type="bibr" rid="B44">Tyner et al., 2017</xref>) analysis on the seven genes. We found that <italic>SLK</italic> had an unconservative exon region, which containing four missense variants (<xref ref-type="fig" rid="F7">Figure 7A</xref>).</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>The seven pan-cancer survival-related genes.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Genes</td>
<td valign="top" align="left">Functions</td>
<td valign="top" align="left">PubMed IDs</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>SLK</italic></td>
<td valign="top" align="left">Cell adhesion, regulation of cell cycle</td>
<td valign="top" align="left">26676752, 27849608, 22057237, 27247392, 22699621</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ZRANB1</italic></td>
<td valign="top" align="left">Protein catabolic process</td>
<td valign="top" align="left">26676752, 27790711, 25559195, 27006499, 24316982, 27178121, 23685747, 22421440</td>
</tr>
<tr>
<td valign="top" align="left"><italic>BTBD2</italic></td>
<td valign="top" align="left">Protein catabolic process</td>
<td valign="top" align="left">26676752, 22057237</td>
</tr>
<tr>
<td valign="top" align="left"><italic>PTAR1</italic></td>
<td valign="top" align="left">Cellular protein modification process</td>
<td valign="top" align="left">26676752</td>
</tr>
<tr>
<td valign="top" align="left"><italic>VPS37A</italic></td>
<td valign="top" align="left">Viral process</td>
<td valign="top" align="left">27849608, 24316982, 23045694</td>
</tr>
<tr>
<td valign="top" align="left"><italic>EIF2B1</italic></td>
<td valign="top" align="left">Glial cell development</td>
<td valign="top" align="left">22057237</td>
</tr>
<tr>
<td valign="top" align="left"><italic>API5</italic></td>
<td valign="top" align="left">Regulation of cell death</td>
<td valign="top" align="left">27790711, 27178121</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>The most relevant functions of each gene were listed. The references supporting the association of gene with cancer were given in the form of PubMed IDs.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>The characteristics of <italic>SLK</italic>. <bold>(A)</bold>The results of UCSC Genome Browser. <bold>(B)</bold> Distribution of mutations on <italic>SLK</italic>. <bold>(C)</bold> The functions of <italic>SLK</italic>. <bold>(D)</bold> The protein&#x2013;protein interactions of <italic>SLK</italic>.</p></caption>
<graphic xlink:href="fbioe-08-00268-g007.tif"/>
</fig>
<p>In order to further verify the close relationship of <italic>SLK</italic> and cancer, we checked the mutation of <italic>SLK</italic> in the COSMIC database (<xref ref-type="bibr" rid="B42">Tate et al., 2019</xref>), and found that missense substitution occurred in 36.93% of the samples (<xref ref-type="fig" rid="F7">Figure 7B</xref>). Next, enrichment analysis in GO terms (BP) was performed by Enrichr (<xref ref-type="bibr" rid="B26">Kuleshov et al., 2016</xref>) and found <italic>SLK</italic> was mainly associated with cell adhesion (<xref ref-type="fig" rid="F7">Figure 7C</xref>). Finally, we used STRING database (<xref ref-type="bibr" rid="B41">Szklarczyk et al., 2019</xref>) to check the interacted proteins of <italic>SLK</italic>. <xref ref-type="fig" rid="F7">Figure 7D</xref> shows there were 10 genes interacting with <italic>SLK</italic>. Half of the interactions have been demonstrated through more than one method, and the genes interacting with <italic>SLK</italic> also had strong relationship between each other.</p>
<p>In order to explore the correlation among prognostic biomarkers of different cancers, we checked the genomic locations of these 130 genes (<xref ref-type="fig" rid="F8">Figure 8A</xref>). There were many prognosis-related genes located in chr 6, chr 7, chr 8, chr 11, chr 12, and chr 17, while few genes in chr 13, chr 14, chr 18, chr 20, and chr 21. In addition, we constructed a protein&#x2013;protein interaction network of these genes based on the STRING database (<xref ref-type="fig" rid="F8">Figure 8B</xref>). As shown in the network, the prognosis-related genes of different cancers were connected to each other. The degree distribution (<xref ref-type="fig" rid="F8">Figure 8C</xref>) and the betweeness centrality (<xref ref-type="fig" rid="F8">Figure 8D</xref>) of the network satisfied the condition of scale-free network and were conformed as the general characteristics of biological network. In the network, the degree of <italic>EPRS</italic> had the highest degree of 33. The degrees of <italic>HNRNPA2B1</italic>, <italic>BPTF</italic>, <italic>LRRK1</italic>, and <italic>PUM1</italic> were all greater than 20. These genes mentioned above were widely mutated in many cancers.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption><p>Pan-cancer survival-related gene networks. <bold>(A)</bold> The chromosome distribution of the genes. The blue, green, red, and purple blocks represented the survival correlation of the genes in each omics data, respectively. The links in the middle represented the interaction of the genes. <bold>(B)</bold> The interaction network among the top-10 survival-related genes. Different colors represented prognostic genes of different cancers. Red nodes represented genes prognosis-related in multiple cancers. <bold>(C)</bold> The degree distribution of the prognostic-related gene network. <bold>(D)</bold> The betweeness centrality of the prognostic-related gene network.</p></caption>
<graphic xlink:href="fbioe-08-00268-g008.tif"/>
</fig>
<p>Based on the COSMIC database, we found extensive mutations occurred in <italic>EPRS</italic>, and 70% of them were missense mutation. <italic>EPRS</italic> has been shown to be associated with a wide range of cancers by 80 articles. The other four genes also had widespread mutations. In the COSMIC database, 78, 114, 113, and 83 studies confirmed the correlation between <italic>HNRNPA2B1</italic>, <italic>BPTF</italic>, <italic>LRRK1</italic>, and <italic>PUM1</italic> with cancer, respectively.</p>
</sec>
<sec id="S3.SS2">
<title>The Predictive Performance of Our Method</title>
<p>In order to demonstrate the effectiveness of our method, we compared our prognostic biomarkers with previous works. The works using TCGA data were chosen to compare with our work. <xref ref-type="bibr" rid="B14">Chaudhary et al. (2018)</xref> used LIHC data of TCGA in their work. Their C-indexs of training and testing set were 0.70(&#x00B1;0.04) and 0.69(&#x00B1;0.08), while our median C-index of LIHC was 0.82. The prognostic power of our method was stronger than theirs. Next, both <xref ref-type="bibr" rid="B49">Yuan et al. (2014)</xref> and <xref ref-type="bibr" rid="B50">Zhang et al. (2016)</xref> used KIRC and LUSC data of TCGA in their works, so we compared our results of these two cancers with their studies. The comparisons of the C-indexes are shown in <xref ref-type="fig" rid="F9">Figures 9A,B</xref>, which showed the higher prognostic power of our 10-gene biomarkers. For KIRC, the median C-index of our work was 0.91. The median C-index of the best performing data (clinical + miRNA) in Yuan&#x2019;s work and the best performing subnetwork (subnetwork K1) of Zhang&#x2019;s study were about 0.76 and 0.74, respectively. For LUSC, the median C-index of our work was 0.76. The median C-index of the best performing data (clinical + protein) in Yuan&#x2019;s work and the best performing subnetwork (subnetwork L1) of Zhang&#x2019;s study were about 0.66 and 0.62, respectively. Therefore, the biomarkers identified by our method could display the better prediction for the patients&#x2019; survival.</p>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption><p>The prognostic biomarkers of KIRC and LUSC. <bold>(A,B)</bold> The C-index comparison of the prognostic power of our 10-gene prognostic biomarkers and other work. <bold>(C,D)</bold> The heatmap of samples hierarchical clustering by the expression of the 10-gene prognostic biomarkers. The bar on the top of the heatmap indicated the group the samples really belong to. Red represented tumor and green represented normal.</p></caption>
<graphic xlink:href="fbioe-08-00268-g009.tif"/>
</fig>
<p>To further confirm reliability of the genes, we downloaded GE data of the corresponding normal samples and used the prognostic biomarkers to cluster the samples. The results showed that the prognostic biomarkers could distinguish the tumor and normal samples (<xref ref-type="fig" rid="F9">Figures 9C,D</xref>).</p>
<p>Furthermore, we screened the differentially expressed genes between tumor and normal samples of LUSC and KIRC. After comparing them with the list of survival-associated genes, there were 12 differentially expressed genes in LUSC list (<xref ref-type="fig" rid="F10">Figure 10A</xref>) and 49 differentially expressed genes in KIRC list (<xref ref-type="fig" rid="F10">Figure 10D</xref>). Subsequently, we examined the copy number variation and chromosome location of both the differentially expressed genes and the top-10 biomarker genes (<xref ref-type="fig" rid="F10">Figures 10B,C,E,F</xref>). It turned out that among the 22 LUSC genes, six were located in chr 6q, three were in chr 10q, three were in chr 11q, and three were in chr 15q. Of the 59 KIRC genes, 10 were located in chr 8, eight were located in chr 17, and seven were in chr 1. These locations were the peak regions of copy number alternation, suggesting a relationship between these genes and cancer. Moreover, it could be seen that the driver genes of the two cancers were located in different chromosomes, which supported the uniqueness of different cancer-related genes.</p>
<fig id="F10" position="float">
<label>FIGURE 10</label>
<caption><p>The comparison of survival-related genes and differentially expressed genes. <bold>(A)</bold> The differentially expressed genes of LUSC. Red represented high expression and green represented low expression. Differentially expressed survival-related genes were marked. <bold>(B)</bold> The copy number variation peaks of LUSC. <bold>(C)</bold> Chromosomal positions and interactions of prognostic biomarkers and differentially expressed genes of LUSC. <bold>(D)</bold> The differentially expressed genes of KIRC. Red represented high expression and green represented low expression. Differentially expressed survival-related genes were marked. <bold>(E)</bold> The copy number variation peaks of KIRC. <bold>(F)</bold> Chromosomal positions and interactions of prognostic biomarkers and differentially expressed genes of KIRC.</p></caption>
<graphic xlink:href="fbioe-08-00268-g010.tif"/>
</fig>
<p>Moreover, in the process of univariate Cox regression model, we separately calculated the survival correlation of a gene in four different omics data and then counted them. We also tried the result of considering the same gene in different omics data as different features, and merged the four omics data into one matrix then performed multivariate Cox regression model on it. Only the genes identified as survival-related features more than twice were retained. Finally, the obtained genes were all involved in the gene lists identified through our method and had an incomplete coverage compared with our gene lists. Interestingly, most of these genes were related to survival in GE or SCNA.</p>
<p>In addition, in the process of multivariate Cox regression model, we involved all of the four types of omics data for each candidate gene to perform analysis. Actually, the genes were not survival-related at all of the four omics data in the univariate Cox regression model. To prove the validity of this process, we recalculated the Score of each gene by only using the types of omics data at which the gene was determined to be related to survival in univariate Cox regression model. The results showed that neither the Scores nor the ranks of the genes changed much after recalculation. In consequence, it could suggest the high predictive performance of our multivariable Cox regression model.</p>
</sec>
<sec id="S3.SS3">
<title>The Necessity of Multi-Omics Data Integration</title>
<p>In the process of univariate Cox regression analysis, we found that a gene appeared to be survival-related in one omics dataset, while it might appear to be unrelated to survival on another omics data even under the same model, selection criteria and set of samples. Although this phenomenon might be caused by the error of the data or the imprecision of the experiment, it implied the necessity of multi-omics data integration.</p>
<p>To verify the superiority of integrating multi-omics data, we compared the results of integrating multi-omics data with the results of single omics data in LUSC and KIRC. As shown in <xref ref-type="fig" rid="F11">Figure 11</xref>, the results of integrating multi-omics data were significantly higher than those of applying single omics data in decision curve analysis and C-index. The decision curve showed that compared with single omics data, the curve of multi-omics data was further apart from the two extreme curves, which had the greater application value.</p>
<fig id="F11" position="float">
<label>FIGURE 11</label>
<caption><p>The comparison of the results for multi-omics data and single omics data. <bold>(A)</bold> The decision curve of multi-omics data and each omics data in KIRC. The thin oblique line represented the assumption that all patients have been treated. The black line represented the assumption that no patients have been treated. <bold>(B)</bold> The C-indexes of multi-omics data and each omics data in KIRC. <bold>(C)</bold> The decision curve of multi-omics data and each omics data in LUSC. The thin oblique line represented the assumption that all patients have been treated. The black line represented the assumption that no patients have been treated. <bold>(D)</bold> The C-indexes of multi-omics data and each omics data in LUSC.</p></caption>
<graphic xlink:href="fbioe-08-00268-g011.tif"/>
</fig>
</sec>
</sec>
<sec id="S4">
<title>Discussion</title>
<p>The recognition of prognostic biomarkers in cancers could predict the prognostic status of each individual patient. This could help to achieve personalized medicine for cancer (<xref ref-type="bibr" rid="B35">Nalejska et al., 2014</xref>). Prior work has utilized omics data to predict prognostic status of cancer patients. However, multi-omics data were not used comprehensively.</p>
<p>In this work, we proposed a method to integrate multi-omics data and predict the prognostic status of patients. And gene lists associated with survival were identified in 13 types of cancers. Based on this foundation, the prognostic biomarkers of the cancers were obtained.</p>
<p>Compared with previous studies, this work took a more comprehensive integration of multi-omics data. To verify the reliability and reproducibility of our approach, we confirmed the relationship between our prognostic genes and cancer from multiple perspectives, and the results were stable when changing feature selection strategies. And this method was easy to implement because of its light calculation burden. We obtained candidate survival-related gene lists for 13 types of cancers, and compared the differences and similarities of the lists. The genes associated with survival in multiple cancers were found.</p>
<p>Not only have we successfully verified that genes like <italic>EPRS</italic> were indeed related to various cancers, but also we found that genes such as <italic>SLK</italic> were related to survival of multiple cancers. <italic>SLK</italic> has been reported to associate with blood cancer, breast cancer, colorectal cancer, liver cancer, and pancreatic cancer. In our work, we found that it was participated in the BP of patients&#x2019; survival of bladder cancer, lung cancer, and renal cancer. <italic>SLK</italic> mainly associated with cell adhesion. Cell adhesion plays an important role in the maintenance of tissue structure, whose abnormality results in tumor invasion and metastasis.</p>
<p>However, we also got some confused results in comparing different cancers. As different cancer subtypes of the same tissue, the overlap of gene lists between LUAD and LUSC was small, which was different from the expected outcome. We suspected that this might be due to their different pathogenesis. LUSC commonly occurred in older men and was strongly associated with smoking, but LUAD was more common in women and non-smokers (<xref ref-type="bibr" rid="B25">Kenfield et al., 2008</xref>). The differences in the pathogenesis might lead to differences in the genetic mechanisms and the list of related genes. Moreover, the prognostic markers for SARC did not significantly distinguish the high and low risk groups. This might be due to the subtypes of SARC (leiomyosarcoma, liposarcoma, myxofibrosarcoma, synovial SARC, etc.). The subtypes of SARC ought to be considered as different cancer types.</p>
<p>In addition, this might be caused by the bias of data. TCGA patient samples were selected from multiple sources, and were characterized at multiple centers, which might introduce heterogeneity and bias. And the clinical annotations of the patients might not be sufficiently rigorous and comprehensive (<xref ref-type="bibr" rid="B49">Yuan et al., 2014</xref>). Even though we only selected the basic information such as age and gender, there were still some missing values.</p>
<p>Till now, only a few molecular prognostic biomarkers based on multi-omics data have been applied to clinic (<xref ref-type="bibr" rid="B49">Yuan et al., 2014</xref>). The presence of publication bias and incompletion in the literatures is a major reason why the identified tumor biomarkers have not been applied in clinic (<xref ref-type="bibr" rid="B31">Mcshane and Hayes, 2012</xref>). Further, translational medicine researchers have no access to the results of these studies. Our work clearly provided gene lists related to the survival of various cancers, which could be easily obtained and searched, and help to transform biological data into clinical experiments.</p>
<p>Even so, our work remains inadequate. First of all, overfitting and collinearity of biological data make it technically challenging to effective integration of multi-omics data. Our work did not solve the problem. Although LASSO can well select the most important features to overcome the overfitting problem, it will lose many equally important features at the same time when high pairwise correlations occurred (<xref ref-type="bibr" rid="B50">Zhang et al., 2016</xref>). And the intra-tumor heterogeneities of cancer make it almost impossible to find prognostic biomarkers 100% suitable for each patient. Future efforts are still needed to address these problems.</p>
<p>In addition, since the data were downloaded from TCGA which was a program of the National Cancer Institute (NCI) of the United States, most of the patients were white people. The results of this study may be only appropriate for the whites. Although <xref ref-type="bibr" rid="B14">Chaudhary et al. (2018)</xref> have validated their model, which was built by TCGA data, on Japanese and Chinese datasets, further validation of other cancer should be done and data of black population should be included in future studies. Furthermore, some studies have suggested a non-linear relationship between miRNA expression and clinical outcomes (<xref ref-type="bibr" rid="B19">Fuchs et al., 2013</xref>; <xref ref-type="bibr" rid="B27">Lee et al., 2013</xref>). Therefore, some non-parametric algorithms can be applied to the analysis of the prognosis of miRNA in future studies.</p>
</sec>
<sec id="S5">
<title>Data Availability Statement</title>
<p>The datasets generated for this study can be found in the TCGA database: <ext-link ext-link-type="uri" xlink:href="https://www.cancer.gov/tcga">https://www.cancer.gov/tcga</ext-link>. And the core code of this study was merged in the <xref ref-type="supplementary-material" rid="DS1">Supplementary Data Sheet S1</xref>.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>NZ collected data, carried out the initial analyses, and drafted the manuscript. MG conceived of the study, participated in its design and coordination, and helped to draft the manuscript. KW and XL coordinated and supervised data collection, and critically commented on the important intellectual content of the manuscript. CZ participated in the design of the study and performed the statistical analysis. All authors read and approved the final manuscript.</p>
</sec>
<sec id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This work was supported by the National Natural Science Foundation of China (Grant Nos. 61532014 and 61671189) and the National Key Research and Development Plan Task of China (Grant No. 2016YFC0901902).</p>
</fn>
</fn-group>
<ack>
<p>We thank the TCGA database for sharing the multi-omics data.</p>
</ack>
<sec id="S9" sec-type="supplementary material"><title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fbioe.2020.00268/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fbioe.2020.00268/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.XLSX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S1</label>
<caption><p>The complete clinical information for each sample.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_2.XLSX" id="TS2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S2</label>
<caption><p>The survival-related gene list of each cancer.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_3.XLSX" id="TS3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S3</label>
<caption><p>The enriched functions and pathways of each cancer.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_4.XLSX" id="TS4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S4</label>
<caption><p>The functions of the seven pan-cancer survival-related genes.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_1.ZIP" id="DS1" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>DATA SHEET S1</label>
<caption><p>Core code of this study.</p></caption>
</supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aalen</surname> <given-names>O. O.</given-names></name></person-group> (<year>1989</year>). <article-title>A linear regression model for the analysis of life times.</article-title> <source><italic>Statist. Med.</italic></source> <volume>8</volume> <fpage>907</fpage>&#x2013;<lpage>925</lpage>. <pub-id pub-id-type="doi">10.1002/sim.4780080803</pub-id> <pub-id pub-id-type="pmid">2678347</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abbott</surname> <given-names>K. L.</given-names></name> <name><surname>Nyre</surname> <given-names>E. T.</given-names></name> <name><surname>Abrahante</surname> <given-names>J.</given-names></name> <name><surname>Ho</surname> <given-names>Y. Y.</given-names></name> <name><surname>Isaksson Vogel</surname> <given-names>R.</given-names></name> <name><surname>Starr</surname> <given-names>T. K.</given-names></name></person-group> (<year>2015</year>). <article-title>The candidate cancer gene database: a database of cancer driver genes from forward genetic screens in mice.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <fpage>D844</fpage>&#x2013;<lpage>D848</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku770</pub-id> <pub-id pub-id-type="pmid">25190456</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alamartine</surname> <given-names>E.</given-names></name> <name><surname>Sabatier</surname> <given-names>J. C.</given-names></name> <name><surname>Guerin</surname> <given-names>C.</given-names></name> <name><surname>Berliet</surname> <given-names>J. M.</given-names></name> <name><surname>Berthoux</surname> <given-names>F.</given-names></name></person-group> (<year>1991</year>). <article-title>Prognostic factors in mesangial IgA glomerulonephritis: an extensive study with univariate and multivariate analyses.</article-title> <source><italic>Am. J. Kidney Dis.</italic></source> <volume>18</volume> <fpage>12</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1016/s0272-6386(12)80284-8</pub-id> <pub-id pub-id-type="pmid">2063844</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bartel</surname> <given-names>D. P.</given-names></name></person-group> (<year>2004</year>). <article-title>MicroRNAs: genomics, biogenesis, mechanism, and function.</article-title> <source><italic>Cell</italic></source> <volume>116</volume> <fpage>281</fpage>&#x2013;<lpage>297</lpage>. <pub-id pub-id-type="pmid">14744438</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baylin</surname> <given-names>S. B.</given-names></name></person-group> (<year>2005</year>). <article-title>DNA methylation and gene silencing in cancer.</article-title> <source><italic>Nat. Clin. Pract. Oncol.</italic></source> <volume>2</volume>(<issue>Suppl. 1</issue>), <fpage>S4</fpage>&#x2013;<lpage>S11</lpage>. <pub-id pub-id-type="pmid">16341240</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berdasco</surname> <given-names>M.</given-names></name> <name><surname>Esteller</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Aberrant epigenetic landscape in cancer: how cellular identity goes awry.</article-title> <source><italic>Dev. Cell</italic></source> <volume>19</volume> <fpage>698</fpage>&#x2013;<lpage>711</lpage>. <pub-id pub-id-type="doi">10.1016/j.devcel.2010.10.005</pub-id> <pub-id pub-id-type="pmid">21074720</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernardini</surname> <given-names>J. P.</given-names></name> <name><surname>Lazarou</surname> <given-names>M.</given-names></name> <name><surname>Dewson</surname> <given-names>G.</given-names></name></person-group> (<year>2017</year>). <article-title>Parkin and mitophagy in cancer.</article-title> <source><italic>Oncogene</italic></source> <volume>36</volume> <fpage>1315</fpage>&#x2013;<lpage>1327</lpage>. <pub-id pub-id-type="doi">10.1038/onc.2016.302</pub-id> <pub-id pub-id-type="pmid">27593930</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burrell</surname> <given-names>R. A.</given-names></name> <name><surname>Mcgranahan</surname> <given-names>N.</given-names></name> <name><surname>Bartek</surname> <given-names>J.</given-names></name> <name><surname>Swanton</surname> <given-names>C.</given-names></name></person-group> (<year>2013</year>). <article-title>The causes and consequences of genetic heterogeneity in cancer evolution.</article-title> <source><italic>Nature</italic></source> <volume>501</volume> <fpage>338</fpage>&#x2013;<lpage>345</lpage>. <pub-id pub-id-type="doi">10.1038/nature12625</pub-id> <pub-id pub-id-type="pmid">24048066</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cagney</surname> <given-names>D. N.</given-names></name> <name><surname>Sul</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>R. Y.</given-names></name> <name><surname>Ligon</surname> <given-names>K. L.</given-names></name> <name><surname>Wen</surname> <given-names>P. Y.</given-names></name> <name><surname>Alexander</surname> <given-names>B. M.</given-names></name></person-group> (<year>2018</year>). <article-title>The FDA NIH biomarkers, endpoints, and other tools (BEST) resource in neuro-oncology.</article-title> <source><italic>Neurol. Oncol.</italic></source> <volume>20</volume> <fpage>1162</fpage>&#x2013;<lpage>1172</lpage>. <pub-id pub-id-type="doi">10.1093/neuonc/nox242</pub-id> <pub-id pub-id-type="pmid">29294069</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><collab>Cancer Genome Atlas Research Network</collab> (<year>2011</year>). <article-title>Integrated genomic analyses of ovarian carcinoma.</article-title> <source><italic>Nature</italic></source> <volume>474</volume> <fpage>609</fpage>&#x2013;<lpage>615</lpage>. <pub-id pub-id-type="doi">10.1038/nature10166</pub-id> <pub-id pub-id-type="pmid">21720365</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><collab>Cancer Genome Atlas Research Network</collab> (<year>2012</year>). <article-title>Comprehensive genomic characterization of squamous cell lung cancers.</article-title> <source><italic>Nature</italic></source> <volume>489</volume> <fpage>519</fpage>&#x2013;<lpage>525</lpage>. <pub-id pub-id-type="doi">10.1038/nature11404</pub-id> <pub-id pub-id-type="pmid">22960745</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><collab>Cancer Genome Atlas Research Network</collab> (<year>2013</year>). <article-title>Comprehensive molecular characterization of clear cell renal cell carcinoma.</article-title> <source><italic>Nature</italic></source> <volume>499</volume> <fpage>43</fpage>&#x2013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1038/nature12222</pub-id> <pub-id pub-id-type="pmid">23792563</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><collab>Cancer Genome Atlas Research Network</collab> <person-group person-group-type="author"><name><surname>Linehan</surname> <given-names>W. M.</given-names></name> <name><surname>Spellman</surname> <given-names>P. T.</given-names></name> <name><surname>Ricketts</surname> <given-names>C. J.</given-names></name> <name><surname>Creighton</surname> <given-names>C. J.</given-names></name> <name><surname>Fei</surname> <given-names>S. S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Comprehensive molecular characterization of papillary renal-cell carcinoma.</article-title> <source><italic>N. Engl. J. Med.</italic></source> <volume>374</volume> <fpage>135</fpage>&#x2013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa1505917</pub-id> <pub-id pub-id-type="pmid">26536169</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaudhary</surname> <given-names>K.</given-names></name> <name><surname>Poirion</surname> <given-names>O. B.</given-names></name> <name><surname>Lu</surname> <given-names>L.</given-names></name> <name><surname>Garmire</surname> <given-names>L. X.</given-names></name></person-group> (<year>2018</year>). <article-title>Deep learning-based multi-omics integration robustly predicts survival in liver cancer.</article-title> <source><italic>Clin. Cancer Res.</italic></source> <volume>24</volume> <fpage>1248</fpage>&#x2013;<lpage>1259</lpage>. <pub-id pub-id-type="doi">10.1158/1078-0432.CCR-17-0853</pub-id> <pub-id pub-id-type="pmid">28982688</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collett</surname> <given-names>D.</given-names></name></person-group> (<year>2015</year>). <source><italic>Modelling Survival Data In Medical Research.</italic></source> <publisher-loc>Boca Raton, FL</publisher-loc>: <publisher-name>Chapman and Hall/CRC</publisher-name>.</citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cox</surname> <given-names>D.</given-names></name></person-group> (<year>1986</year>). <article-title>Citation-classic - regression-models and life-tables.</article-title> <source><italic>Curr. Contents Agric. Biol. Environ. Sci.</italic></source> <volume>34</volume>:<issue>16</issue>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dalerba</surname> <given-names>P.</given-names></name> <name><surname>Sahoo</surname> <given-names>D.</given-names></name> <name><surname>Paik</surname> <given-names>S.</given-names></name> <name><surname>Guo</surname> <given-names>X.</given-names></name> <name><surname>Yothers</surname> <given-names>G.</given-names></name> <name><surname>Song</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>CDX2 as a prognostic biomarker in stage II and stage III colon cancer.</article-title> <source><italic>N. Engl. J. Med.</italic></source> <volume>374</volume> <fpage>211</fpage>&#x2013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa1506597</pub-id> <pub-id pub-id-type="pmid">26789870</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fehrmann</surname> <given-names>R. S.</given-names></name> <name><surname>Karjalainen</surname> <given-names>J. M.</given-names></name> <name><surname>Krajewska</surname> <given-names>M.</given-names></name> <name><surname>Westra</surname> <given-names>H. J.</given-names></name> <name><surname>Maloney</surname> <given-names>D.</given-names></name> <name><surname>Simeonov</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Gene expression analysis identifies global gene dosage sensitivity in cancer.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>47</volume> <fpage>115</fpage>&#x2013;<lpage>125</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3173</pub-id> <pub-id pub-id-type="pmid">25581432</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fuchs</surname> <given-names>M.</given-names></name> <name><surname>Beissbarth</surname> <given-names>T.</given-names></name> <name><surname>Wingender</surname> <given-names>E.</given-names></name> <name><surname>Jung</surname> <given-names>K.</given-names></name></person-group> (<year>2013</year>). <article-title>Connecting high-dimensional mRNA and miRNA expression data for binary medical classification problems.</article-title> <source><italic>Comput. Methods Program. Biomed.</italic></source> <volume>111</volume> <fpage>592</fpage>&#x2013;<lpage>601</lpage>. <pub-id pub-id-type="doi">10.1016/j.cmpb.2013.05.013</pub-id> <pub-id pub-id-type="pmid">23849930</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gene Ontology</surname> <given-names>C.</given-names></name></person-group> (<year>2015</year>). <article-title>Gene ontology consortium: going forward.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <fpage>D1049</fpage>&#x2013;<lpage>D1056</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku1179</pub-id> <pub-id pub-id-type="pmid">25428369</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Group</surname> <given-names>P. T. C.</given-names></name> <name><surname>Calabrese</surname> <given-names>C.</given-names></name> <name><surname>Davidson</surname> <given-names>N. R.</given-names></name> <name><surname>Demircioglu</surname> <given-names>D.</given-names></name> <name><surname>Fonseca</surname> <given-names>N. A.</given-names></name> <name><surname>He</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Genomic basis for RNA alterations in cancer.</article-title> <source><italic>Nature</italic></source> <volume>578</volume> <fpage>129</fpage>&#x2013;<lpage>136</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-020-1970-0</pub-id> <pub-id pub-id-type="pmid">32025019</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harrell</surname> <given-names>F. E.</given-names> <suffix>Jr.</suffix></name> <name><surname>Lee</surname> <given-names>K. L.</given-names></name> <name><surname>Mark</surname> <given-names>D. B.</given-names></name></person-group> (<year>1996</year>). <article-title>Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.</article-title> <source><italic>Stat. Med.</italic></source> <volume>15</volume> <fpage>361</fpage>&#x2013;<lpage>387</lpage>. <pub-id pub-id-type="doi">10.1002/(sici)1097-0258(19960229)15:4&#x003C;361::aid-sim168&#x003E;3.0.co;2-4</pub-id> <pub-id pub-id-type="pmid">8668867</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>S.</given-names></name> <name><surname>Chaudhary</surname> <given-names>K.</given-names></name> <name><surname>Garmire</surname> <given-names>L. X.</given-names></name></person-group> (<year>2017</year>). <article-title>More is better: recent progress in multi-omics data integration methods.</article-title> <source><italic>Front. Genet.</italic></source> <volume>8</volume>:<issue>84</issue>. <pub-id pub-id-type="doi">10.3389/fgene.2017.00084</pub-id> <pub-id pub-id-type="pmid">28670325</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name></person-group> (<year>2000</year>). <article-title>KEGG: kyoto encyclopedia of genes and genomes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>28</volume> <fpage>27</fpage>&#x2013;<lpage>30</lpage>. <pub-id pub-id-type="pmid">10592173</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kenfield</surname> <given-names>S. A.</given-names></name> <name><surname>Wei</surname> <given-names>E. K.</given-names></name> <name><surname>Stampfer</surname> <given-names>M. J.</given-names></name> <name><surname>Rosner</surname> <given-names>B. A.</given-names></name> <name><surname>Colditz</surname> <given-names>G. A.</given-names></name></person-group> (<year>2008</year>). <article-title>Comparison of aspects of smoking among the four histological types of lung cancer.</article-title> <source><italic>Tob. Control.</italic></source> <volume>17</volume> <fpage>198</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1136/tc.2007.022582</pub-id> <pub-id pub-id-type="pmid">18390646</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuleshov</surname> <given-names>M. V.</given-names></name> <name><surname>Jones</surname> <given-names>M. R.</given-names></name> <name><surname>Rouillard</surname> <given-names>A. D.</given-names></name> <name><surname>Fernandez</surname> <given-names>N. F.</given-names></name> <name><surname>Duan</surname> <given-names>Q.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>W90</fpage>&#x2013;<lpage>W97</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw377</pub-id> <pub-id pub-id-type="pmid">27141961</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>I. H.</given-names></name> <name><surname>Lee</surname> <given-names>S. H.</given-names></name> <name><surname>Park</surname> <given-names>T. H.</given-names></name> <name><surname>Zhang</surname> <given-names>B. T.</given-names></name></person-group> (<year>2013</year>). <article-title>Non-linear molecular pattern classification using molecular beacons with multiple targets.</article-title> <source><italic>Biosystems</italic></source> <volume>114</volume> <fpage>206</fpage>&#x2013;<lpage>213</lpage>. <pub-id pub-id-type="doi">10.1016/j.biosystems.2013.05.008</pub-id> <pub-id pub-id-type="pmid">23743339</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindahl</surname> <given-names>L. M.</given-names></name> <name><surname>Besenbacher</surname> <given-names>S.</given-names></name> <name><surname>Rittig</surname> <given-names>A. H.</given-names></name> <name><surname>Celis</surname> <given-names>P.</given-names></name> <name><surname>Willerslev-Olsen</surname> <given-names>A.</given-names></name> <name><surname>Gjerdrum</surname> <given-names>L. M. R.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Prognostic miRNA classifier in early-stage mycosis fungoides: development and validation in a Danish nationwide study.</article-title> <source><italic>Blood</italic></source> <volume>131</volume> <fpage>759</fpage>&#x2013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2017-06-788950</pub-id> <pub-id pub-id-type="pmid">29208599</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Lichtenberg</surname> <given-names>T.</given-names></name> <name><surname>Hoadley</surname> <given-names>K. A.</given-names></name> <name><surname>Poisson</surname> <given-names>L. M.</given-names></name> <name><surname>Lazar</surname> <given-names>A. J.</given-names></name> <name><surname>Cherniack</surname> <given-names>A. D.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics.</article-title> <source><italic>Cell</italic></source> <volume>173</volume> <fpage>400</fpage>&#x2013;<lpage>416e411</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.02.052</pub-id> <pub-id pub-id-type="pmid">29625055</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marusyk</surname> <given-names>A.</given-names></name> <name><surname>Almendro</surname> <given-names>V.</given-names></name> <name><surname>Polyak</surname> <given-names>K.</given-names></name></person-group> (<year>2012</year>). <article-title>Intra-tumour heterogeneity: a looking glass for cancer?</article-title> <source><italic>Nat. Rev. Cancer</italic></source> <volume>12</volume> <fpage>323</fpage>&#x2013;<lpage>334</lpage>. <pub-id pub-id-type="doi">10.1038/nrc3261</pub-id> <pub-id pub-id-type="pmid">22513401</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mcshane</surname> <given-names>L. M.</given-names></name> <name><surname>Hayes</surname> <given-names>D. F.</given-names></name></person-group> (<year>2012</year>). <article-title>Publication of tumor marker research results: the necessity for complete and transparent reporting.</article-title> <source><italic>J. Clin. Oncol.</italic></source> <volume>30</volume> <fpage>4223</fpage>&#x2013;<lpage>4232</lpage>. <pub-id pub-id-type="doi">10.1200/JCO.2012.42.6858</pub-id> <pub-id pub-id-type="pmid">23071235</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mermel</surname> <given-names>C. H.</given-names></name> <name><surname>Schumacher</surname> <given-names>S. E.</given-names></name> <name><surname>Hill</surname> <given-names>B.</given-names></name> <name><surname>Meyerson</surname> <given-names>M. L.</given-names></name> <name><surname>Beroukhim</surname> <given-names>R.</given-names></name> <name><surname>Getz</surname> <given-names>G.</given-names></name></person-group> (<year>2011</year>). <article-title>GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers.</article-title> <source><italic>Genome Biol.</italic></source> <volume>12</volume>:<issue>R41</issue>. <pub-id pub-id-type="doi">10.1186/gb-2011-12-4-r41</pub-id> <pub-id pub-id-type="pmid">21527027</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mishra</surname> <given-names>N. K.</given-names></name> <name><surname>Southekal</surname> <given-names>S.</given-names></name> <name><surname>Guda</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Survival analysis of multi-omics data identifies potential prognostic markers of pancreatic ductal adenocarcinoma.</article-title> <source><italic>Front. Genet.</italic></source> <volume>10</volume>:<issue>624</issue>. <pub-id pub-id-type="doi">10.3389/fgene.2019.00624</pub-id> <pub-id pub-id-type="pmid">31379917</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morikawa</surname> <given-names>A.</given-names></name> <name><surname>Hayashi</surname> <given-names>T.</given-names></name> <name><surname>Kobayashi</surname> <given-names>M.</given-names></name> <name><surname>Kato</surname> <given-names>Y.</given-names></name> <name><surname>Shirahige</surname> <given-names>K.</given-names></name> <name><surname>Itoh</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Somatic copy number alterations have prognostic impact in patients with ovarian clear cell carcinoma.</article-title> <source><italic>Oncol. Rep.</italic></source> <volume>40</volume> <fpage>309</fpage>&#x2013;<lpage>318</lpage>. <pub-id pub-id-type="doi">10.3892/or.2018.6419</pub-id> <pub-id pub-id-type="pmid">29749539</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nalejska</surname> <given-names>E.</given-names></name> <name><surname>Maczynska</surname> <given-names>E.</given-names></name> <name><surname>Lewandowska</surname> <given-names>M. A.</given-names></name></person-group> (<year>2014</year>). <article-title>Prognostic and predictive biomarkers: tools in personalized oncology.</article-title> <source><italic>Mol. Diagn. Ther.</italic></source> <volume>18</volume> <fpage>273</fpage>&#x2013;<lpage>284</lpage>. <pub-id pub-id-type="doi">10.1007/s40291-013-0077-9</pub-id> <pub-id pub-id-type="pmid">24385403</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rappoport</surname> <given-names>N.</given-names></name> <name><surname>Shamir</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>). <article-title>Multi-omic and multi-view clustering algorithms: review and cancer benchmark.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>10546</fpage>&#x2013;<lpage>10562</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky889</pub-id> <pub-id pub-id-type="pmid">30295871</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rodriguez-Martin</surname> <given-names>B.</given-names></name> <name><surname>Alvarez</surname> <given-names>E. G.</given-names></name> <name><surname>Baez-Ortega</surname> <given-names>A.</given-names></name> <name><surname>Zamora</surname> <given-names>J.</given-names></name> <name><surname>Supek</surname> <given-names>F.</given-names></name> <name><surname>Demeulemeester</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition.</article-title> <source><italic>Nat. Genet</italic>.</source> <volume>52</volume> <fpage>1</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-019-0562-0</pub-id> <pub-id pub-id-type="pmid">32024998</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siegel</surname> <given-names>R. L.</given-names></name> <name><surname>Miller</surname> <given-names>K. D.</given-names></name> <name><surname>Jemal</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>Cancer statistics, 2020.</article-title> <source><italic>CA Cancer J. Clin.</italic></source> <volume>70</volume> <fpage>7</fpage>&#x2013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.3322/caac.21590</pub-id> <pub-id pub-id-type="pmid">31912902</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sondka</surname> <given-names>Z.</given-names></name> <name><surname>Bamford</surname> <given-names>S.</given-names></name> <name><surname>Cole</surname> <given-names>C. G.</given-names></name> <name><surname>Ward</surname> <given-names>S. A.</given-names></name> <name><surname>Dunham</surname> <given-names>I.</given-names></name> <name><surname>Forbes</surname> <given-names>S. A.</given-names></name></person-group> (<year>2018</year>). <article-title>The COSMIC cancer gene census: describing genetic dysfunction across all human cancers.</article-title> <source><italic>Nat. Rev. Cancer</italic></source> <volume>18</volume> <fpage>696</fpage>&#x2013;<lpage>705</lpage>. <pub-id pub-id-type="doi">10.1038/s41568-018-0060-1</pub-id> <pub-id pub-id-type="pmid">30293088</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Swanton</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Intratumor heterogeneity: evolution through space and time.</article-title> <source><italic>Cancer Res.</italic></source> <volume>72</volume> <fpage>4875</fpage>&#x2013;<lpage>4882</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-12-2217</pub-id> <pub-id pub-id-type="pmid">23002210</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szklarczyk</surname> <given-names>D.</given-names></name> <name><surname>Gable</surname> <given-names>A. L.</given-names></name> <name><surname>Lyon</surname> <given-names>D.</given-names></name> <name><surname>Junge</surname> <given-names>A.</given-names></name> <name><surname>Wyder</surname> <given-names>S.</given-names></name> <name><surname>Huerta-Cepas</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>D607</fpage>&#x2013;<lpage>D613</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1131</pub-id> <pub-id pub-id-type="pmid">30476243</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tate</surname> <given-names>J. G.</given-names></name> <name><surname>Bamford</surname> <given-names>S.</given-names></name> <name><surname>Jubb</surname> <given-names>H. C.</given-names></name> <name><surname>Sondka</surname> <given-names>Z.</given-names></name> <name><surname>Beare</surname> <given-names>D. M.</given-names></name> <name><surname>Bindal</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>COSMIC: the catalogue of somatic mutations in cancer.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>D941</fpage>&#x2013;<lpage>D947</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1015</pub-id> <pub-id pub-id-type="pmid">30371878</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tusher</surname> <given-names>V. G.</given-names></name> <name><surname>Tibshirani</surname> <given-names>R.</given-names></name> <name><surname>Chu</surname> <given-names>G.</given-names></name></person-group> (<year>2001</year>). <article-title>Significance analysis of microarrays applied to the ionizing radiation response.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>98</volume> <fpage>5116</fpage>&#x2013;<lpage>5121</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.091062498</pub-id> <pub-id pub-id-type="pmid">11309499</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tyner</surname> <given-names>C.</given-names></name> <name><surname>Barber</surname> <given-names>G. P.</given-names></name> <name><surname>Casper</surname> <given-names>J.</given-names></name> <name><surname>Clawson</surname> <given-names>H.</given-names></name> <name><surname>Diekhans</surname> <given-names>M.</given-names></name> <name><surname>Eisenhart</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>The UCSC genome browser database: 2017 update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>45</volume> <fpage>D626</fpage>&#x2013;<lpage>D634</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1134</pub-id> <pub-id pub-id-type="pmid">27899642</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vasaikar</surname> <given-names>S. V.</given-names></name> <name><surname>Straub</surname> <given-names>P.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>B.</given-names></name></person-group> (<year>2018</year>). <article-title>LinkedOmics: analyzing multi-omics data within and across 32 cancer types.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>D956</fpage>&#x2013;<lpage>D963</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1090</pub-id> <pub-id pub-id-type="pmid">29136207</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vickers</surname> <given-names>A. J.</given-names></name> <name><surname>Cronin</surname> <given-names>A. M.</given-names></name> <name><surname>Elkin</surname> <given-names>E. B.</given-names></name> <name><surname>Gonen</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers.</article-title> <source><italic>BMC Med. Inform. Decis. Mak.</italic></source> <volume>8</volume>:<issue>53</issue>. <pub-id pub-id-type="doi">10.1186/1472-6947-8-53</pub-id> <pub-id pub-id-type="pmid">19036144</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>A.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Peng</surname> <given-names>H.</given-names></name> <name><surname>Han</surname> <given-names>G.</given-names></name> <name><surname>Cai</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>Simultaneous interrogation of cancer omics to identify subtypes with significant clinical differences.</article-title> <source><italic>Front. Genet.</italic></source> <volume>10</volume>:<issue>236</issue>. <pub-id pub-id-type="doi">10.3389/fgene.2019.00236</pub-id> <pub-id pub-id-type="pmid">30984238</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J. H.</given-names></name> <name><surname>Li</surname> <given-names>J. H.</given-names></name> <name><surname>Shao</surname> <given-names>P.</given-names></name> <name><surname>Zhou</surname> <given-names>H.</given-names></name> <name><surname>Chen</surname> <given-names>Y. Q.</given-names></name> <name><surname>Qu</surname> <given-names>L. H.</given-names></name></person-group> (<year>2011</year>). <article-title>starBase: a database for exploring microRNA-mRNA interaction maps from argonaute CLIP-Seq and degradome-seq data.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>39</volume> <fpage>D202</fpage>&#x2013;<lpage>D209</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkq1056</pub-id> <pub-id pub-id-type="pmid">21037263</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname> <given-names>Y.</given-names></name> <name><surname>Van Allen</surname> <given-names>E. M.</given-names></name> <name><surname>Omberg</surname> <given-names>L.</given-names></name> <name><surname>Wagle</surname> <given-names>N.</given-names></name> <name><surname>Amin-Mansour</surname> <given-names>A.</given-names></name> <name><surname>Sokolov</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Assessing the clinical utility of cancer genomic and proteomic data across tumor types.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>32</volume> <fpage>644</fpage>&#x2013;<lpage>652</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2940</pub-id> <pub-id pub-id-type="pmid">24952901</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>F.</given-names></name> <name><surname>Ren</surname> <given-names>C.</given-names></name> <name><surname>Lau</surname> <given-names>K. K.</given-names></name> <name><surname>Zheng</surname> <given-names>Z.</given-names></name> <name><surname>Lu</surname> <given-names>G.</given-names></name> <name><surname>Yi</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>A network medicine approach to build a comprehensive atlas for the prognosis of human cancer.</article-title> <source><italic>Brief Bioinform.</italic></source> <volume>17</volume> <fpage>1044</fpage>&#x2013;<lpage>1059</lpage>. <pub-id pub-id-type="pmid">27559151</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>S.</given-names></name> <name><surname>Geybels</surname> <given-names>M. S.</given-names></name> <name><surname>Leonardson</surname> <given-names>A.</given-names></name> <name><surname>Rubicz</surname> <given-names>R.</given-names></name> <name><surname>Kolb</surname> <given-names>S.</given-names></name> <name><surname>Yan</surname> <given-names>Q.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Epigenome-wide tumor DNA methylation profiling identifies novel prognostic biomarkers of metastatic-lethal progression in men diagnosed with clinically localized prostate cancer.</article-title> <source><italic>Clin. Cancer Res.</italic></source> <volume>23</volume> <fpage>311</fpage>&#x2013;<lpage>319</lpage>. <pub-id pub-id-type="doi">10.1158/1078-0432.CCR-16-0549</pub-id> <pub-id pub-id-type="pmid">27358489</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>B.</given-names></name> <name><surname>Song</surname> <given-names>N.</given-names></name> <name><surname>Shen</surname> <given-names>R.</given-names></name> <name><surname>Arora</surname> <given-names>A.</given-names></name> <name><surname>Machiela</surname> <given-names>M. J.</given-names></name> <name><surname>Song</surname> <given-names>L.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Integrating clinical and multiple omics data for prognostic assessment across human cancers.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>7</volume>:<issue>16954</issue>. <pub-id pub-id-type="doi">10.1038/s41598-017-17031-8</pub-id> <pub-id pub-id-type="pmid">29209073</pub-id></citation></ref>
</ref-list>
</back>
</article>