<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2020.00230</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Effect for Human Genomic Variation During the BMP4-Induced Conversion From Pluripotent Stem Cells to Trophoblast</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Li</surname> <given-names>Hai-tao</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/939131/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Liu</surname> <given-names>Yajun</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Liu</surname> <given-names>Hongde</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/864169/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Sun</surname> <given-names>Xiao</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/306751/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University</institution>, <addr-line>Nanjing</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>The Second Affiliated Hospital of Zhengzhou University</institution>, <addr-line>Zhengzhou</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>Academy of Medical Sciences of Zhengzhou University Translational Medicine Platform, Zhengzhou University</institution>, <addr-line>Zhengzhou</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Jianzhong Su, Wenzhou Medical University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Wei Jiang, Nanjing University of Aeronautics and Astronautics, China; Ruiping Wang, The University of Texas MD Anderson Cancer Center, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Hongde Liu, <email>liuhongde@seu.edu.cn</email></corresp>
<corresp id="c002">Xiao Sun, <email>xsun@seu.edu.cn</email></corresp>
<fn fn-type="other" id="fn002"><p><sup>&#x2020;</sup>These authors have contributed equally to this work and share first authorship</p></fn>
<fn fn-type="other" id="fn004"><p>This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>04</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>11</volume>
<elocation-id>230</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>12</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>26</day>
<month>02</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2020 Li, Liu, Liu and Sun.</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>Li, Liu, Liu and Sun</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The role of genomic variation in differentiation is currently not well understood. Here, the genomic variations were determined with the whole-genome sequencing for three pairs of pluripotent stem cell lines and their corresponding BMP4-induced trophoblast cell lines. We identified &#x223C;3,500 single nucleotide variations and &#x223C;4,500 indels by comparing the genome sequenced data between the stem cell lines and the matched BMP4-induced trophoblast cell lines and annotated them by integrating the epigenomic and transcriptomic datasets. Relatively, introns enrich more variations. We found &#x223C;45% (42 genes) of the differentially expressed genes in trophoblasts that associate genomic variations. Six variations, located at transcription factor binding sites where H3K4me3 and H3K27ac are enriched in both H1 and H1_BMP4, were identified. The epigenetic status around the genomic variations in H1 was similar to that in H1_BMP4. This means that the variation-associated gene&#x2019;s expression change can not be attributed to epigenetic alteration. The genes associated with the six variations were upregulated in differentiation. We inferred that during the differentiation, an increased in the expression level of the MEF2C gene is due to a genomic variation in chromosomes 5: 88179358 A &#x003E; G, which is at a binding site of TFs KLF16, NR2C2, and ZNF740 to MEF2C. Allele G shows a higher affinity to the TFs in the induced cells. The increased expression of MEF2C leads to an increased expression of TF MEF2C&#x2019;s target genes, subsequently affecting the differentiation. Although genomic variation should not be a dominant factor in differentiation, we believe that genomic variation could indeed play a role in the differentiation from stem cells into trophoblast.</p>
</abstract>
<kwd-group>
<kwd>genomic variations</kwd>
<kwd>pluripotent stem cells</kwd>
<kwd>trophoblast</kwd>
<kwd>whole genome sequencing</kwd>
<kwd>epigenomic and transcriptomic data</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="63"/>
<page-count count="12"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Stem cell differentiation involves a complex but poorly understood biological process. Genetic and epigenetic factors have in the past been intensively studied for this process. Some key transcription factors (TFs) affecting differentiation have been identified, such as POU5F1, SOX2, and KLF4 (<xref ref-type="bibr" rid="B36">Nichols et al., 1998</xref>; <xref ref-type="bibr" rid="B50">Takahashi and Yamanaka, 2006</xref>; <xref ref-type="bibr" rid="B12">Goolam et al., 2016</xref>). Previously, our lab identified TFs for DNase I hypersensitive sites based on public available chromatin accessibility data of human embryonic stem cells (ESCs) and BMP4-induced trophoblast cell lines (<xref ref-type="bibr" rid="B33">Liu et al., 2017</xref>). The chromatin structure, including nucleosome positions, DNA methylation, and histone modifications also changed during the differentiation (<xref ref-type="bibr" rid="B49">Su et al., 2012</xref>; <xref ref-type="bibr" rid="B55">Xie et al., 2013</xref>; <xref ref-type="bibr" rid="B7">de Boni et al., 2018</xref>). However, genomic variation, another important factor, was rarely considered in the differentiation-related study.</p>
<p>The cells forming the outer layer of a blastocyst in early development are referred to as trophoblasts. These cells significantly contribute to the placenta and membrane, and also provides nutrients to the embryo. This layer of trophoblasts is collectively named &#x201C;the trophoblast.&#x201D; The trophoblast is the first cell type differentiated from the zygote during the first stage of pregnancy. Human ESCs provide a very useful model for studying the early development of human embryos and trophoblasts. However, the genomic variation that occurs in the trophoblast differentiation is not well studied <italic>in vivo</italic>. Since Xu et al. proved that bone morphogenetic protein 4 (BMP4) could induce human ESCs, efficiently differentiating to trophoblast lineage, multiple research institutes have adopted this system as a model to study trophoblast lineage specification <italic>in vitro</italic> (<xref ref-type="bibr" rid="B57">Xu et al., 2002</xref>). Roberts&#x2019;s et al. considered this model reliability by analyzing gene expression profiling through RNA sequencing (RNA-Seq) technology (<xref ref-type="bibr" rid="B58">Yabe et al., 2016</xref>; <xref ref-type="bibr" rid="B33">Liu et al., 2017</xref>).</p>
<p>Recently, studies have demonstrated how genetic variation affects gene expression (<xref ref-type="bibr" rid="B3">Banovich et al., 2018</xref>; <xref ref-type="bibr" rid="B9">Delahaye et al., 2018</xref>). <xref ref-type="bibr" rid="B20">Kilpinen et al. (2013)</xref> suggest that genetic variation appears to be the primary driver of gene expression variation. Furthermore, <xref ref-type="bibr" rid="B8">DeBoever et al. (2017)</xref> show that the vast majority of genetic variations associated with gene expression levels are located in the regulatory regions of human induced pluripotent stem cells (iPSCs).</p>
<p>An integrative analysis using combinations of genomic, epigenomic, and transcriptomic data will provide a basis for biomarker discovery and help to provide insight of disease etiology. Genomic sequence variation, epigenetic factor, and gene expression are interdependent and jointly contribute to the normal functioning or dysfunction of tissue (<xref ref-type="bibr" rid="B9">Delahaye et al., 2018</xref>). For example, the sequencing variations can alter the TF binding strength to regulate gene expression directly (<xref ref-type="bibr" rid="B18">Karczewski et al., 2011</xref>; <xref ref-type="bibr" rid="B34">Madsen et al., 2018</xref>; <xref ref-type="bibr" rid="B17">Johnston et al., 2019</xref>). We are interested in the role of the genome variations in differentiation from human ESCs to trophoblasts.</p>
<p>In this study, we analyzed the genomic DNA sequences for three paired human ESCs and iPSC (H1, H9, and MRucR) (<xref ref-type="bibr" rid="B45">Sheridan et al., 2019</xref>). We demonstrated that some of the genomic variations can affect the differentiation by altering TFs&#x2019; binding affinity.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Whole Genome Sequence of Cell Lines in This Study</title>
<p>Genomic DNA sequences were determined for three stem cells and their corresponding BMP4-induced cell lines (<xref ref-type="supplementary-material" rid="PS1">Supplementary Table S1</xref>), in which two types of stem cells were human ESC lines (H1 and H9), and third an iPSC cell line, MRucR. The cell lines that differentiated into the trophoblast were named H1_BMP4, H9_BMP4, and MRucR_BMP4, respectively. The three cell lines and the matched BMP4 induced cell lines was obtained from the Roberts&#x2019;s laboratory, University of Missouri. For more details of these three pairs of cell lines see <xref ref-type="bibr" rid="B45">Sheridan et al. (2019)</xref>. MRucR iPSC was established from umbilical cords of babies born to mothers who experienced an early-onset form of preeclampsia during their pregnancies. The BMP4-inducing experiment was performed as previously described (<xref ref-type="bibr" rid="B45">Sheridan et al., 2019</xref>). Briefly, the trophoblast stem cells were exposed to BMP4 in combination with signaling inhibitors of ACTIVIN-A (A83-01) and FGF2 (PD173074) (BAP treatment) (<xref ref-type="bibr" rid="B45">Sheridan et al., 2019</xref>).</p>
</sec>
<sec id="S2.SS2">
<title>Library Preparation and Sequencing</title>
<p>The quality of isolated genomic DNA was verified using these three methods: (1) DNA purity and concentration were identified by NanoPhotometer<sup>&#x00AE;</sup> spectrophotometer (IMPLEN, CA, United States) (OD260/OD280). The Optical Density (OD) value of the qualified sample ranged between 1.8 &#x2013; 2.0. (2) DNA degradation, and suspected RNA/Protein contamination were verified by electrophoresis on 1% agarose gels. (3) The concentration and purity of DNA samples were further quantified precisely by the Qubit DNA Assay Kit in Qubit<sup>&#x00AE;</sup>2.0 Flurometer (Life Technologies, CA, United States). A total amount of 1 &#x03BC;g DNA per sample was required for library generation.</p>
<p>Paired-end DNA libraries were prepared according to the manufacturer&#x2019;s instructions (Illumina Truseq Library Construction). First, 1.0 &#x03BC;g Genomic DNA was sheared into an average size of 350 base pair (bp) fragments by Covaris S220 sonicator. Second, the ends of the gDNA fragments were repaired; 3&#x2032; ends were adenylated. Both ends of the gDNA fragments were ligated at the 3&#x2032; ends with paired-end adaptors (Illumina) with a single &#x2018;T&#x2019; base overhang, and purified using AMPure SPRI beads from Agencourt. The size distribution and concentration of the libraries were then determined using Agilent 2100 Bioanalyzer and were qualified by real-time PCR (2 nM), respectively. Lastly, DNA libraries were sequenced on Illumina Hiseq X according to the manufacturer&#x2019;s instructions for paired-end 150 bp reads.</p>
</sec>
<sec id="S2.SS3">
<title>Whole Genome Sequencing Data Analysis</title>
<p>The raw image files obtained from the Hi-Seq platform were processed with the Illumina pipeline for base calling and stored as FASTQ format (Raw data). Quality control (QC) was as follows: (1) to filter reads with adapter contamination (&#x003E;10 bp aligned to the adapter allowing &#x2264; 10% mismatches). (2) to discard the reads containing more than 10% uncertain nucleotides. (3) to discard the paired reads when a single read has more than 50% low quality nucleotides (Phred quality &#x003C; 5).</p>
<p>After quality control, the sequenced reads were mapped to the GRCh37 assembly of the human genome by Burrows-Wheeler Aligner (BWA) software using default settings (<xref ref-type="bibr" rid="B28">Li and Durbin, 2009</xref>). Subsequently, we used Samtools (<xref ref-type="bibr" rid="B29">Li et al., 2009</xref>) and Picard<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> with default settings to sort reads, remove duplicated reads, and to generate the final bam file. If one or more pair read(s) had multiple mapping positions, the best one was selected. If there were multiple best mapping positions, we randomly picked one.</p>
</sec>
<sec id="S2.SS4">
<title>Variation Calling and Functional Annotation</title>
<p>The analysis flowchart is shown in <xref ref-type="fig" rid="F1">Figure 1A</xref>. The genomic variations were determined with the whole-genome sequencing for three pairs of pluripotent stem cell lines and their corresponding BMP4-induced trophoblast cell lines and annotated them by integrating the epigenomic and transcriptomic datasets. We only focused on the direct effect on the differentiation of genomic variation which is located in those genomic regions where epigenetic markers remain comparable between H1 and H1_BMP4.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Genomic variations during the differentiation. <bold>(A)</bold> Flow chart of the analysis. <bold>(B)</bold> The variations occurred in different genomic regions. In this figure, down/upstream means the variant overlaps the 1-kb region downstream of transcription end site or upstream of TSS.</p></caption>
<graphic xlink:href="fgene-11-00230-g001.tif"/>
</fig>
<p>Genomic variations were identified by comparing genomic DNA sequences between stem cell lines and matched BMP4-induced trophoblast cell lines using the following steps. Single nucleotide variation (SNV) and small somatic insertions and deletions (indels) were identified using the Strelka2 (<xref ref-type="bibr" rid="B21">Kim et al., 2018</xref>) tool. All &#x2018;PASS&#x2019; calls identified by Strelka2 were retained for downstream analyses. BreakDancer (<xref ref-type="bibr" rid="B4">Chen et al., 2009</xref>) was applied to detect structural variation (SV). The SVs that received minimal confidence scores (90 for insertions, inversions, deletions, and translocations) were selected for downstream analyses.</p>
<p>The ANNOVAR tool was used to produce statistical analyses of the SNVs/indels (<xref ref-type="bibr" rid="B52">Wang et al., 2010</xref>). The variation position, variation type, conservative prediction, and other information were obtained at this step through a variety of databases, including DbSNP, 1000 Genome, and the reference sequence (<xref ref-type="bibr" rid="B46">Sherry et al., 2001</xref>; <xref ref-type="bibr" rid="B40">Pruitt et al., 2007</xref>; <xref ref-type="bibr" rid="B51">The 1000 Genomes Project Consortium, 2010</xref>). The variations were then assigned to genes by associating them with the nearest transcription start sites (TSSs) using the BEDOPS toolkit (<xref ref-type="bibr" rid="B35">Neph et al., 2012</xref>) and Gencode v21 human annotation (<xref ref-type="bibr" rid="B13">Harrow et al., 2012</xref>).</p>
<p>The regions include exon, intron, intergenic region, UTR5, UTR3, and the upstream (variant overlaps 1-kb region upstream of transcription start site)/downstream (variant overlaps 1-kb region downstream of transcription end site) regions.</p>
<p>The annotation detail of these three cell lines is listed in <xref ref-type="supplementary-material" rid="DS1">Supplementary Excel File S1</xref>. The version of the bioinformatics tools used is listed in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table S2</xref>.</p>
</sec>
<sec id="S2.SS5">
<title>Public Datasets Analyzed in This Study</title>
<p>The goal of the work is to assess the effect of the genomic variations during stem cell differentiation. We therefore need to exclude the effect of the epigenomic variation (<xref ref-type="fig" rid="F1">Figure 1A</xref>). We thus combined DNA methylation, histone modifications, chromatin accessibility, and transcriptomic datasets for the H1 and H1_BMP4 (differentiated with BAP treatment) cell lines. All the ChIP-Seq and DNase-Seq data in H1 and H1_BMP4 were generated by the ENCODE Consortium<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> and retrieved from the GEO database according to their accession number (<xref ref-type="supplementary-material" rid="PS1">Supplementary Table S3</xref>) (<xref ref-type="bibr" rid="B32">Lister et al., 2011</xref>; <xref ref-type="bibr" rid="B43">Roadmap Epigenomics Consortium et al., 2015</xref>; <xref ref-type="bibr" rid="B6">Davis et al., 2018</xref>). The sequencing data were mapped to the human reference genome GRCH37/hg19 by BWA. For ChIP-Seq data, we performed peak calling by using the MACS2 tool with default settings (<xref ref-type="bibr" rid="B60">Zhang et al., 2008</xref>). Differential peaks between the two cell lines were also identified with MACS2. We are interested in the genomic variations where there is no significant change between the two matched cell lines in the epigenomic data, especially for histone marks H3K27ac and H3K4me3 since the two represent the active state for enhancers and promoters.</p>
<p>The methylation state at CpG sites in whole genome bisulfite sequencing (WGBS) data were mapped to GRCh38/hg38 (<xref ref-type="supplementary-material" rid="PS1">Supplementary Table S3</xref>) (<xref ref-type="bibr" rid="B43">Roadmap Epigenomics Consortium et al., 2015</xref>). We used the liftOver tool to transform the genome coordinates from hg38 to hg19 (<xref ref-type="bibr" rid="B10">Dreszer et al., 2012</xref>).</p>
<p>The gene expression data (<xref ref-type="bibr" rid="B55">Xie et al., 2013</xref>) were retrieved from the ENCODE project (<xref ref-type="supplementary-material" rid="PS1">Supplementary Table S3</xref>) with the following identifiers: ENCFF245NXP, ENCFF809MAC, ENCFF051SBM, and ENCFF787HWI. RNA abundance was represented as the logarithm of the transcripts per million (TPM) provided by the RSEM program (<xref ref-type="bibr" rid="B27">Li and Dewey, 2011</xref>). The data was used in two aspects. One to identify the differentially expressed genes, so as to check which genes are influenced by the genomic variations. The other is to find the transcription factors that truly have a function in cells. This was done by checking the expression level of the gene that encodes the transcription factor (<xref ref-type="fig" rid="F1">Figure 1A</xref>). Differentially expressed mRNAs were identified with limma (<xref ref-type="bibr" rid="B42">Ritchie et al., 2015</xref>). Genes with Fold Change &#x003E; 2 or &#x003C;1/2 and false discovery rate (FDR) <italic>p</italic>-value &#x003C; 0.05 were identified as significantly differentially expressed genes between ESC H1 and differentiated to trophoblast H1_BMP4.</p>
</sec>
<sec id="S2.SS6">
<title>Prediction for Transcription Factor Binding Sites (TFBSs) Around the Genomic Variations</title>
<p>To estimate the change of the TF binding affinity due to the genomic variation, the 150-bp DNA sequences were extracted around the genomic variations and inputted into a bioinformatics tool, HOMER (<xref ref-type="bibr" rid="B14">Heinz et al., 2010</xref>), to calculate the affinity (<xref ref-type="fig" rid="F1">Figure 1A</xref>). Each of the sequences includes one kind of genomic variation. The affinity of a TF to a specific DNA sequence can be estimated by comparing the DNA sequence and the motif of the TF. The motif is the most favorable binding DNA pattern of the TF and can be represented with Position Weight Matrix (PWM). The comparison will be a score on the motif (motif score). A high score means a high affinity between the DNA sequence and the TF. We listed the TFs with a motif score &#x2265; 10 in the stem cell and its BMP4-induced cells and found the difference of the TFs between the cells.</p>
</sec>
</sec>
<sec id="S3">
<title>Results</title>
<sec id="S3.SS1">
<title>The Distribution of Genomic Variations During the Differentiation</title>
<p>We performed the high-throughput DNA sequencing for three pluripotent stem cell lines (H1, H9, and MRucR) and differentiated trophoblasts (H1_BMP4, H9_BMP4, and MRucR_BMP4). After quality control (<xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S1</xref>), genomic variations were identified by comparing the DNA sequence between the stem cell lines and matched BMP4-induced trophoblast cell lines. Reads mapping rate was more than 98%, and the sequencing depth was beyond 31X (<xref ref-type="supplementary-material" rid="PS1">Supplementary Tables S4</xref>, <xref ref-type="supplementary-material" rid="PS1">S5</xref>). We used Strelka2 to identify SNV and indels. SV was identified with BreakDancer (<xref ref-type="bibr" rid="B4">Chen et al., 2009</xref>; <xref ref-type="bibr" rid="B21">Kim et al., 2018</xref>). The Flow chart of our analysis is shown in <xref ref-type="fig" rid="F1">Figure 1A</xref>. We identified approximately 3,500 SNVs and 4,500 indels in the ESC and iPSC lines, respectively (<xref ref-type="table" rid="T1">Table 1A</xref>). The count of SV was relatively rare (&#x223C;230) (<xref ref-type="table" rid="T1">Table 1A</xref>).</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Number of genomic variants <bold>(A)</bold> and structural variants <bold>(B)</bold> in which stem cells differentiate into trophoblast cells.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<tbody>
<tr>
<td valign="top" align="left"><bold>(A)</bold></td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"><bold>Matched hESC-trophoblast cell lines</bold></td>
<td valign="top" align="center" colspan="2"><bold>Indels</bold></td>
<td valign="top" align="center" colspan="2"><bold>SNV</bold></td>
<td valign="top" align="center"><bold>SV</bold></td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td valign="top" align="left">H1</td>
<td valign="top" align="center" colspan="2">4071</td>
<td valign="top" align="center" colspan="2">3781</td>
<td valign="top" align="center">233</td>
</tr>
<tr>
<td valign="top" align="left">H9</td>
<td valign="top" align="center" colspan="2">4703</td>
<td valign="top" align="center" colspan="2">3736</td>
<td valign="top" align="center">235</td>
</tr>
<tr>
<td valign="top" align="left">MRucR</td>
<td valign="top" align="center" colspan="2">4729</td>
<td valign="top" align="center" colspan="2">3435</td>
<td valign="top" align="center">225</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"><bold>(B)</bold></td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"><bold>Matched hESC-trophoblast cell lines</bold></td>
<td valign="top" align="center"><bold>INS</bold></td>
<td valign="top" align="center"><bold>INV</bold></td>
<td valign="top" align="center"><bold>ITX</bold></td>
<td valign="top" align="center"><bold>CTX</bold></td>
<td valign="top" align="center"><bold>DEL</bold></td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td valign="top" align="left">H1</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">92</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">H9</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">96</td>
<td valign="top" align="center">110</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">MRucR</td>
<td valign="top" align="center">44</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">84</td>
<td valign="top" align="center">91</td>
<td valign="top" align="center">0</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>Indels, small somatic insertions and deletions; SNV, single nucleotide variant; SV, structural variation; INS, insertion; INV, inversion; ITX, intra-chromosomal translocation; CTX, inter-chromosomal translocation; DEL, deletions.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>The frequency of SNVs was counted. According to our statistics, the single-base substitution patterns were similar in the three pluripotent stem cells (<xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S2a</xref>). The highest frequency of SNV was the conversion of cytosine to thymine (C &#x003E; T), and the lowest frequency of SNV was the conversion of guanine to cytosine (G &#x003E; C). Among the SVs (<xref ref-type="table" rid="T1">Table 1B</xref>), the proportion of translocation, including inter- and intra-, was high (about 77%&#x223C;87%). The percentage of insertion inversion and deletion was 12% &#x223C; 22%. We did not detect the structure deletion events here. SV had similar distributions in these three pairs of cell lines (<xref ref-type="fig" rid="F1">Figure 1B</xref>).</p>
<p>We then annotated SNV and indels with ANNOVAR (<xref ref-type="bibr" rid="B52">Wang et al., 2010</xref>). The SNV and indels were counted in different genomic regions (<xref ref-type="supplementary-material" rid="DS1">Supplementary Excel File S1</xref>). To calculate fold enrichments, ratios of the variation counts in a certain category of genomic regions were divided by the proportions of this category in the whole genome length. The variations were significantly enriched in the introns (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Ratios of the number of variations occurred in certain categories of the genomic regions. Fold enrichments, ratios of the variation counts in a certain category of genomic regions were divided by the proportions of this category in the whole genome length. Down/upstream means the variant overlaps the 1-kb region downstream of the transcription end site or upstream of TSS.</p></caption>
<graphic xlink:href="fgene-11-00230-g002.tif"/>
</fig>
<p>Distribution profiles of SNV/indels in the 2500-bp genomic regions around the TSSs seemly occurred in a periodic pattern in the cell lines (<xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S2b</xref>). SNP frequency spectra show striking periodicities across nucleosomal regions, and SNPs have a preference for nucleosomes (<xref ref-type="bibr" rid="B25">Langley et al., 2014</xref>). We therefore speculated that there is an association between the patterns and nucleosome distribution in this region.</p>
</sec>
<sec id="S3.SS2">
<title>Transcription Factors That Bind Around Genomic Variations Sites</title>
<p>A genomic variation may directly alter the affinity of TF&#x2019;s binding as this variation occurs exactly at TFBSs, thus influencing the transcription levels of the downstream target genes of the TFs. We assessed the effect of the variations on the alteration of TF binding with HOMER (<xref ref-type="bibr" rid="B14">Heinz et al., 2010</xref>). The 150-bp DNA sequences around the variation sites were inputted into HOMER to identify motifs of the TFs and compared the affinity of the TFs variation between the stem cell and matched BMP4-induced trophoblast cell lines. In <xref ref-type="fig" rid="F3">Figures 3A,B</xref>, although counts of TF motifs around SNV/indels positions decreased, the number was not zero. There were some motifs that were far away from the SNV/indels. We intend to find out what TFs bind at the SNV/indels-harbored sites.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Transcription factors identified around genomic variations sites. <bold>(A)</bold> The Venn diagram describing the overlap between the TFs in H1 and H1_BMP4. <bold>(B,C)</bold> The count of the TF motif around SNV/indels in H1 and H1_BMP4. <bold>(D,E)</bold> The bubble chart of the top 20 key TFs in H1 and H1_BMP4.</p></caption>
<graphic xlink:href="fgene-11-00230-g003.tif"/>
</fig>
<p>A TF with its motif in one genomic region does not mean the TF indeed binds at the region in a cell because the TF maybe is not expressed in the cell. We therefore need to check if the TF is abundant enough in the cell, namely, to check if the expression level of the gene encoding the TF is high enough. To do this, a necessary step is to exclude those TFs with a low abundance. The expression of TF-encoding genes was shown for H1 and H1_BMP4 (<xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S3</xref>). Here, the cutoff of TPM &#x2265; 3 was used to select the TFs that indeed exist in the cell. We identified 154 and 193 TFs in H1 and H1_BMP4 cells, respectively, with an overlap of 135 TFs (<xref ref-type="fig" rid="F3">Figure 3C</xref>). The top 20 key TFs were chosen by setting TPM &#x2265; 3 and motif score &#x2265; 10 (<xref ref-type="fig" rid="F3">Figures 3D,E</xref> and <xref ref-type="supplementary-material" rid="PS1">Supplementary Tables S6</xref>, <xref ref-type="supplementary-material" rid="PS1">S7</xref>). In the top 20 TFs of H1 and H1_BMP4, some widely known to be involved in placental development were found, such as GATA3 (<xref ref-type="bibr" rid="B23">Kubaczka et al., 2015</xref>), POU5F1 (<xref ref-type="bibr" rid="B54">Wang et al., 2012</xref>), and KLF4 (<xref ref-type="bibr" rid="B1">Abad et al., 2013</xref>). A target gene of SREBP1A is the transcriptional repressor BHLHB2, which also promotes the differentiation of stem cells to trophoblast giant cells (<xref ref-type="bibr" rid="B26">Lecomte et al., 2010</xref>). ZEB2 has recently been identified to play critical roles in the regulation of the epithelial-mesenchymal transition and trophoblast differentiation (<xref ref-type="bibr" rid="B5">DaSilva-Arnold et al., 2019</xref>).</p>
<p>We identified the TFs that differ in affinity around the genomic variation sites between H1 and H1_BMP4. The TFs were chosen under conditions of TPM &#x2265; 3, motif score &#x2265; 10, and motif score count difference (abs) &#x2265; 10. We found three TFs (Zfp281, OCTs, and KLF3), whose affinities near the genomic variations differ between H1 and H1_BMP4 cell lines. The three kinds of TFs have a higher motif score count (count of motif score &#x2265; 10) in H1 than in H1_BMP4. Interestingly, these TFs are widely known to be involved in trophoblast differentiation. For example, Zfp281 (Kr&#x00FC;ppel-like zinc finger transcription factor), a zinc finger transcription factor, which shapes the transcriptome of trophoblast cells and regulates early placental development, has also been investigated in a recent article (<xref ref-type="bibr" rid="B15">Ishiuchi et al., 2019</xref>). Moreover, the other TFs were widely known to be important in placental development (<xref ref-type="supplementary-material" rid="PS1">Supplementary Table S8</xref>).</p>
<p>In short, we found that there were indeed some genomic variations at TFBSs.</p>
</sec>
<sec id="S3.SS3">
<title>Correlation Between Genome Variations and Epigenomics</title>
<p>Genome variation was reported to be correlated to epigenetic alteration. Li et al. demonstrated that about 2/3 of eQTLs were due to variations that altered chromatin accessibility or histone marking (<xref ref-type="bibr" rid="B31">Li et al., 2016</xref>). We studied the correlation between genetic variations and chromatin alteration in H1 and H1_BMP4 cell lines. The sequencing data of H3K4me3, H3K27ac, H3K27me3, and DNA methylation data were retrieved from ENCODE (<xref ref-type="supplementary-material" rid="PS1">Supplementary Table S5</xref>).</p>
<p>In this analysis, the genomic variations were divided into four categories according to whether the variation site was at the peaks of the histone modifications and DNA methylation in H1 and H1_BMP4 (<xref ref-type="bibr" rid="B32">Lister et al., 2011</xref>; <xref ref-type="bibr" rid="B43">Roadmap Epigenomics Consortium et al., 2015</xref>). We counted the peaks of the histone modifications and DNA methylation within the 4K-bp genomic region around SNV/indel sites (<xref ref-type="fig" rid="F4">Figure 4</xref> and <xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S4</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>The histone modifications surrounding SNV/indels. <italic>X</italic>-axis is the upstream and downstream 2 Kbp relative to SNV/indels sites. <bold>(A,B)</bold> The number of peak counts on the genomic variations which are located in both promoter activated <bold>(A)</bold> or inactivated <bold>(B)</bold> around H3K4me3 between H1 and H1_BMP4. <bold>(C,D)</bold> The number of peak counts on the genomic variations which are located in both enhancer activated <bold>(C)</bold> or inactivated <bold>(D)</bold> around H3K27ac between H1 and H1_BMP4. The number in parentheses indicates the number of SNV/indels. Complete profiles are shown in <xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S4</xref>.</p></caption>
<graphic xlink:href="fgene-11-00230-g004.tif"/>
</fig>
<p>The enriching regions (peaks) of DNA methylation and H3K27me3 were considered as &#x201C;inhibited&#x201D; regions. Modified regions (peaks) by H3K4me3 and H3K27ac indicate &#x201C;promoter activated&#x201D; and &#x201C;enhancer inactivated&#x201D; regions, respectively. By comparing the epigenomic states of the regions around the genomic variations between the stem cells and matched BMP4-induced trophoblast cell lines, 66 variations that do not associate an epigenomic alteration were identified (<xref ref-type="table" rid="T2">Table 2</xref>, and <xref ref-type="supplementary-material" rid="DS2">Supplementary Excel File S2</xref>). Next, we evaluated the effects of the genomic variation around which the epigenetic modifications do not alter between H1 and H1_BMP4.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Number of the genomic variations that occur in regions associating different switches of the epigenomic modifications between H1 and H1_BMP4.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Regions</td>
<td valign="top" align="left">Modification types</td>
<td valign="top" align="center">State switch from H1 to H1_BMP4</td>
<td valign="top" align="center">The number of genomic variations</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">H3K27me3 and DNA methylation</td>
<td valign="top" align="center">In - &#x003E; T</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">H3K27me3 and DNA methylation</td>
<td valign="top" align="center">T - &#x003E; In</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Enhancer</td>
<td valign="top" align="left">H3K27ac</td>
<td valign="top" align="center">I - &#x003E; A</td>
<td valign="top" align="center">18</td>
</tr>
<tr>
<td valign="top" align="left">Enhancer</td>
<td valign="top" align="left">H3K27ac</td>
<td valign="top" align="center">A - &#x003E; I</td>
<td valign="top" align="center">12</td>
</tr>
<tr>
<td valign="top" align="left">Promoter</td>
<td valign="top" align="left">H3K4me3</td>
<td valign="top" align="center">I - &#x003E; A</td>
<td valign="top" align="center">30</td>
</tr>
<tr>
<td valign="top" align="left">Promoter</td>
<td valign="top" align="left">H3K4me3</td>
<td valign="top" align="center">A - &#x003E; I</td>
<td valign="top" align="center">30</td>
</tr>
<tr>
<td valign="top" align="left">Promoter</td>
<td valign="top" align="left">H3K4me3</td>
<td valign="top" align="center">A - &#x003E; A</td>
<td valign="top" align="center">46</td>
</tr>
<tr>
<td valign="top" align="left">Enhancer</td>
<td valign="top" align="left">H3K27ac</td>
<td valign="top" align="center">A - &#x003E; A</td>
<td valign="top" align="center">20</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>In, inhibited; T, transcribed; I, inactivated; A, activated.</italic></attrib>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S3.SS4">
<title>The Direct Effect of Genomic Variation on Differentiation</title>
<p>We are interested in the gene expression change that is only caused by the genomic variation instead of the epigenetic variation. We therefore only focused on the genomic variations which are located in those genomic regions where epigenetic markers remain comparable between H1 and H1_BMP4.</p>
<p>We identified 5,688 differentially expressed genes between H1 and H1_BMP4 (FDR p-value &#x2264; 0.05 and fold change &#x2265; 2 or &#x2264;1/2) (<xref ref-type="fig" rid="F5">Figure 5A</xref>). In the genes, there are 3,763 up-regulated and 1,925 down-regulated genes. Since each genomic variation was assigned to a gene, we could identify the genomic variations that associate differentially expressed genes.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p><bold>(A)</bold> Volcano plotting shows the differentially expressed genes between human ESC and BMP4-induced trophoblast cell lines. The black points are for the genes with a genomic variation, while the gray points are for the other genes. <bold>(B)</bold> The Venn plotting shows the genes both with a genomic variation and those exhibiting a differential expression from H1 to H1_BMP4. <bold>(C)</bold> The significant KEGG pathway for 26 genes. <bold>(D)</bold> The Venn plotting shows the genes both containing variations, and 94 differentially expressed target genes, which were regulated by downstream signaling proteins of BMP4.</p></caption>
<graphic xlink:href="fgene-11-00230-g005.tif"/>
</fig>
<p>One-hundred-and-twenty-nine differential expression genes were determined using stricter criteria (<italic>p</italic>-value &#x2264; 1E&#x2013;5 and fold change &#x2265; 4 or &#x2264;1/4). We found a total of 26 genes with variations in the differentiation process out of the 129 differential expression genes (<xref ref-type="fig" rid="F5">Figure 5B</xref>). The functional annotation tool of KEGG was applied to the 26 genes (<xref ref-type="fig" rid="F5">Figure 5C</xref>). As expected, two pathways implicated in regulating human ESC differentiation was found with an adjusted <italic>p</italic>-value &#x003C; 0.05, namely the &#x201C;signaling pathways regulating pluripotency of stem cells&#x201D; and the &#x201C;TGF-beta signaling pathway,&#x201D; which represent the differentiation pathways. For instance, inhibition of the TGF-beta signaling pathway could be sufficient for the derivation and long term expansion of human trophoblast cells and the placenta (<xref ref-type="bibr" rid="B56">Xu et al., 2017</xref>; <xref ref-type="bibr" rid="B22">Kn&#x00F6;fler et al., 2019</xref>).</p>
<p>In the induction by BMP4, BMP ligands are bound to the BMP4 complex, then transphosphorylate the intracellular signaling proteins, Smad1/5/8. The phospho-Smad1/5/8 interacts with Smad4, and the complex translocates into the nucleus where it interacts with transcription cofactors and regulates expression of target genes in a cell type-specific manner (<xref ref-type="bibr" rid="B47">Shimasaki et al., 2004</xref>). Based on these biological processes, 235 target genes of Smad 1/4/5/6 were retrieved from the TF targets in the Harmonizome database (<xref ref-type="bibr" rid="B44">Rouillard et al., 2016</xref>). Among them, 94 are differentially expressed between H1 and H1_BMP4 (FDR <italic>p</italic>-value &#x2264; 0.05 and fold change &#x2265; 2 or &#x2264;1/2). There are 72 up-regulated (30.64%) and 22 down-regulated genes (9.36%). In the 94 BMP4-induced genes, 42 (44.68%) contain the genomic variations (<xref ref-type="fig" rid="F5">Figure 5D</xref>), suggesting a tight association between the variation and BMP4 induced differentiation.</p>
<p>In particular, by excluding the variations associating the epigenetic modification alteration, six genomic variations, which associate differentially expressed genes, were identified (<xref ref-type="table" rid="T3">Table 3</xref>, and <xref ref-type="supplementary-material" rid="DS3">Supplementary Excel File S3</xref>). Five of the six variations were located in the promoter regions of genes MEF2C, TNFAIP8, TEX2, INHBA-AS1, and CAMK2N1. The remaining one was at the enhancer of gene COL1A2.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Information of six genomic variations.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Chr</td>
<td valign="top" align="center">Start</td>
<td valign="top" align="center">Ref</td>
<td valign="top" align="center">Alt</td>
<td valign="top" align="center">Location</td>
<td valign="top" align="center">Gene.refGene</td>
<td valign="top" align="center">Modification types</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">chr5</td>
<td valign="top" align="center">118605129</td>
<td valign="top" align="center">ATG</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center">Intronic</td>
<td valign="top" align="center">TNFAIP8</td>
<td valign="top" align="center">H3K4me3</td>
</tr>
<tr>
<td valign="top" align="left">chr5</td>
<td valign="top" align="center">88179358</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center">G</td>
<td valign="top" align="center">ncRNA_intronic</td>
<td valign="top" align="center">MEF2C</td>
<td valign="top" align="center">H3K4me3</td>
</tr>
<tr>
<td valign="top" align="left">chr17</td>
<td valign="top" align="center">62339443</td>
<td valign="top" align="center">G</td>
<td valign="top" align="center">T</td>
<td valign="top" align="center">Intronic</td>
<td valign="top" align="center">TEX2</td>
<td valign="top" align="center">H3K4me3</td>
</tr>
<tr>
<td valign="top" align="left">chr7</td>
<td valign="top" align="center">41740828</td>
<td valign="top" align="center">G</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center">ncRNA_intronic</td>
<td valign="top" align="center">INHBA-AS1</td>
<td valign="top" align="center">H3K4me3</td>
</tr>
<tr>
<td valign="top" align="left">chr1</td>
<td valign="top" align="center">20811135</td>
<td valign="top" align="center">T</td>
<td valign="top" align="center">C</td>
<td valign="top" align="center">Intronic</td>
<td valign="top" align="center">CAMK2N1</td>
<td valign="top" align="center">H3K4me3</td>
</tr>
<tr>
<td valign="top" align="left">chr7</td>
<td valign="top" align="center">94023911</td>
<td valign="top" align="center">A</td>
<td valign="top" align="center">G</td>
<td valign="top" align="center">UTR5</td>
<td valign="top" align="center">COL1A2</td>
<td valign="top" align="center">H3K27ac</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>Column Location lists the genomic region of the variation. ncRNA_intronic means non-coding RNA genes. Gene.refGene is the gene associating the variation. Modification types indicate the H3K4me3 or H3K27ac around the variation in both H1 and H1_BMP4.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>We highlighted a core function of the six genes (MEF2C, TNFAIP8, TEX2, INHBA-AS1, CAMK2N1, and COL1A2) in differentiation and placenta development. TNFAIP8 plays a role in immune homeostasis, inflammatory responses, tumor genesis, and development. TNFAIP8 is also highly expressed in most normal human tissues especially for immunity-related tissues like the placenta (<xref ref-type="bibr" rid="B59">Zhang et al., 2018</xref>). INHBA encodes a member of the TGF-&#x03B2; (transforming growth factor-beta) superfamily of proteins which has been proven to promote the differentiation of human embryonic stem cells into trophoblasts (<xref ref-type="bibr" rid="B41">Puc&#x00E9;at, 2007</xref>). COL1A2 encodes one of the chains for type I collagen, the fibrillar collagen found in most connective tissues, and it is an early stage marker of osteoblast differentiation (<xref ref-type="bibr" rid="B38">Parisuthiman et al., 2005</xref>). MEF2C plays a pivotal role in myogenesis, neural crest, and craniofacial development, and may have an influence on maintaining the differentiated state of muscle cells (<xref ref-type="bibr" rid="B63">Zweier et al., 2010</xref>). In a recent study, dysregulation of MEF2 expression or signaling in early pregnancy may be associated with placenta-related pregnancy disorders, including preeclampsia (<xref ref-type="bibr" rid="B30">Li et al., 2017</xref>). In total, the six genes both associate a genomic variation in their regulatory regions and show differential expression from H1 to H1_BMP4, suggesting that the genomic variations associated with differentiation.</p>
<p>Since the six genomic variations are in non-coding regions, we assessed the effect of the variation in altering the binding affinity of TFs. We applied JASPAR2018<sup><xref ref-type="fn" rid="footnote3">3</xref></sup> to calculate the motif score of 12-bp DNA sequences around the six SNVs/indels to the TFs that bind at those variation-harbored regions (<xref ref-type="bibr" rid="B19">Khan et al., 2018</xref>). We listed the motif scores in <xref ref-type="supplementary-material" rid="DS3">Supplementary Excel File S3</xref>. We found that the allele G of a variation (chr5: 88179358 A &#x003E; G), which is at the enhancer of gene MEF2C, can increase the motif scores &#x2265; 10 in the induced cells (<xref ref-type="table" rid="T4">Table 4</xref>). it should be considered, however, that the MEF2C expression level is &#x223C;4 fold higher in H1_BMP4 than in H1. The results mean that the genomic variation accounts for the increase of MEF2C expression by increasing TFs&#x2019; affinity at MEF2C&#x2019;s promoter. Moreover, the H2K27ac does not show a significant change between the two cell lines. We confirmed that the MEF2C expression increase is caused by the genomic variation.</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>The TF motif scores of MEF2C.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">TF_Name</td>
<td valign="top" align="center">Start</td>
<td valign="top" align="center">End</td>
<td valign="top" align="center">Strand</td>
<td valign="top" align="center">H1 Expression(TPM)</td>
<td valign="top" align="center">H1_BMP4 Expression (TPM)</td>
<td valign="top" align="center">H1_BMP4_motif score</td>
<td valign="top" align="center">H1_motif score</td>
<td valign="top" align="center">motif score_diff</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">KLF16</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">15.52</td>
<td valign="top" align="center">14.81</td>
<td valign="top" align="center">10.90</td>
<td valign="top" align="center">3.90</td>
</tr>
<tr>
<td valign="top" align="left">ZNF740</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">5.4</td>
<td valign="top" align="center">7.29</td>
<td valign="top" align="center">12.63</td>
<td valign="top" align="center">7.09</td>
<td valign="top" align="center">5.54</td>
</tr>
<tr>
<td valign="top" align="left">ZNF740</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">5.4</td>
<td valign="top" align="center">7.29</td>
<td valign="top" align="center">10.66</td>
<td valign="top" align="center">5.53</td>
<td valign="top" align="center">5.13</td>
</tr>
<tr>
<td valign="top" align="left">NR2C2</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">+</td>
<td valign="top" align="center">9.27</td>
<td valign="top" align="center">12.99</td>
<td valign="top" align="center">10.09</td>
<td valign="top" align="center">3.87</td>
<td valign="top" align="center">6.22</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>The TFs in this table were chosen as follows: (1) the motif scores of the TFs larger than 10 in H1 or H1_BMP4; (2) TPM value of the TF larger than 3 in both H1 and H1_BMP4; (3) the fold change of the expressed TF genes between H1 and H1_BMP4 less than 2.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Importantly, the MEF2C gene itself encodes a TF protein. More recently, MEF2 was proven to regulate human trophoblast differentiation and invasion (<xref ref-type="bibr" rid="B30">Li et al., 2017</xref>). We retrieved the list of 954 MEF2C target genes reported in ENCODE TF targets (<xref ref-type="bibr" rid="B44">Rouillard et al., 2016</xref>). We then compared the TF MEF2C target genes&#x2019; expression between H1 and H1_BMP4. The result showed that they are significantly up-regulated in the transition from human ESC and the trophoblast (paired sample <italic>t</italic>-test, <italic>p</italic> &#x003C; 2.2E-16) (<xref ref-type="fig" rid="F6">Figure 6</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>A genomic variation (chromosomes 5: 88179358 A &#x003E; G) inserts its function in differentiation by affecting MEF2C&#x2019; expression, by altering the affinity of TFs binding. <bold>(A)</bold> The H3K4me3 track of H1 and H1_BMP4 around the variation site. <bold>(B)</bold> The gene expression of TF MEF2C was significantly increased (left panel). Violin plotting (right panel) shows that the gene expression of target genes of TF &#x2018;MEF2C&#x2019; was significantly up-regulated between H1 and H1_BMP4 (<italic>p</italic> &#x003C; 2.2E-16). <bold>(C)</bold> During differentiation, increased expression levels of the MEF2C gene is due to a genomic alteration, chromosomes 5: 88179358 A &#x003E; G, which is at a binding site of TFs KLF16, NR2C2, and ZNF740. Allele G shows a higher affinity to TFs in the induced cell. A high expression of MEF2C leads to an increased expression of TF MEF2C&#x2019;s target genes, subsequently affecting the differentiation.</p></caption>
<graphic xlink:href="fgene-11-00230-g006.tif"/>
</fig>
<p>Functional annotations by Enrichr (<xref ref-type="bibr" rid="B24">Kuleshov et al., 2016</xref>) on KEGG (Kyoto Encyclopedia of Genes and Genomes) and GO (Gene ontology) suggest the target genes of TF MEF2C were enriched in the &#x201C;NF-kappa B signaling pathway&#x201D; (<xref ref-type="bibr" rid="B39">Pollheimer and Kn&#x00F6;fler, 2005</xref>), &#x201C;Primary immunodeficiency,&#x201D; &#x201C;Systemic lupus erythematosus,&#x201D; &#x201C;positive regulation of NF-kappaB transcription factor activity,&#x201D; &#x201C;positive regulation of immune response,&#x201D; and so on (adjust <italic>p</italic>-value &#x2264; 0.05, <xref ref-type="supplementary-material" rid="PS1">Supplementary Figure S5</xref>). Literature suggests that the placenta is an important immune-related tissue (<xref ref-type="bibr" rid="B11">Erkers et al., 2017</xref>). We therefore considered that the target genes of TF MEF2C have a significant impact on the cell differentiation process.</p>
<p>Briefly, the gene expression of MEF2C was significantly increased due to the high-affinity binding sites of TFs like KLF16, NR2C2, and ZNF740, a variation site located in MEF2C (chromosomes 5: 88179358 A &#x003E; G). Further, the expression levels of these three TFs were not significantly different between H1 and H1_BMP4, suggesting that MEF2C&#x2019;s expression increase is not caused by a rise in the abundance of the TFs of MEF2C, but due to the genomic variation (<xref ref-type="fig" rid="F6">Figure 6</xref>). Interestingly, literature indicates that simultaneous depletion of KLF2, KLF4, and KLF5 leads to differentiation of the embryonic stem cell, and it has been postulated that other members of the KLF family, such as KLF16, may play similar roles in embryonic stem cells (<xref ref-type="bibr" rid="B16">Jiang et al., 2008</xref>; <xref ref-type="bibr" rid="B2">Andreoli et al., 2010</xref>). It has been established that NR2C2 (TR4) plays a critical role in embryonic development and differentiation.</p>
<p>NR2C2 is expressed in blastocysts and embryonic stem cells and can act as transcriptional activators in hESC (<xref ref-type="bibr" rid="B48">Shyr et al., 2009</xref>; <xref ref-type="bibr" rid="B37">O&#x2019;Geen et al., 2010</xref>). Altogether, this indicates that due to the up-regulation of MEF2C, its target genes were up-regulated. Importantly, the upregulation of MEF2C is caused by the genomic variation (chr5: 88179358 A &#x003E; G), which alters the affinity of MEF2C&#x2019;s TFs.</p>
</sec>
</sec>
<sec id="S4">
<title>Discussion</title>
<p>Understanding the genetic underpinnings of complex traits remains a major challenge in human genetics. In this study, we obtained paired genomic DNA sequences of human ESC and the trophoblast from three cell lines (H1, H9, and MRucR) through whole genome sequencing, and integrated the epigenomic (DNA methylation, histone modifications and chromatin accessibility) and transcriptomic datasets to investigate the impact of the genome variations in human ESC differentiation to the trophoblast. We found that the SNVs and indels generally tend to be located in the intron regions rather than in the other regions.</p>
<p>We focused on the gene expression variation caused by genomic variation rather than the epigenetic variation. Six genomic variations were identified. One of them, located in MEF2C (chromosomes 5: 88179358 A &#x003E; G), is a TF. This SNV increased the TF binding strength to regulate gene expression directly. Thereby, leading to an increase in the expression of downstream target genes affecting the differentiation of human ESC into the trophoblast. It suggested that the variations in the non-coding region played an important role in the differentiation process.</p>
<p>The inducer, BMP4, is the most significant factor to differentiate to the trophoblast. BMP4 is able to inhibit the Activin/Nodal signaling pathway and activate the BMP signaling pathway, which is required for human ESCs to differentiate into trophoblasts (<xref ref-type="bibr" rid="B57">Xu et al., 2002</xref>). We showed that &#x223C;45% (42 genes) of the differentially expressed BMP4-induced genes associated with genomic variations. Although genomic variations are not possible to be the only dominant factor in differentiation, some genomic variations indeed have an effect on differentiation. There are two limitations in this work. The first is that the conclusion is confined to only three pairs of cell lines. A similar analysis should be carried out in more cell lines of iPSC. The second limitation is that comprehensive biochemical experiments are still needed to validate the conclusion. CRISPR technology in cultured cells could be employed to prove the role of genomic variation in the differentiation process (<xref ref-type="bibr" rid="B62">Zhou et al., 2014</xref>, p. 2).</p>
</sec>
<sec id="S5">
<title>Data Availability Statement</title>
<p>The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive (<xref ref-type="bibr" rid="B53">Wang et al., 2017</xref>) at the BIG Data Center (<xref ref-type="bibr" rid="B61">Zhang et al., 2019</xref>), Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, under accession number <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="CRA002190">CRA002190</ext-link>, which is publicly accessible at <ext-link ext-link-type="uri" xlink:href="https://bigd.big.ac.cn/gsa">https://bigd.big.ac. cn/gsa</ext-link>.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>All authors contributed to the study conception and design, analyzed and discussed the results. XS conceived the project and completed the core program. HaL, HoL, and YL performed the computational analysis. HaL and HoL wrote the manuscript.</p>
</sec>
<sec id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This work was supported by grants from the National Natural Science Foundation of China (Nos. 81830053 and 61972084) and the Key Research &#x0026; Development Program of Jiangsu Province (BE2016002-3).</p>
</fn>
</fn-group>
<ack>
<p>We thank Professor Toshihiko Ezashi (University of Missouri) and Professor R. Michael Roberts (University of Missouri) for providing genomic materials obtained from three human pluripotent stem cells (H1, H9, and MRucR). We would also like to thank Professor Toshihiko Ezashi and Yue Hou for critical reading and helpful discussions for the manuscript.</p>
</ack>
<sec id="S9" sec-type="supplementary material"><title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2020.00230/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2020.00230/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.XLSX" id="DS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.XLSX" id="DS2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_3.XLSX" id="DS3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Presentation_1.pdf" id="PS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abad</surname> <given-names>M.</given-names></name> <name><surname>Mosteiro</surname> <given-names>L.</given-names></name> <name><surname>Pantoja</surname> <given-names>C.</given-names></name> <name><surname>Ca&#x00F1;amero</surname> <given-names>M.</given-names></name> <name><surname>Rayon</surname> <given-names>T.</given-names></name> <name><surname>Ors</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Reprogramming <italic>in vivo</italic> produces teratomas and iPS cells with totipotency features.</article-title> <source><italic>Nature</italic></source> <volume>502</volume> <fpage>340</fpage>&#x2013;<lpage>345</lpage>. <pub-id pub-id-type="doi">10.1038/nature12586</pub-id> <pub-id pub-id-type="pmid">24025773</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andreoli</surname> <given-names>V.</given-names></name> <name><surname>Gehrau</surname> <given-names>R. C.</given-names></name> <name><surname>Bocco</surname> <given-names>J. L.</given-names></name></person-group> (<year>2010</year>). <article-title>Biology of Kr&#x00FC;ppel-like factor 6 transcriptional regulator in cell life and death.</article-title> <source><italic>IUBMB Life</italic></source> <volume>62</volume> <fpage>896</fpage>&#x2013;<lpage>905</lpage>. <pub-id pub-id-type="doi">10.1002/iub.396</pub-id> <pub-id pub-id-type="pmid">21154818</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banovich</surname> <given-names>N. E.</given-names></name> <name><surname>Li</surname> <given-names>Y. I.</given-names></name> <name><surname>Raj</surname> <given-names>A.</given-names></name> <name><surname>Ward</surname> <given-names>M. C.</given-names></name> <name><surname>Greenside</surname> <given-names>P.</given-names></name> <name><surname>Calderon</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Impact of regulatory variation across human iPSCs and differentiated cells.</article-title> <source><italic>Genome Res.</italic></source> <volume>28</volume> <fpage>122</fpage>&#x2013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1101/gr.224436.117</pub-id> <pub-id pub-id-type="pmid">29208628</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Wallis</surname> <given-names>J. W.</given-names></name> <name><surname>McLellan</surname> <given-names>M. D.</given-names></name> <name><surname>Larson</surname> <given-names>D. E.</given-names></name> <name><surname>Kalicki</surname> <given-names>J. M.</given-names></name> <name><surname>Pohl</surname> <given-names>C. S.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>BreakDancer: an algorithm for high-resolution mapping of genomic structural variation.</article-title> <source><italic>Nat. Methods</italic></source> <volume>6</volume> <fpage>677</fpage>&#x2013;<lpage>681</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1363</pub-id> <pub-id pub-id-type="pmid">19668202</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DaSilva-Arnold</surname> <given-names>S. C.</given-names></name> <name><surname>Kuo</surname> <given-names>C. Y.</given-names></name> <name><surname>Davra</surname> <given-names>V.</given-names></name> <name><surname>Remache</surname> <given-names>Y.</given-names></name> <name><surname>Kim</surname> <given-names>P. C. W.</given-names></name> <name><surname>Fisher</surname> <given-names>J. P.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>ZEB2, a master regulator of the epithelial-mesenchymal transition, mediates trophoblast differentiation.</article-title> <source><italic>Mol. Hum. Reprod.</italic></source> <volume>25</volume> <fpage>61</fpage>&#x2013;<lpage>75</lpage>. <pub-id pub-id-type="doi">10.1093/molehr/gay053</pub-id> <pub-id pub-id-type="pmid">30462321</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davis</surname> <given-names>C. A.</given-names></name> <name><surname>Hitz</surname> <given-names>B. C.</given-names></name> <name><surname>Sloan</surname> <given-names>C. A.</given-names></name> <name><surname>Chan</surname> <given-names>E. T.</given-names></name> <name><surname>Davidson</surname> <given-names>J. M.</given-names></name> <name><surname>Gabdank</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>The Encyclopedia of DNA elements (ENCODE): data portal update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>D794</fpage>&#x2013;<lpage>D801</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1081</pub-id> <pub-id pub-id-type="pmid">29126249</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Boni</surname> <given-names>L.</given-names></name> <name><surname>Gasparoni</surname> <given-names>G.</given-names></name> <name><surname>Haubenreich</surname> <given-names>C.</given-names></name> <name><surname>Tierling</surname> <given-names>S.</given-names></name> <name><surname>Schmitt</surname> <given-names>I.</given-names></name> <name><surname>Peitz</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>DNA methylation alterations in iPSC- and hESC-derived neurons: potential implications for neurological disease modeling.</article-title> <source><italic>Clin. Epigenet.</italic></source> <volume>10</volume>:<issue>13</issue>. <pub-id pub-id-type="doi">10.1186/s13148-018-0440-440</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeBoever</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Jakubosky</surname> <given-names>D.</given-names></name> <name><surname>Benaglio</surname> <given-names>P.</given-names></name> <name><surname>Reyna</surname> <given-names>J.</given-names></name> <name><surname>Olson</surname> <given-names>K. M.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Large-scale profiling reveals the influence of genetic variation on gene expression in human induced pluripotent stem cells.</article-title> <source><italic>Cell Stem Cell</italic></source> <volume>20</volume> <fpage>533.e7</fpage>&#x2013;<lpage>546.e7</lpage>. <pub-id pub-id-type="doi">10.1016/j.stem.2017.03.009</pub-id> <pub-id pub-id-type="pmid">28388430</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delahaye</surname> <given-names>F.</given-names></name> <name><surname>Do</surname> <given-names>C.</given-names></name> <name><surname>Kong</surname> <given-names>Y.</given-names></name> <name><surname>Ashkar</surname> <given-names>R.</given-names></name> <name><surname>Salas</surname> <given-names>M.</given-names></name> <name><surname>Tycko</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Genetic variants influence on the placenta regulatory landscape.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>14</volume>:<issue>e1007785</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1007785</pub-id> <pub-id pub-id-type="pmid">30452450</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dreszer</surname> <given-names>T. R.</given-names></name> <name><surname>Karolchik</surname> <given-names>D.</given-names></name> <name><surname>Zweig</surname> <given-names>A. S.</given-names></name> <name><surname>Hinrichs</surname> <given-names>A. S.</given-names></name> <name><surname>Raney</surname> <given-names>B. J.</given-names></name> <name><surname>Kuhn</surname> <given-names>R. M.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>The UCSC Genome Browser database: extensions and updates 2011.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume> <fpage>D918</fpage>&#x2013;<lpage>D923</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr1055</pub-id> <pub-id pub-id-type="pmid">22086951</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Erkers</surname> <given-names>T.</given-names></name> <name><surname>Stikvoort</surname> <given-names>A.</given-names></name> <name><surname>Uhlin</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Lymphocytes in placental tissues: immune regulation and translational possibilities for immunotherapy.</article-title> <source><italic>Stem Cells Int.</italic></source> <volume>2017</volume>:<issue>5738371</issue>. <pub-id pub-id-type="doi">10.1155/2017/5738371</pub-id> <pub-id pub-id-type="pmid">29348758</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goolam</surname> <given-names>M.</given-names></name> <name><surname>Scialdone</surname> <given-names>A.</given-names></name> <name><surname>Graham</surname> <given-names>S. J. L.</given-names></name> <name><surname>Macaulay</surname> <given-names>I. C.</given-names></name> <name><surname>Jedrusik</surname> <given-names>A.</given-names></name> <name><surname>Hupalowska</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Heterogeneity in Oct4 and Sox2 targets biases cell fate in 4-cell mouse embryos.</article-title> <source><italic>Cell</italic></source> <volume>165</volume> <fpage>61</fpage>&#x2013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2016.01.047</pub-id> <pub-id pub-id-type="pmid">27015307</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harrow</surname> <given-names>J.</given-names></name> <name><surname>Frankish</surname> <given-names>A.</given-names></name> <name><surname>Gonzalez</surname> <given-names>J. M.</given-names></name> <name><surname>Tapanari</surname> <given-names>E.</given-names></name> <name><surname>Diekhans</surname> <given-names>M.</given-names></name> <name><surname>Kokocinski</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>GENCODE: the reference human genome annotation for the ENCODE project.</article-title> <source><italic>Genome Res.</italic></source> <volume>22</volume> <fpage>1760</fpage>&#x2013;<lpage>1774</lpage>. <pub-id pub-id-type="doi">10.1101/gr.135350.111</pub-id> <pub-id pub-id-type="pmid">22955987</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heinz</surname> <given-names>S.</given-names></name> <name><surname>Benner</surname> <given-names>C.</given-names></name> <name><surname>Spann</surname> <given-names>N.</given-names></name> <name><surname>Bertolino</surname> <given-names>E.</given-names></name> <name><surname>Lin</surname> <given-names>Y. C.</given-names></name> <name><surname>Laslo</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Simple combinations of lineage-determining transcription factors Prime cis-regulatory elements required for macrophage and B cell identities.</article-title> <source><italic>Mol. Cell</italic></source> <volume>38</volume> <fpage>576</fpage>&#x2013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2010.05.004</pub-id> <pub-id pub-id-type="pmid">20513432</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ishiuchi</surname> <given-names>T.</given-names></name> <name><surname>Ohishi</surname> <given-names>H.</given-names></name> <name><surname>Sato</surname> <given-names>T.</given-names></name> <name><surname>Kamimura</surname> <given-names>S.</given-names></name> <name><surname>Yorino</surname> <given-names>M.</given-names></name> <name><surname>Abe</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Zfp281 shapes the transcriptome of trophoblast stem cells and is essential for placental development.</article-title> <source><italic>Cell Rep.</italic></source> <volume>27</volume> <fpage>1742</fpage>&#x2013;<lpage>1754</lpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2019.04.028</pub-id> <pub-id pub-id-type="pmid">31067460</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>J.</given-names></name> <name><surname>Chan</surname> <given-names>Y.-S.</given-names></name> <name><surname>Loh</surname> <given-names>Y.-H.</given-names></name> <name><surname>Cai</surname> <given-names>J.</given-names></name> <name><surname>Tong</surname> <given-names>G.-Q.</given-names></name> <name><surname>Lim</surname> <given-names>C.-A.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>A core Klf circuitry regulates self-renewal of embryonic stem cells.</article-title> <source><italic>Nat. Cell Biol.</italic></source> <volume>10</volume> <fpage>353</fpage>&#x2013;<lpage>360</lpage>. <pub-id pub-id-type="doi">10.1038/ncb1698</pub-id> <pub-id pub-id-type="pmid">18264089</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johnston</surname> <given-names>A. D.</given-names></name> <name><surname>Sim&#x00F5;es-Pires</surname> <given-names>C. A.</given-names></name> <name><surname>Thompson</surname> <given-names>T. V.</given-names></name> <name><surname>Suzuki</surname> <given-names>M.</given-names></name> <name><surname>Greally</surname> <given-names>J. M.</given-names></name></person-group> (<year>2019</year>). <article-title>Functional genetic variants can mediate their regulatory effects through alteration of transcription factor binding.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>10</volume> <fpage>1</fpage>&#x2013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-11412-11415</pub-id> <pub-id pub-id-type="pmid">31375681</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karczewski</surname> <given-names>K. J.</given-names></name> <name><surname>Tatonetti</surname> <given-names>N. P.</given-names></name> <name><surname>Landt</surname> <given-names>S. G.</given-names></name> <name><surname>Yang</surname> <given-names>X.</given-names></name> <name><surname>Slifer</surname> <given-names>T.</given-names></name> <name><surname>Altman</surname> <given-names>R. B.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Cooperative transcription factor associations discovered using regulatory variation.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>108</volume> <fpage>13353</fpage>&#x2013;<lpage>13358</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1103105108</pub-id> <pub-id pub-id-type="pmid">21828005</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname> <given-names>A.</given-names></name> <name><surname>Fornes</surname> <given-names>O.</given-names></name> <name><surname>Stigliani</surname> <given-names>A.</given-names></name> <name><surname>Gheorghe</surname> <given-names>M.</given-names></name> <name><surname>Castro-Mondragon</surname> <given-names>J. A.</given-names></name> <name><surname>van der Lee</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>D260</fpage>&#x2013;<lpage>D266</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx1126</pub-id> <pub-id pub-id-type="pmid">29140473</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kilpinen</surname> <given-names>H.</given-names></name> <name><surname>Waszak</surname> <given-names>S. M.</given-names></name> <name><surname>Gschwind</surname> <given-names>A. R.</given-names></name> <name><surname>Raghav</surname> <given-names>S. K.</given-names></name> <name><surname>Witwicki</surname> <given-names>R. M.</given-names></name> <name><surname>Orioli</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Coordinated effects of sequence variation on DNA binding. Chromatin structure, and transcription.</article-title> <source><italic>Science</italic></source> <volume>342</volume> <fpage>744</fpage>&#x2013;<lpage>747</lpage>. <pub-id pub-id-type="doi">10.1126/science.1242463</pub-id> <pub-id pub-id-type="pmid">24136355</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>S.</given-names></name> <name><surname>Scheffler</surname> <given-names>K.</given-names></name> <name><surname>Halpern</surname> <given-names>A. L.</given-names></name> <name><surname>Bekritsky</surname> <given-names>M. A.</given-names></name> <name><surname>Noh</surname> <given-names>E.</given-names></name> <name><surname>K&#x00E4;llberg</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Strelka2: fast and accurate calling of germline and somatic variants.</article-title> <source><italic>Nat. Methods</italic></source> <volume>15</volume> <fpage>591</fpage>&#x2013;<lpage>594</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-018-0051-x</pub-id> <pub-id pub-id-type="pmid">30013048</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kn&#x00F6;fler</surname> <given-names>M.</given-names></name> <name><surname>Haider</surname> <given-names>S.</given-names></name> <name><surname>Saleh</surname> <given-names>L.</given-names></name> <name><surname>Pollheimer</surname> <given-names>J.</given-names></name> <name><surname>Gamage</surname> <given-names>T. K. J. B.</given-names></name> <name><surname>James</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Human placenta and trophoblast development: key molecular mechanisms and model systems.</article-title> <source><italic>Cell. Mol. Life Sci.</italic></source> <volume>76</volume> <fpage>3479</fpage>&#x2013;<lpage>3496</lpage>. <pub-id pub-id-type="doi">10.1007/s00018-019-03104-3106</pub-id> <pub-id pub-id-type="pmid">31049600</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kubaczka</surname> <given-names>C.</given-names></name> <name><surname>Senner</surname> <given-names>C. E.</given-names></name> <name><surname>Cierlitza</surname> <given-names>M.</given-names></name> <name><surname>Ara&#x00FA;zo-Bravo</surname> <given-names>M. J.</given-names></name> <name><surname>Kuckenberg</surname> <given-names>P.</given-names></name> <name><surname>Peitz</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Direct induction of trophoblast stem cells from murine fibroblasts.</article-title> <source><italic>Cell Stem Cell</italic></source> <volume>17</volume> <fpage>557</fpage>&#x2013;<lpage>568</lpage>. <pub-id pub-id-type="doi">10.1016/j.stem.2015.08.005</pub-id> <pub-id pub-id-type="pmid">26412560</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuleshov</surname> <given-names>M. V.</given-names></name> <name><surname>Jones</surname> <given-names>M. R.</given-names></name> <name><surname>Rouillard</surname> <given-names>A. D.</given-names></name> <name><surname>Fernandez</surname> <given-names>N. F.</given-names></name> <name><surname>Duan</surname> <given-names>Q.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>W90</fpage>&#x2013;<lpage>W97</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw377</pub-id> <pub-id pub-id-type="pmid">27141961</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langley</surname> <given-names>S. A.</given-names></name> <name><surname>Karpen</surname> <given-names>G. H.</given-names></name> <name><surname>Langley</surname> <given-names>C. H.</given-names></name></person-group> (<year>2014</year>). <article-title>Nucleosomes shape DNA polymorphism and divergence.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>10</volume>:<issue>e1004457</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1004457</pub-id> <pub-id pub-id-type="pmid">24991813</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lecomte</surname> <given-names>V.</given-names></name> <name><surname>Meugnier</surname> <given-names>E.</given-names></name> <name><surname>Euthine</surname> <given-names>V.</given-names></name> <name><surname>Durand</surname> <given-names>C.</given-names></name> <name><surname>Freyssenet</surname> <given-names>D.</given-names></name> <name><surname>Nemoz</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>A new role for sterol regulatory element binding protein 1 transcription factors in the regulation of muscle mass and muscle cell differentiation.</article-title> <source><italic>Mol. Cell. Biol.</italic></source> <volume>30</volume> <fpage>1182</fpage>&#x2013;<lpage>1198</lpage>. <pub-id pub-id-type="doi">10.1128/MCB.00690-699</pub-id> <pub-id pub-id-type="pmid">20028734</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>B.</given-names></name> <name><surname>Dewey</surname> <given-names>C. N.</given-names></name></person-group> (<year>2011</year>). <article-title>RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>12</volume>:<issue>323</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-323</pub-id> <pub-id pub-id-type="pmid">21816040</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Fast and accurate short read alignment with Burrows&#x2013;Wheeler transform.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>1754</fpage>&#x2013;<lpage>1760</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id> <pub-id pub-id-type="pmid">19451168</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>The Sequence Alignment/Map format and SAMtools.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id> <pub-id pub-id-type="pmid">19505943</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Gong</surname> <given-names>X.</given-names></name> <name><surname>Rubin</surname> <given-names>L. P.</given-names></name></person-group> (<year>2017</year>). <article-title>Expression of MEF2 transcription factors in human placenta and involvement in trophoblast invasion and differentiation.</article-title> <source><italic>FASEB J.</italic></source> <volume>31</volume>:<issue>692.8</issue>. <pub-id pub-id-type="doi">10.1096/fasebj.31.1_supplement.692.8</pub-id> <pub-id pub-id-type="pmid">29127222</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y. I.</given-names></name> <name><surname>van de Geijn</surname> <given-names>B.</given-names></name> <name><surname>Raj</surname> <given-names>A.</given-names></name> <name><surname>Knowles</surname> <given-names>D. A.</given-names></name> <name><surname>Petti</surname> <given-names>A. A.</given-names></name> <name><surname>Golan</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>RNA splicing is a primary link between genetic variation and disease.</article-title> <source><italic>Science</italic></source> <volume>352</volume> <fpage>600</fpage>&#x2013;<lpage>604</lpage>. <pub-id pub-id-type="doi">10.1126/science.aad9417</pub-id> <pub-id pub-id-type="pmid">27126046</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lister</surname> <given-names>R.</given-names></name> <name><surname>Pelizzola</surname> <given-names>M.</given-names></name> <name><surname>Kida</surname> <given-names>Y. S.</given-names></name> <name><surname>Hawkins</surname> <given-names>R. D.</given-names></name> <name><surname>Nery</surname> <given-names>J. R.</given-names></name> <name><surname>Hon</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells.</article-title> <source><italic>Nature</italic></source> <volume>471</volume> <fpage>68</fpage>&#x2013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1038/nature09798</pub-id> <pub-id pub-id-type="pmid">21289626</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Ding</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Sun</surname> <given-names>X.</given-names></name></person-group> (<year>2017</year>). <article-title>The accessible chromatin landscape during conversion of human embryonic stem cells to trophoblast by bone morphogenetic protein 4.</article-title> <source><italic>Biol. Reprod.</italic></source> <volume>96</volume> <fpage>1267</fpage>&#x2013;<lpage>1278</lpage>. <pub-id pub-id-type="doi">10.1093/biolre/iox028</pub-id> <pub-id pub-id-type="pmid">28430877</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Madsen</surname> <given-names>J. G. S.</given-names></name> <name><surname>Rauch</surname> <given-names>A.</given-names></name> <name><surname>Van Hauwaert</surname> <given-names>E. L.</given-names></name> <name><surname>Schmidt</surname> <given-names>S. F.</given-names></name> <name><surname>Winnefeld</surname> <given-names>M.</given-names></name> <name><surname>Mandrup</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Integrated analysis of motif activity and gene expression changes of transcription factors.</article-title> <source><italic>Genome Res.</italic></source> <volume>28</volume> <fpage>243</fpage>&#x2013;<lpage>255</lpage>. <pub-id pub-id-type="doi">10.1101/gr.227231.117</pub-id> <pub-id pub-id-type="pmid">29233921</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neph</surname> <given-names>S.</given-names></name> <name><surname>Kuehn</surname> <given-names>M. S.</given-names></name> <name><surname>Reynolds</surname> <given-names>A. P.</given-names></name> <name><surname>Haugen</surname> <given-names>E.</given-names></name> <name><surname>Thurman</surname> <given-names>R. E.</given-names></name> <name><surname>Johnson</surname> <given-names>A. K.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>BEDOPS: high-performance genomic feature operations.</article-title> <source><italic>Bioinformatics</italic></source> <volume>28</volume> <fpage>1919</fpage>&#x2013;<lpage>1920</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts277</pub-id> <pub-id pub-id-type="pmid">22576172</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nichols</surname> <given-names>J.</given-names></name> <name><surname>Zevnik</surname> <given-names>B.</given-names></name> <name><surname>Anastassiadis</surname> <given-names>K.</given-names></name> <name><surname>Niwa</surname> <given-names>H.</given-names></name> <name><surname>Klewe-Nebenius</surname> <given-names>D.</given-names></name> <name><surname>Chambers</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>1998</year>). <article-title>Formation of pluripotent stem cells in the mammalian embryo depends on the POU transcription factor Oct4.</article-title> <source><italic>Cell</italic></source> <volume>95</volume> <fpage>379</fpage>&#x2013;<lpage>391</lpage>. <pub-id pub-id-type="doi">10.1016/S0092-8674(00)81769-81769</pub-id> <pub-id pub-id-type="pmid">9814708</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x2019;Geen</surname> <given-names>H.</given-names></name> <name><surname>Lin</surname> <given-names>Y.-H.</given-names></name> <name><surname>Xu</surname> <given-names>X.</given-names></name> <name><surname>Echipare</surname> <given-names>L.</given-names></name> <name><surname>Komashko</surname> <given-names>V. M.</given-names></name> <name><surname>He</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Genome-wide binding of the orphan nuclear receptor TR4 suggests its general role in fundamental biological processes.</article-title> <source><italic>BMC Genomics</italic></source> <volume>11</volume>:<issue>689</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-689</pub-id> <pub-id pub-id-type="pmid">21126370</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parisuthiman</surname> <given-names>D.</given-names></name> <name><surname>Mochida</surname> <given-names>Y.</given-names></name> <name><surname>Duarte</surname> <given-names>W. R.</given-names></name> <name><surname>Yamauchi</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Biglycan modulates osteoblast differentiation and matrix mineralization.</article-title> <source><italic>J. Bone Miner. Res.</italic></source> <volume>20</volume> <fpage>1878</fpage>&#x2013;<lpage>1886</lpage>. <pub-id pub-id-type="doi">10.1359/JBMR.050612</pub-id> <pub-id pub-id-type="pmid">16160746</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pollheimer</surname> <given-names>J.</given-names></name> <name><surname>Kn&#x00F6;fler</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Signalling pathways regulating the invasive differentiation of human trophoblasts: a review.</article-title> <source><italic>Placenta</italic></source> <volume>26</volume> <fpage>S21</fpage>&#x2013;<lpage>S30</lpage>. <pub-id pub-id-type="doi">10.1016/j.placenta.2004.11.013</pub-id> <pub-id pub-id-type="pmid">15837062</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pruitt</surname> <given-names>K. D.</given-names></name> <name><surname>Tatusova</surname> <given-names>T.</given-names></name> <name><surname>Maglott</surname> <given-names>D. R.</given-names></name></person-group> (<year>2007</year>). <article-title>NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>35</volume> <fpage>D61</fpage>&#x2013;<lpage>D65</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkl842</pub-id> <pub-id pub-id-type="pmid">17130148</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Puc&#x00E9;at</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <article-title>TGF&#x03B2; in the differentiation of embryonic stem cells.</article-title> <source><italic>Cardiovasc. Res.</italic></source> <volume>74</volume> <fpage>256</fpage>&#x2013;<lpage>261</lpage>. <pub-id pub-id-type="doi">10.1016/j.cardiores.2006.12.012</pub-id> <pub-id pub-id-type="pmid">17250814</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ritchie</surname> <given-names>M. E.</given-names></name> <name><surname>Phipson</surname> <given-names>B.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Hu</surname> <given-names>Y.</given-names></name> <name><surname>Law</surname> <given-names>C. W.</given-names></name> <name><surname>Shi</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>limma powers differential expression analyses for RNA-sequencing and microarray studies.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume>:<issue>e47</issue>. <pub-id pub-id-type="doi">10.1093/nar/gkv007</pub-id> <pub-id pub-id-type="pmid">25605792</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roadmap Epigenomics Consortium Kundaje</surname> <given-names>A.</given-names></name> <name><surname>Meuleman</surname> <given-names>W.</given-names></name> <name><surname>Ernst</surname> <given-names>J.</given-names></name> <name><surname>Bilenky</surname> <given-names>M.</given-names></name> <name><surname>Yen</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Integrative analysis of 111 reference human epigenomes.</article-title> <source><italic>Nature</italic></source> <volume>518</volume> <fpage>317</fpage>&#x2013;<lpage>330</lpage>. <pub-id pub-id-type="doi">10.1038/nature14248</pub-id> <pub-id pub-id-type="pmid">25693563</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rouillard</surname> <given-names>A. D.</given-names></name> <name><surname>Gundersen</surname> <given-names>G. W.</given-names></name> <name><surname>Fernandez</surname> <given-names>N. F.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Monteiro</surname> <given-names>C. D.</given-names></name> <name><surname>McDermott</surname> <given-names>M. G.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins.</article-title> <source><italic>Database</italic></source> <volume>2016</volume>:<issue>baw100</issue>. <pub-id pub-id-type="doi">10.1093/database/baw100</pub-id> <pub-id pub-id-type="pmid">27374120</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sheridan</surname> <given-names>M. A.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Jain</surname> <given-names>A.</given-names></name> <name><surname>Lyons</surname> <given-names>A. S.</given-names></name> <name><surname>Yang</surname> <given-names>P.</given-names></name> <name><surname>Brahmasani</surname> <given-names>S. R.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Early onset preeclampsia in a model for human placental trophoblast.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>116</volume> <fpage>4336</fpage>&#x2013;<lpage>4345</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1816150116</pub-id> <pub-id pub-id-type="pmid">30787190</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sherry</surname> <given-names>S. T.</given-names></name> <name><surname>Ward</surname> <given-names>M. H.</given-names></name> <name><surname>Kholodov</surname> <given-names>M.</given-names></name> <name><surname>Baker</surname> <given-names>J.</given-names></name> <name><surname>Phan</surname> <given-names>L.</given-names></name> <name><surname>Smigielski</surname> <given-names>E. M.</given-names></name><etal/></person-group> (<year>2001</year>). <article-title>dbSNP: the NCBI database of genetic variation.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>29</volume> <fpage>308</fpage>&#x2013;<lpage>311</lpage>. <pub-id pub-id-type="doi">10.1093/nar/29.1.308</pub-id> <pub-id pub-id-type="pmid">11125122</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shimasaki</surname> <given-names>S.</given-names></name> <name><surname>Moore</surname> <given-names>R. K.</given-names></name> <name><surname>Otsuka</surname> <given-names>F.</given-names></name> <name><surname>Erickson</surname> <given-names>G. F.</given-names></name></person-group> (<year>2004</year>). <article-title>The bone morphogenetic protein system in mammalian reproduction.</article-title> <source><italic>Endocr. Rev.</italic></source> <volume>25</volume> <fpage>72</fpage>&#x2013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.1210/er.2003-0007</pub-id> <pub-id pub-id-type="pmid">14769828</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shyr</surname> <given-names>C.-R.</given-names></name> <name><surname>Kang</surname> <given-names>H.-Y.</given-names></name> <name><surname>Tsai</surname> <given-names>M.-Y.</given-names></name> <name><surname>Liu</surname> <given-names>N.-C.</given-names></name> <name><surname>Ku</surname> <given-names>P.-Y.</given-names></name> <name><surname>Huang</surname> <given-names>K.-E.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Roles of testicular orphan nuclear receptors 2 and 4 in early embryonic development and embryonic stem cells.</article-title> <source><italic>Endocrinology</italic></source> <volume>150</volume> <fpage>2454</fpage>&#x2013;<lpage>2462</lpage>. <pub-id pub-id-type="doi">10.1210/en.2008-1165</pub-id> <pub-id pub-id-type="pmid">19131575</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Su</surname> <given-names>J.</given-names></name> <name><surname>Shao</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Wu</surname> <given-names>Q.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name></person-group> (<year>2012</year>). <article-title>Genome-wide dynamic changes of DNA methylation of repetitive elements in human embryonic stem cells and fetal fibroblasts.</article-title> <source><italic>Genomics</italic></source> <volume>99</volume> <fpage>10</fpage>&#x2013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1016/j.ygeno.2011.10.004</pub-id> <pub-id pub-id-type="pmid">22044633</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takahashi</surname> <given-names>K.</given-names></name> <name><surname>Yamanaka</surname> <given-names>S.</given-names></name></person-group> (<year>2006</year>). <article-title>Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors.</article-title> <source><italic>Cell</italic></source> <volume>126</volume> <fpage>663</fpage>&#x2013;<lpage>676</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2006.07.024</pub-id> <pub-id pub-id-type="pmid">16904174</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><collab>The 1000 Genomes Project Consortium</collab> (<year>2010</year>). <article-title>A map of human genome variation from population scale sequencing.</article-title> <source><italic>Nature</italic></source> <volume>467</volume> <fpage>1061</fpage>&#x2013;<lpage>1073</lpage>. <pub-id pub-id-type="doi">10.1038/nature09534</pub-id> <pub-id pub-id-type="pmid">20981092</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Hakonarson</surname> <given-names>H.</given-names></name></person-group> (<year>2010</year>). <article-title>ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>38</volume>:<issue>e164</issue>. <pub-id pub-id-type="doi">10.1093/nar/gkq603</pub-id> <pub-id pub-id-type="pmid">20601685</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Song</surname> <given-names>F.</given-names></name> <name><surname>Zhu</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>GSA: genome sequence archive<sup>&#x2217;</sup>.</article-title> <source><italic>Genomics Proteom. Bioinform.</italic></source> <volume>15</volume> <fpage>14</fpage>&#x2013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1016/j.gpb.2017.01.001</pub-id> <pub-id pub-id-type="pmid">28387199</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Oron</surname> <given-names>E.</given-names></name> <name><surname>Nelson</surname> <given-names>B.</given-names></name> <name><surname>Razis</surname> <given-names>S.</given-names></name> <name><surname>Ivanova</surname> <given-names>N.</given-names></name></person-group> (<year>2012</year>). <article-title>Distinct lineage specification roles for NANOG, OCT4, and SOX2 in human embryonic stem cells.</article-title> <source><italic>Cell Stem Cell</italic></source> <volume>10</volume> <fpage>440</fpage>&#x2013;<lpage>454</lpage>. <pub-id pub-id-type="doi">10.1016/j.stem.2012.02.016</pub-id> <pub-id pub-id-type="pmid">22482508</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xie</surname> <given-names>W.</given-names></name> <name><surname>Schultz</surname> <given-names>M. D.</given-names></name> <name><surname>Lister</surname> <given-names>R.</given-names></name> <name><surname>Hou</surname> <given-names>Z.</given-names></name> <name><surname>Rajagopal</surname> <given-names>N.</given-names></name> <name><surname>Ray</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Epigenomic analysis of multilineage differentiation of human embryonic stem cells.</article-title> <source><italic>Cell</italic></source> <volume>153</volume> <fpage>1134</fpage>&#x2013;<lpage>1148</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2013.04.022</pub-id> <pub-id pub-id-type="pmid">23664764</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>J. G.</given-names></name> <name><surname>Zhu</surname> <given-names>S. Y.</given-names></name> <name><surname>Heng</surname> <given-names>B. C.</given-names></name> <name><surname>Dissanayaka</surname> <given-names>W. L.</given-names></name> <name><surname>Zhang</surname> <given-names>C. F.</given-names></name></person-group> (<year>2017</year>). <article-title>TGF-&#x03B2;1-induced differentiation of SHED into functional smooth muscle cells.</article-title> <source><italic>Stem Cell Res. Ther.</italic></source> <volume>8</volume>:<issue>10</issue>. <pub-id pub-id-type="doi">10.1186/s13287-016-0459-450</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>R.-H.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>D. S.</given-names></name> <name><surname>Li</surname> <given-names>R.</given-names></name> <name><surname>Addicks</surname> <given-names>G. C.</given-names></name> <name><surname>Glennon</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2002</year>). <article-title>BMP4 initiates human embryonic stem cell differentiation to trophoblast.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>20</volume>:<issue>1261</issue>. <pub-id pub-id-type="doi">10.1038/nbt761</pub-id> <pub-id pub-id-type="pmid">12426580</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yabe</surname> <given-names>S.</given-names></name> <name><surname>Alexenko</surname> <given-names>A. P.</given-names></name> <name><surname>Amita</surname> <given-names>M.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Schust</surname> <given-names>D. J.</given-names></name> <name><surname>Sadovsky</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Comparison of syncytiotrophoblast generated from human embryonic stem cells and from term placentas.</article-title> <source><italic>PNAS</italic></source> <volume>113</volume> <fpage>E2598</fpage>&#x2013;<lpage>E2607</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1601630113</pub-id> <pub-id pub-id-type="pmid">27051068</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>R.</given-names></name> <name><surname>Luan</surname> <given-names>Y.</given-names></name> <name><surname>Yao</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Tumor necrosis factor-&#x03B1; induced protein 8: pathophysiology, clinical significance, and regulatory mechanism.</article-title> <source><italic>Int. J. Biol. Sci.</italic></source> <volume>14</volume> <fpage>398</fpage>&#x2013;<lpage>405</lpage>. <pub-id pub-id-type="doi">10.7150/ijbs.23268</pub-id> <pub-id pub-id-type="pmid">29725261</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Meyer</surname> <given-names>C. A.</given-names></name> <name><surname>Eeckhoute</surname> <given-names>J.</given-names></name> <name><surname>Johnson</surname> <given-names>D. S.</given-names></name> <name><surname>Bernstein</surname> <given-names>B. E.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Model-based analysis of ChIP-Seq (MACS).</article-title> <source><italic>Genome Biol.</italic></source> <volume>9</volume>:<issue>R137</issue>. <pub-id pub-id-type="doi">10.1186/gb-2008-9-9-r137</pub-id> <pub-id pub-id-type="pmid">18798982</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>W.</given-names></name> <name><surname>Xiao</surname> <given-names>J.</given-names></name> <name><surname>Bao</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>F.</given-names></name> <name><surname>Hao</surname> <given-names>L.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Database resources of the BIG data center in 2019.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>D8</fpage>&#x2013;<lpage>D14</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky993</pub-id> <pub-id pub-id-type="pmid">30365034</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>H. Y.</given-names></name> <name><surname>Katsman</surname> <given-names>Y.</given-names></name> <name><surname>Dhaliwal</surname> <given-names>N. K.</given-names></name> <name><surname>Davidson</surname> <given-names>S.</given-names></name> <name><surname>Macpherson</surname> <given-names>N. N.</given-names></name> <name><surname>Sakthidevi</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>A Sox2 distal enhancer cluster regulates embryonic stem cell differentiation potential.</article-title> <source><italic>Genes Dev.</italic></source> <volume>28</volume> <fpage>2699</fpage>&#x2013;<lpage>2711</lpage>. <pub-id pub-id-type="doi">10.1101/gad.248526.114</pub-id> <pub-id pub-id-type="pmid">25512558</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zweier</surname> <given-names>M.</given-names></name> <name><surname>Gregor</surname> <given-names>A.</given-names></name> <name><surname>Zweier</surname> <given-names>C.</given-names></name> <name><surname>Engels</surname> <given-names>H.</given-names></name> <name><surname>Sticht</surname> <given-names>H.</given-names></name> <name><surname>Wohlleber</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Mutations in MEF2C from the 5q14.3<italic>q</italic>15 microdeletion syndrome region are a frequent cause of severe mental retardation and diminish MECP2 and CDKL5 expression.</article-title> <source><italic>Hum. Mutat.</italic></source> <volume>31</volume> <fpage>722</fpage>&#x2013;<lpage>733</lpage>. <pub-id pub-id-type="doi">10.1002/humu.21253</pub-id> <pub-id pub-id-type="pmid">20513142</pub-id></citation></ref>
</ref-list><fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://broadinstitute.github.io/picard">http://broadinstitute.github.io/picard</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.encodeproject.org/">https://www.encodeproject.org/</ext-link></p></fn>
<fn id="footnote3">
<label>3</label>
<p><ext-link ext-link-type="uri" xlink:href="http://jaspar.genereg.net/">http://jaspar.genereg.net/</ext-link></p></fn>
</fn-group>
</back>
</article>
