Data Accessibility and Benefit-Sharing
Raw sequence reads (SRA) have been deposited at NCBI under Bioproject
PRJNA952864.
Figure legends
Figure 1. A. Metagene profile of analyzed individuals from 2kb upstream
of the Transcription Start Site (TSS) to 2 kb downstream of the
Transcription End Sites (TES) B. Mean methylation profiles across
genomic context centered on the D. labrax genome in freshwater- (FW) and
seawater-acclimated fish (SW). All genomic regions were scaled to the
same length.
Figure 2. Cluster analysis based on CpG methylation profiles in 5
freshwater-exposed (FW, black discs) and 5 seawater-exposed D. labrax
(SW, white discs). A. Hierarchical clustering of samples was performed
using Ward’s method based on Pearson’s correlation distance for cytosine
CpG methylation. B. Principal Component Analysis (PCA) showing (PC1xPC2)
coordination plane.
Figure 3. Cluster analysis based on gene expression levels in 5
freshwater-exposed (FW, black discs) and 5
seawater-exposed D. labrax (SW, white discs). A. Hierarchical clustering
of samples was performed using Ward’s method based on Pearson’s
correlation distance for gene expression levels. B. Principal Component
Analysis (PCA) showing (PC1xPC2) coordination plane.
Figure 4. DNA methylation levels (in %) in different genomic context
for different gene expression levels. Violin plots showing DNA
methylation in promoter, first exon, first intron and gene body at
different gene expression levels (divided into deciles based on
increasing ranking of gene expression measured as log2‑transformed
normalized counts from DEseq2). Central red dots represent the median of
the distribution. Correlations between DNA methylation and gene
expression were measured using Spearman’s rank correlation coefficient.
Figure 5. Genome-wide profile of CpG methylation. A. Chromosomal
distribution of hyper- and hypo- methylated regions in freshwater-
compared to seawater exposed D. labrax. p-value < 0.05 and
methylation difference ≥ 25 %. B. Distribution of the differentially
methylated regions (DMRs) in different genomic context. C. and D.
Percentage of DMRs per exon and intron position normalized by the total
size (in bp) of exons and introns respectively.
Figure 6. Enrichment analysis of genetic ontology (GO) terms. The 5 most
significant GO terms for each group are represented. A. DNA methylome.
B. RNA transcriptome. FW : fresh water; GB : gene body;
PR : promoter; SW : seawater. The size of the dots represent the ratio
of genes associated with the GO term and the colors indicate the
p-values (threshold set at 0.05). Discussed GO terms are indicated by
stars.
Figure 7. Venn analysis of methylome and transcriptome showing
significantly overrepresented (p < 0.05) GO terms related to
DEGs and DMRs (in genes and promoters) in FW vs SW conditions. A.
Biological process. B. Molecular function. C. Cellular component. DMRs:
differentially methylated regions ; DEGs: differentially expressed
genes.
Figure 8. Comparison between freshwater (FW)-triggered gene expression
changes and methylation changes in different genomic contexts. A:
Proportion of down- and upregulated genes that are hypomethylated. B:
Proportion of down- and upregulated genes that are hypermethylated. Blue
color = downregulation in FW and red color = upregulation in FW.
Figure 9. Enrichment plot of the KEGG pathways enrichment analysis of
differentially expressed and differentially methylated genes in
promoters, first exons or first introns. The 5 KEGG pathways of each
dataset displaying the most significant p-values have been represented.
The proportion of clusters in the pie chart was determined by the
proportion of genes in a specific category. The size of each circle
represents the number of genes involved in each pathways (p-value
< 0.05).
Table 1. List of selected differentially expressed and methylated genes
Figure 1S. Heatmap of significant differentially expressed genes across
all samples with adjusted p-value < 0.05. Each row represents
a differentially expressed gene between salinities arranged by ascending
order of fold change from top to bottom. The red and purple colors
indicate the log10 (baseMean+1) values. On the right, the colors
indicate log2FC for downregulated genes (upper part of the figure, in
light green) or upregulated genes (bottom of the figure, in red).
Samples were visually clustered using hierarchical clustering. Dark
green (top of the figure) = 5 SW individuals and purple = 5 FW
individuals.