Correlation between mRNA expression and DNA methylation
Lymphoid Neoplasm Diffuse Large B-cell Lymphoma (Primary solid tumor)
15 July 2014  |  analyses__2014_07_15
Maintainer Information
Citation Information
Maintained by Richard Park (Boston University/Harvard Medical School)
Cite as Broad Institute TCGA Genome Data Analysis Center (2014): Correlation between mRNA expression and DNA methylation. Broad Institute of MIT and Harvard. doi:10.7908/C1668BX8
Overview
Introduction

The role of general epigenetic mechanisms in carcinogenesis and tumor aggressiveness is well documented: CpG island hyper-methylation silences tumor suppressor genes, whereas hypo-methylation promotes the transcriptional activation of oncogenes and induces chromosomal instability. This pipeline calculates and identifies correlations between DNA methylation and gene expression profiles using the available array technologies.

Summary

The top 25 correlated methylation probes per gene are displayed. Total number of matched samples = 28. Number of gene expression samples = 28. Number of methylation samples = 48.

Results
Correlation Histogram

Figure 1.  Histogram of methylation correlation values. n is the number of matched samples between Level 3 CpG site methylation and Level 3 gene expression arrays. Number of Matched Samples = 28

Qvalue Summary Plots

Figure 2.  Plot 1. The estimated pi_0 versus the tuning parameter lambda. Plot 2. The q-values versus the p-values. Plot 3. The number of significant tests versus each q-value cutoff. Plot 4. The number of expected false positives versus the number of significant tests.The first is a plot of the estimate of pi_0 versus its tuning parameter lambda. In most cases, as lambda gets larger, the bias of the estimate decreases, yet the variance increases. Comparing your estimate of pi_0 to this plot allows one to guage its quality. The remaining three plots show how many tests are significant, as well as how many false positives to expect for each q-value cut-off.

Negative Correlation between Methylation and Gene Expression

Table 1.  Get Full Table Top 25 most negatively correlated methylation probes. Correlation Coefficient: See Methods & Data below. Pval and Qval: P- and Q-values of the correlation coefficient. Expression Mean: mean detection level of gene expression probes. Methylation Mean: mean detection level of CpG methylation probes.

Meth_Probe Gene Chrom Position Corr_Coeff Pval Qval Expr_Mean Meth_Mean
cg01312482 ZNF879 5 178451176 -0.94 2.8e-07 0.000098 4.5 0.59
cg15708153 ZNF542 19 56879554 -0.93 3.6e-07 0.000098 6.6 0.38
cg18633684 EID3 12 104697387 -0.92 4.3e-07 0.000098 5.3 0.45
cg24886770 ZNF43 19 22018538 -0.92 4.7e-07 0.000098 7.1 0.15
cg01467419 ZNF57 19 2900365 -0.9 6.5e-07 0.000098 5.2 0.29
cg08221207 FEZ1 11 125366421 -0.9 6.9e-07 0.000098 8.7 0.28
cg14556070 ZNF256 19 58458917 -0.9 7.1e-07 0.000098 5.8 0.25
cg22512531 CRTAM 11 122709551 -0.9 7.2e-07 0.000098 6.9 0.76
cg24040502 NLRP2 19 55477810 -0.89 8.2e-07 0.000098 7.1 0.73
cg07058027 CYB5R2 11 7694188 -0.89 8.9e-07 0.000098 8.1 0.65
cg07393419 TRIM50 7 72741883 -0.88 0 0 -0.11 0.98
cg00007269 ZNF77 19 2945106 -0.88 9e-07 0.000098 5 0.26
cg07225598 HIST3H2A 1 228646916 -0.88 9.1e-07 0.000098 3.7 0.44
cg11763394 LOC654433 2 113992921 -0.88 9.2e-07 0.000098 7.1 0.71
cg24934561 KIAA0495 1 3663902 -0.88 9.8e-07 0.000098 8 0.3
cg21550016 LOC654433 2 113992930 -0.87 1e-06 0.000098 7.1 0.7
cg16452866 BCL11B 14 99655676 -0.87 1e-06 0.000098 7.6 0.78
cg16468187 C16orf5 16 4588687 -0.87 1.1e-06 0.000098 9 0.41
cg19220282 SLC1A4 2 65217528 -0.87 1.1e-06 0.000098 11 0.21
cg17271561 ZNF610 19 52839334 -0.87 1.1e-06 0.000098 4 0.27
cg06978388 ZNF512B 20 62600831 -0.87 1.1e-06 0.000098 8.3 0.21
cg15167155 ZNF347 19 53662499 -0.86 1.1e-06 0.000098 5.2 0.18
cg25619607 IMPACT 18 22006735 -0.86 1.1e-06 0.000098 7.6 0.33
cg11280525 LOC645323 5 87955695 -0.86 1.9e-06 0.000098 2.2 0.2
cg11497957 ZNF572 8 125985846 -0.86 1.2e-06 0.000098 5.4 0.35
cg18734591 WDR60 7 158657749 -0.86 1.2e-06 0.000098 7.2 0.92
Methods & Data
Input

Methylation Array Platforms: Illumina Infinium HumanMethylation27, Illumina Infinium HumanMethylation450, Illumina DNA Methylation OMA002, Illumina DNA Methylation OMA003

  • methylation file = /xchip/cga/gdac-prod/tcga-gdac/jobResults/GDAC_MethylationPreprocess/DLBC-TP/9830998/DLBC-TP.meth.by_min_expr_corr.data.txt

Gene Expression Platforms: Agilent 244K Gene Expression G4502A-07-1, Agilent 244K Gene Expression G4502A-07-2, Agilent 244K Gene Expression G4502A-07-3, Affymetrix Human Exon 1.0 ST Array, Affymetrix HT Human Genome U133 Array

  • gene expression file = /xchip/cga/gdac-prod/tcga-gdac/jobResults/mRNAseq_preprocessor/DLBC-TP/9829872/DLBC-TP.uncv2.mRNAseq_RSEM_normalized_log2.txt

Correlation Coefficient

Level 3 methylation and gene expression arrays were paired on the basis of Entrez Gene ID concordance. The association between CpG site methylation and the level of expression of the corresponding genes was determined by calculating a correlation measure between the two platforms.

  • correlation measure = Spearman

Download Results

In addition to the links below, the full results of the analysis summarized in this report can also be downloaded programmatically using firehose_get, or interactively from either the Broad GDAC website or TCGA Data Coordination Center Portal.