Colon Adenocarcinoma: Correlation between mRNA expression and DNA methylation
(primary solid tumor cohort)
Maintained by Richard Park (Boston University/Harvard Medical School)
Overview
Introduction

The role of general epigenetic mechanisms in carcinogenesis and tumor aggressiveness is well documented: CpG island hyper-methylation silences tumor suppressor genes, whereas hypo-methylation promotes the transcriptional activation of oncogenes and induces chromosomal instability. This pipeline calculates and identifies correlations between DNA methylation and gene expression profiles using the available array technologies.

Summary

The top 25 correlated methylation probes per gene are displayed. Total number of matched samples = 31. Number of gene expression samples = 192. Number of methylation samples = 31.

Results
Correlation Histogram

Figure 1.  Histogram of methylation correlation values. n is the number of matched samples between Level 3 CpG site methylation and Level 3 gene expression arrays. Number of Matched Samples = 31

Qvalue Summary Plots

Figure 2.  Plot 1. The estimated pi_0 versus the tuning parameter lambda. Plot 2. The q-values versus the p-values. Plot 3. The number of significant tests versus each q-value cutoff. Plot 4. The number of expected false positives versus the number of significant tests.The first is a plot of the estimate of pi_0 versus its tuning parameter lambda. In most cases, as lambda gets larger, the bias of the estimate decreases, yet the variance increases. Comparing your estimate of pi_0 to this plot allows one to guage its quality. The remaining three plots show how many tests are significant, as well as how many false positives to expect for each q-value cut-off.

Negative Correlation between Methylation and Gene Expression

Table 1.  Get Full Table Top 25 most negatively correlated methylation probes. Correlation Coefficient: See Methods & Data below. Pval and Qval: P- and Q-values of the correlation coefficient. Expression Mean: mean detection level of gene expression probes. Methylation Mean: mean detection level of CpG methylation probes.

Meth_Probe Gene Chrom Position Corr_Coeff Pval Qval Expr_Mean Meth_Mean
cg18542098 C19orf46 19 36499326 -0.91 6.1e-08 0.000036 6.135 0.466128
cg16021126 SERP2 13 44947611 -0.91 7e-08 0.000036 5.413 0.381769
cg05218245 LY6G6D 6 31683352 -0.9 4.7e-07 0.000036 NA 0.580318
cg12889195 LOC654433 2 113992843 -0.89 5.3e-07 0.000036 NA 0.703368
cg25732104 GJB5 1 35220710 -0.89 1.2e-06 0.000036 NA 0.645302
cg25062184 DUSP27 1 167088558 -0.89 2.1e-07 0.000036 7.437 0.686318
cg04515996 TBX18 6 85474028 -0.89 3.8e-07 0.000036 NA 0.568789
cg02075410 MAB21L2 4 151504138 -0.88 3.9e-07 0.000036 NA 0.824065
cg23059946 IGFBP2 2 217500594 -0.88 2.4e-07 0.000036 11.608 0.464579
cg15063355 CHFR 12 133464596 -0.88 2.5e-07 0.000036 8.500 0.248936
cg09360501 ZNF43 19 22018958 -0.87 3.2e-07 0.000036 4.918 0.436748
cg08124399 DDX43 6 74104868 -0.87 0 0 NA 0.789264
cg12497786 FAM50B 6 3849577 -0.87 3.5e-07 0.000036 6.378 0.668238
cg20092122 BST2 19 17517221 -0.86 4e-07 0.000036 10.013 0.401719
cg08204280 IRX2 5 2751839 -0.86 1.2e-06 0.000036 NA 0.320043
cg14830003 DNALI1 1 38022657 -0.85 4.7e-07 0.000036 5.711 0.772415
cg02512860 CLDN15 7 100880751 -0.85 5.1e-07 0.000036 8.733 0.519830
cg21626848 FKBP10 17 39969267 -0.85 5.3e-07 0.000036 10.754 0.446329
cg16920620 CHN2 7 29519651 -0.85 5.3e-07 0.000036 8.534 0.516679
cg19817882 LEFTY1 1 226075069 -0.84 6e-07 0.000036 11.048 0.657545
cg11161150 RGMB 5 98119892 -0.84 6e-07 0.000036 9.598 0.721245
cg02798548 ZNF543 19 57832235 -0.84 8.4e-07 0.000036 NA 0.274889
cg13672300 ZNF528 19 52901307 -0.84 8.4e-07 0.000036 NA 0.544574
cg08293086 MRPS21 1 150266477 -0.83 6.5e-07 0.000036 9.446 0.379514
cg27315497 SLC43A1 11 57281561 -0.83 6.8e-07 0.000036 9.538 0.414590
cg15647527 C13orf1 13 50510698 -0.83 6.8e-07 0.000036 8.796 0.023991
Methods & Data
Input

Methylation Array Platforms: Illumina Infinium HumanMethylation27, Illumina DNA Methylation OMA002, Illumina DNA Methylation OMA003

  • methylation file = /xchip/cga/gdac-prod/tcga-gdac/jobResults/GDAC_MethylationPreprocess/COAD-TP/2401411/0.GDAC_MethylationPreprocess.Finished/COAD-TP.meth.for_correlation.filtered_data.txt

Gene Expression Platforms: Agilent 244K Gene Expression G4502A-07-1, Agilent 244K Gene Expression G4502A-07-2, Agilent 244K Gene Expression G4502A-07-3, Affymetrix Human Exon 1.0 ST Array, Affymetrix HT Human Genome U133 Array

  • gene expression file = /xchip/cga/gdac-prod/tcga-gdac/jobResults/mRNAseq_preprocessor/COAD-TP/2398216/0.mRNAseq_preprocessor.Finished/COAD-TP.uncv2.mRNAseq_RSEM_normalized_log2.txt

Correlation Coefficient

Level 3 methylation and gene expression arrays were paired on the basis of Entrez Gene ID concordance. The association between CpG site methylation and the level of expression of the corresponding genes was determined by calculating a correlation measure between the two platforms.

  • correlation measure = Spearman

Download Results

This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.