This pipeline uses various statistical tests to identify genes whose promoter methylation levels correlated to selected clinical features.
Testing the association between 20183 genes and 9 clinical features across 285 samples, statistically thresholded by Q value < 0.05, 6 clinical features related to at least one genes.
-
19 genes correlated to 'AGE'.
-
ELOVL2 , MRPS33 , TSPYL5 , DOK6 , ZYG11A , ...
-
516 genes correlated to 'NEOPLASM.DISEASESTAGE'.
-
OPRK1 , AVPR1A , NOS1 , GRIA2 , CRHBP , ...
-
805 genes correlated to 'PATHOLOGY.T.STAGE'.
-
NOS1 , ACTA1 , OPRK1 , AVPR1A , DBX2 , ...
-
8 genes correlated to 'PATHOLOGY.N.STAGE'.
-
CARD16 , CASP1 , ZFP64 , TSPO , SFXN5 , ...
-
88 genes correlated to 'PATHOLOGY.M.STAGE'.
-
C20ORF112 , PLCD1 , HTR6 , SESN1__1 , STK24 , ...
-
90 genes correlated to 'GENDER'.
-
ALG11__1 , UTP14C , CCBL2 , RBMXL1 , KIF4B , ...
-
No genes correlated to 'Time to Death', 'KARNOFSKY.PERFORMANCE.SCORE', and 'NUMBERPACKYEARSSMOKED'.
Complete statistical result table is provided in Supplement Table 1
Clinical feature | Statistical test | Significant genes | Associated with | Associated with | ||
---|---|---|---|---|---|---|
Time to Death | Cox regression test | N=0 | ||||
AGE | Spearman correlation test | N=19 | older | N=15 | younger | N=4 |
NEOPLASM DISEASESTAGE | ANOVA test | N=516 | ||||
PATHOLOGY T STAGE | Spearman correlation test | N=805 | higher stage | N=358 | lower stage | N=447 |
PATHOLOGY N STAGE | t test | N=8 | class1 | N=0 | class0 | N=8 |
PATHOLOGY M STAGE | t test | N=88 | m1 | N=80 | m0 | N=8 |
GENDER | t test | N=90 | male | N=11 | female | N=79 |
KARNOFSKY PERFORMANCE SCORE | Spearman correlation test | N=0 | ||||
NUMBERPACKYEARSSMOKED | Spearman correlation test | N=0 |
Time to Death | Duration (Months) | 0.2-120.6 (median=31.3) |
censored | N = 188 | |
death | N = 97 | |
Significant markers | N = 0 |
AGE | Mean (SD) | 61.53 (12) |
Significant markers | N = 19 | |
pos. correlated | 15 | |
neg. correlated | 4 |
SpearmanCorr | corrP | Q | |
---|---|---|---|
ELOVL2 | 0.4638 | 1.31e-16 | 2.64e-12 |
MRPS33 | 0.3371 | 5.272e-09 | 0.000106 |
TSPYL5 | 0.3261 | 1.746e-08 | 0.000352 |
DOK6 | 0.322 | 2.691e-08 | 0.000543 |
ZYG11A | 0.3163 | 4.844e-08 | 0.000977 |
ME3 | -0.3131 | 6.727e-08 | 0.00136 |
PVT1 | -0.3063 | 1.319e-07 | 0.00266 |
RANBP17 | 0.3052 | 1.48e-07 | 0.00299 |
UNC80 | 0.2973 | 3.184e-07 | 0.00642 |
PCOLCE2 | -0.2952 | 3.882e-07 | 0.00783 |
NEOPLASM.DISEASESTAGE | Labels | N |
STAGE I | 130 | |
STAGE II | 28 | |
STAGE III | 73 | |
STAGE IV | 54 | |
Significant markers | N = 516 |
ANOVA_P | Q | |
---|---|---|
OPRK1 | 3.256e-16 | 6.57e-12 |
AVPR1A | 1.204e-15 | 2.43e-11 |
NOS1 | 5.587e-14 | 1.13e-09 |
GRIA2 | 7.964e-14 | 1.61e-09 |
CRHBP | 8.798e-14 | 1.78e-09 |
PCDHGA1__5 | 1.348e-13 | 2.72e-09 |
PCDHGA10__2 | 1.348e-13 | 2.72e-09 |
PCDHGA11__1 | 1.348e-13 | 2.72e-09 |
PCDHGA2__5 | 1.348e-13 | 2.72e-09 |
PCDHGA3__5 | 1.348e-13 | 2.72e-09 |
PATHOLOGY.T.STAGE | Mean (SD) | 1.96 (0.98) |
N | ||
1 | 133 | |
2 | 37 | |
3 | 107 | |
4 | 8 | |
Significant markers | N = 805 | |
pos. correlated | 358 | |
neg. correlated | 447 |
SpearmanCorr | corrP | Q | |
---|---|---|---|
NOS1 | 0.471 | 3.822e-17 | 7.71e-13 |
ACTA1 | 0.4688 | 5.581e-17 | 1.13e-12 |
OPRK1 | 0.4602 | 2.418e-16 | 4.88e-12 |
AVPR1A | 0.4463 | 2.348e-15 | 4.74e-11 |
DBX2 | 0.445 | 2.877e-15 | 5.81e-11 |
SYN2 | 0.4436 | 3.609e-15 | 7.28e-11 |
SLC35F1 | 0.4432 | 3.834e-15 | 7.73e-11 |
CECR1 | -0.4367 | 1.066e-14 | 2.15e-10 |
RRM2 | -0.4303 | 2.844e-14 | 5.74e-10 |
SOX17 | 0.4189 | 1.543e-13 | 3.11e-09 |
PATHOLOGY.N.STAGE | Labels | N |
class0 | 127 | |
class1 | 9 | |
Significant markers | N = 8 | |
Higher in class1 | 0 | |
Higher in class0 | 8 |
T(pos if higher in 'class1') | ttestP | Q | AUC | |
---|---|---|---|---|
CARD16 | -6.6 | 3.144e-09 | 6.35e-05 | 0.6745 |
CASP1 | -6.6 | 3.144e-09 | 6.35e-05 | 0.6745 |
ZFP64 | -5.43 | 3.028e-07 | 0.00611 | 0.629 |
TSPO | -5.59 | 4.472e-07 | 0.00903 | 0.6325 |
SFXN5 | -6.12 | 5.343e-07 | 0.0108 | 0.7019 |
VGF | -5.2 | 7.459e-07 | 0.0151 | 0.6527 |
CHCHD7__1 | -5.09 | 1.193e-06 | 0.0241 | 0.776 |
PLAG1__1 | -5.09 | 1.193e-06 | 0.0241 | 0.776 |
PATHOLOGY.M.STAGE | Labels | N |
M0 | 233 | |
M1 | 52 | |
Significant markers | N = 88 | |
Higher in M1 | 80 | |
Higher in M0 | 8 |
T(pos if higher in 'M1') | ttestP | Q | AUC | |
---|---|---|---|---|
C20ORF112 | 7.96 | 3.528e-13 | 7.12e-09 | 0.7739 |
PLCD1 | 6.73 | 2.361e-10 | 4.77e-06 | 0.7195 |
HTR6 | 7.05 | 4.663e-10 | 9.41e-06 | 0.7646 |
SESN1__1 | 6.47 | 1.647e-09 | 3.32e-05 | 0.7147 |
STK24 | 6.52 | 3.172e-09 | 6.4e-05 | 0.754 |
PDGFB | 6.28 | 3.259e-09 | 6.58e-05 | 0.7221 |
ASB4 | 6.45 | 3.305e-09 | 6.67e-05 | 0.7364 |
CSDC2 | 6.59 | 3.618e-09 | 7.3e-05 | 0.7508 |
HAND2__1 | 6.37 | 6.029e-09 | 0.000122 | 0.7162 |
NBLA00301__1 | 6.37 | 6.029e-09 | 0.000122 | 0.7162 |
GENDER | Labels | N |
FEMALE | 96 | |
MALE | 189 | |
Significant markers | N = 90 | |
Higher in MALE | 11 | |
Higher in FEMALE | 79 |
T(pos if higher in 'MALE') | ttestP | Q | AUC | |
---|---|---|---|---|
ALG11__1 | 18.71 | 9.907e-35 | 2e-30 | 0.9804 |
UTP14C | 18.71 | 9.907e-35 | 2e-30 | 0.9804 |
CCBL2 | 13.68 | 3.031e-31 | 6.12e-27 | 0.8779 |
RBMXL1 | 13.68 | 3.031e-31 | 6.12e-27 | 0.8779 |
KIF4B | -11.76 | 5.928e-25 | 1.2e-20 | 0.8729 |
CCDC146__1 | -10.91 | 3.153e-23 | 6.36e-19 | 0.8083 |
C5ORF27 | -10.37 | 1.125e-20 | 2.27e-16 | 0.8165 |
LRRC41 | 10.26 | 1.547e-20 | 3.12e-16 | 0.7615 |
UQCRH | 10.26 | 1.547e-20 | 3.12e-16 | 0.7615 |
DNAJB13 | -10.1 | 1.567e-20 | 3.16e-16 | 0.794 |
No gene related to 'KARNOFSKY.PERFORMANCE.SCORE'.
KARNOFSKY.PERFORMANCE.SCORE | Mean (SD) | 92.5 (8) |
Score | N | |
70 | 1 | |
80 | 3 | |
90 | 12 | |
100 | 12 | |
Significant markers | N = 0 |
-
Expresson data file = KIRC-TP.meth.by_min_clin_corr.data.txt
-
Clinical data file = KIRC-TP.merged_data.txt
-
Number of patients = 285
-
Number of genes = 20183
-
Number of clinical features = 9
For survival clinical features, Wald's test in univariate Cox regression analysis with proportional hazards model (Andersen and Gill 1982) was used to estimate the P values using the 'coxph' function in R. Kaplan-Meier survival curves were plot using the four quartile subgroups of patients based on expression levels
For continuous numerical clinical features, Spearman's rank correlation coefficients (Spearman 1904) and two-tailed P values were estimated using 'cor.test' function in R
For multi-class clinical features (ordinal or nominal), one-way analysis of variance (Howell 2002) was applied to compare the log2-expression levels between different clinical classes using 'anova' function in R
For two-class clinical features, two-tailed Student's t test with unequal variance (Lehmann and Romano 2005) was applied to compare the log2-expression levels between the two clinical classes using 't.test' function in R
For multiple hypothesis correction, Q value is the False Discovery Rate (FDR) analogue of the P value (Benjamini and Hochberg 1995), defined as the minimum FDR at which the test may be called significant. We used the 'Benjamini and Hochberg' method of 'p.adjust' function in R to convert P values into Q values.
In addition to the links below, the full results of the analysis summarized in this report can also be downloaded programmatically using firehose_get, or interactively from either the Broad GDAC website or TCGA Data Coordination Center Portal.