This pipeline computes the correlation between significant copy number variation (cnv focal) genes and selected clinical features.
Testing the association between copy number variation 21 focal events and 3 clinical features across 191 patients, 6 significant findings detected with Q value < 0.25.
-
amp_21q22.2 cnv correlated to 'Time to Death'.
-
del_3p13 cnv correlated to 'Time to Death'.
-
del_3q26.31 cnv correlated to 'AGE'.
-
del_5q31.2 cnv correlated to 'Time to Death'.
-
del_12p13.2 cnv correlated to 'Time to Death'.
-
del_12q21.33 cnv correlated to 'AGE'.
Clinical Features |
Time to Death |
AGE | GENDER | ||
nCNV (%) | nWild-Type | logrank test | t-test | Fisher's exact test | |
amp 21q22 2 | 14 (7%) | 177 |
0.000747 (0.0441) |
0.066 (1.00) |
0.0922 (1.00) |
del 3p13 | 9 (5%) | 182 |
3.8e-05 (0.00228) |
0.16 (1.00) |
0.185 (1.00) |
del 3q26 31 | 3 (2%) | 188 |
0.00241 (0.137) |
0.252 (1.00) |
|
del 5q31 2 | 18 (9%) | 173 |
0.00338 (0.189) |
0.00588 (0.318) |
0.0464 (1.00) |
del 12p13 2 | 10 (5%) | 181 |
0.0011 (0.0641) |
0.582 (1.00) |
0.351 (1.00) |
del 12q21 33 | 3 (2%) | 188 |
8.96e-17 (5.46e-15) |
0.252 (1.00) |
|
amp 1p33 | 7 (4%) | 184 |
0.337 (1.00) |
0.0374 (1.00) |
1 (1.00) |
amp 1q43 | 7 (4%) | 184 |
0.734 (1.00) |
0.0234 (1.00) |
1 (1.00) |
amp 11q23 3 | 17 (9%) | 174 |
0.122 (1.00) |
0.0107 (0.566) |
0.45 (1.00) |
amp 13q31 3 | 7 (4%) | 184 |
0.931 (1.00) |
0.0716 (1.00) |
1 (1.00) |
amp 19p13 2 | 6 (3%) | 185 |
0.978 (1.00) |
0.991 (1.00) |
0.223 (1.00) |
amp 20q11 21 | 3 (2%) | 188 |
0.114 (1.00) |
0.372 (1.00) |
0.252 (1.00) |
del 7p12 1 | 16 (8%) | 175 |
0.0737 (1.00) |
0.211 (1.00) |
0.604 (1.00) |
del 7q32 3 | 23 (12%) | 168 |
0.0282 (1.00) |
0.089 (1.00) |
0.656 (1.00) |
del 7q34 | 24 (13%) | 167 |
0.0705 (1.00) |
0.0814 (1.00) |
0.512 (1.00) |
del 9q21 32 | 5 (3%) | 186 |
0.9 (1.00) |
0.745 (1.00) |
0.378 (1.00) |
del 16q23 1 | 9 (5%) | 182 |
0.124 (1.00) |
0.111 (1.00) |
0.513 (1.00) |
del 17p13 2 | 15 (8%) | 176 |
0.0435 (1.00) |
0.226 (1.00) |
0.0565 (1.00) |
del 17q11 2 | 13 (7%) | 178 |
0.0318 (1.00) |
0.548 (1.00) |
0.775 (1.00) |
del 18p11 21 | 9 (5%) | 182 |
0.00546 (0.3) |
0.176 (1.00) |
0.185 (1.00) |
del 20q13 13 | 4 (2%) | 187 |
0.0388 (1.00) |
0.113 (1.00) |
0.627 (1.00) |
P value = 0.000747 (logrank test), Q value = 0.044
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 168 | 106 | 0.9 - 94.1 (12.0) |
AMP PEAK 7(21Q22.2) MUTATED | 12 | 10 | 1.0 - 24.0 (5.5) |
AMP PEAK 7(21Q22.2) WILD-TYPE | 156 | 96 | 0.9 - 94.1 (12.5) |
P value = 3.8e-05 (logrank test), Q value = 0.0023
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 168 | 106 | 0.9 - 94.1 (12.0) |
DEL PEAK 2(3P13) MUTATED | 8 | 7 | 1.0 - 14.0 (2.0) |
DEL PEAK 2(3P13) WILD-TYPE | 160 | 99 | 0.9 - 94.1 (12.5) |
P value = 0.00241 (t-test), Q value = 0.14
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 191 | 55.2 (16.1) |
DEL PEAK 3(3Q26.31) MUTATED | 3 | 74.0 (3.6) |
DEL PEAK 3(3Q26.31) WILD-TYPE | 188 | 54.9 (16.0) |
P value = 0.00338 (logrank test), Q value = 0.19
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 168 | 106 | 0.9 - 94.1 (12.0) |
DEL PEAK 4(5Q31.2) MUTATED | 17 | 15 | 1.0 - 73.0 (10.0) |
DEL PEAK 4(5Q31.2) WILD-TYPE | 151 | 91 | 0.9 - 94.1 (12.9) |
P value = 0.0011 (logrank test), Q value = 0.064
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 168 | 106 | 0.9 - 94.1 (12.0) |
DEL PEAK 10(12P13.2) MUTATED | 8 | 8 | 1.0 - 22.1 (7.0) |
DEL PEAK 10(12P13.2) WILD-TYPE | 160 | 98 | 0.9 - 94.1 (12.5) |
P value = 8.96e-17 (t-test), Q value = 5.5e-15
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 191 | 55.2 (16.1) |
DEL PEAK 11(12Q21.33) MUTATED | 3 | 72.0 (1.0) |
DEL PEAK 11(12Q21.33) WILD-TYPE | 188 | 55.0 (16.1) |
-
Copy number data file = transformed.cor.cli.txt
-
Clinical data file = LAML-TB.merged_data.txt
-
Number of patients = 191
-
Number of significantly focal cnvs = 21
-
Number of selected clinical features = 3
-
Exclude genes that fewer than K tumors have mutations, K = 3
For survival clinical features, the Kaplan-Meier survival curves of tumors with and without gene mutations were plotted and the statistical significance P values were estimated by logrank test (Bland and Altman 2004) using the 'survdiff' function in R
For continuous numerical clinical features, two-tailed Student's t test with unequal variance (Lehmann and Romano 2005) was applied to compare the clinical values between tumors with and without gene mutations using 't.test' function in R
For binary or multi-class clinical features (nominal or ordinal), two-tailed Fisher's exact tests (Fisher 1922) were used to estimate the P values using the 'fisher.test' function in R
For multiple hypothesis correction, Q value is the False Discovery Rate (FDR) analogue of the P value (Benjamini and Hochberg 1995), defined as the minimum FDR at which the test may be called significant. We used the 'Benjamini and Hochberg' method of 'p.adjust' function in R to convert P values into Q values.
In addition to the links below, the full results of the analysis summarized in this report can also be downloaded programmatically using firehose_get, or interactively from either the Broad GDAC website or TCGA Data Coordination Center Portal.