This pipeline computes the correlation between significant copy number variation (cnv) genes and selected clinical features.
Testing the association between copy number variation of 17 peak regions and 8 clinical features across 101 patients detected 2 significant findings with Q value < 0.25.
- Del Peak 16(Xq21.31) cnvs correlated to 'Time to Death'.
- Del Peak 17(Xq28) cnvs correlated to 'Time to Death'.
Each cell shows the P value with the Q value in parentheses; the statistical test used for each clinical feature is given in the column header. Rows with fewer than eight values are listed in source order with the trailing cells left blank, because the source does not indicate which clinical feature was not tested for those regions.

CNV region | nCNV (%) | nWild-Type | Time to Death (logrank test) | AGE (t-test) | GENDER (Fisher's exact test) | KARNOFSKY PERFORMANCE SCORE (t-test) | PATHOLOGY T (Fisher's exact test) | PATHOLOGY N (Fisher's exact test) | PATHOLOGICSPREAD(M) (Fisher's exact test) | TUMOR STAGE (Fisher's exact test) |
---|---|---|---|---|---|---|---|---|---|---|
Del Peak 16(Xq21.31) | 15 (15%) | 86 | 0.000103 (0.0128) | 0.209 (1.00) | 0.23 (1.00) | 0.424 (1.00) | 0.0502 (1.00) | 0.312 (1.00) | 0.153 (1.00) | 0.177 (1.00) |
Del Peak 17(Xq28) | 14 (14%) | 87 | 4.26e-05 (0.00536) | 0.283 (1.00) | 0.13 (1.00) | 0.475 (1.00) | 0.358 (1.00) | 0.329 (1.00) | 0.45 (1.00) | 0.371 (1.00) |
Amp Peak 1(19p13.2) | 13 (13%) | 88 | 0.999 (1.00) | 0.0725 (1.00) | 1 (1.00) | 0.144 (1.00) | 0.0877 (1.00) | 1 (1.00) | 1 (1.00) | 0.746 (1.00) |
Del Peak 1(1p36.31) | 21 (21%) | 80 | 0.236 (1.00) | 0.482 (1.00) | 0.599 (1.00) | 0.396 (1.00) | 0.0831 (1.00) | 0.263 (1.00) | 0.0401 (1.00) | 0.0704 (1.00) |
Del Peak 2(2p11.2) | 3 (3%) | 98 | 0.357 (1.00) | 0.717 (1.00) | 1 (1.00) | 0.704 (1.00) | 1 (1.00) | 0.573 (1.00) | | |
Del Peak 3(2q37.3) | 5 (5%) | 96 | 0.652 (1.00) | 0.336 (1.00) | 0.323 (1.00) | 0.01 (1.00) | 0.437 (1.00) | 0.0259 (1.00) | | |
Del Peak 4(3p22.1) | 6 (6%) | 95 | 0.211 (1.00) | 0.0512 (1.00) | 0.662 (1.00) | 0.0787 (1.00) | 0.55 (1.00) | 0.43 (1.00) | 0.0978 (1.00) | |
Del Peak 5(4q32.1) | 12 (12%) | 89 | 0.0661 (1.00) | 0.338 (1.00) | 0.512 (1.00) | 0.197 (1.00) | 0.359 (1.00) | 0.135 (1.00) | 0.173 (1.00) | |
Del Peak 6(5p15.33) | 7 (7%) | 94 | 0.108 (1.00) | 0.366 (1.00) | 0.676 (1.00) | 0.0294 (1.00) | 0.0276 (1.00) | 0.43 (1.00) | 0.188 (1.00) | |
Del Peak 7(5q15) | 8 (8%) | 93 | 0.61 (1.00) | 0.0537 (1.00) | 0.259 (1.00) | 0.0336 (1.00) | 0.138 (1.00) | 0.128 (1.00) | 0.165 (1.00) | |
Del Peak 8(5q35.2) | 8 (8%) | 93 | 0.61 (1.00) | 0.113 (1.00) | 0.259 (1.00) | 0.0169 (1.00) | 0.138 (1.00) | 0.304 (1.00) | 0.0487 (1.00) | |
Del Peak 9(6q22.31) | 12 (12%) | 89 | 0.0444 (1.00) | 0.908 (1.00) | 0.0481 (1.00) | 0.392 (1.00) | 0.026 (1.00) | 0.836 (1.00) | 0.547 (1.00) | 0.168 (1.00) |
Del Peak 10(9p21.3) | 16 (16%) | 85 | 0.00552 (0.679) | 0.327 (1.00) | 0.0373 (1.00) | 0.424 (1.00) | 0.00494 (0.613) | 0.311 (1.00) | 0.0419 (1.00) | 0.0119 (1.00) |
Del Peak 12(14q11.2) | 22 (22%) | 79 | 0.252 (1.00) | 0.855 (1.00) | 0.611 (1.00) | 0.457 (1.00) | 0.218 (1.00) | 0.663 (1.00) | 0.149 (1.00) | 0.0765 (1.00) |
Del Peak 13(14q23.3) | 21 (21%) | 80 | 0.231 (1.00) | 0.445 (1.00) | 0.599 (1.00) | 0.396 (1.00) | 0.169 (1.00) | 0.239 (1.00) | 0.0975 (1.00) | 0.0562 (1.00) |
Del Peak 14(14q32.2) | 21 (21%) | 80 | 0.231 (1.00) | 0.409 (1.00) | 1 (1.00) | 0.396 (1.00) | 0.393 (1.00) | 0.239 (1.00) | 0.0465 (1.00) | 0.0531 (1.00) |
Del Peak 15(19q13.42) | 7 (7%) | 94 | 0.386 (1.00) | 0.897 (1.00) | 1 (1.00) | 0.216 (1.00) | 0.681 (1.00) | 1 (1.00) | 0.239 (1.00) | |

Del Peak 16(Xq21.31) cnvs versus 'Time to Death': P value = 0.000103 (logrank test), Q value = 0.013

Group | nPatients | nDeath | Duration Range (Median), Month |
---|---|---|---|
ALL | 94 | 14 | 0.0 - 182.7 (13.9) |
DEL PEAK 16(XQ21.31) MUTATED | 14 | 5 | 0.0 - 79.8 (9.2) |
DEL PEAK 16(XQ21.31) WILD-TYPE | 80 | 9 | 0.0 - 182.7 (15.7) |

Del Peak 17(Xq28) cnvs versus 'Time to Death': P value = 4.26e-05 (logrank test), Q value = 0.0054

Group | nPatients | nDeath | Duration Range (Median), Month |
---|---|---|---|
ALL | 94 | 14 | 0.0 - 182.7 (13.9) |
DEL PEAK 17(XQ28) MUTATED | 13 | 4 | 0.0 - 22.9 (9.6) |
DEL PEAK 17(XQ28) WILD-TYPE | 81 | 10 | 0.0 - 182.7 (15.9) |

- Copy number data file = All Lesions File (all_lesions.conf_##.txt, where ## is the confidence level). The all lesions file comes from the GISTIC pipeline and summarizes the results of the GISTIC run. It contains data about the significant regions of amplification and deletion, as well as which samples are amplified or deleted in each of these regions. The identified regions are listed down the first column, and the samples are listed across the first row, starting in column 10.
- Clinical data file = KIRP-TP.clin.merged.picked.txt
- Number of patients = 101
- Number of copy number variation regions = 17
- Number of selected clinical features = 8
- Regions altered in fewer than K tumors were excluded, K = 3

For survival clinical features, Kaplan-Meier survival curves of tumors with and without gene cnvs were plotted, and the statistical significance (P value) was estimated by the logrank test (Bland and Altman 2004) using the 'survdiff' function in R.
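
The two-group logrank statistic can be sketched in a few lines. The following is a minimal Python illustration of the standard computation (observed minus expected deaths in the altered group, compared with a chi-square distribution with 1 degree of freedom), not the report's actual code, which used R's 'survdiff'. The function name `logrank_test` and the toy follow-up data are hypothetical.

```python
from math import erfc, sqrt


def logrank_test(times, events, groups):
    """Two-group logrank test.

    times  -- follow-up time per patient
    events -- 1 if the patient died at that time, 0 if censored
    groups -- 1 for tumors with the alteration, 0 for wild-type
    Returns (chi-square statistic, P value with 1 degree of freedom).
    """
    data = list(zip(times, events, groups))
    observed = expected = variance = 0.0
    # Walk through each distinct time at which at least one death occurred.
    for t in sorted({ti for ti, e, _ in data if e}):
        n = sum(1 for ti, _, _ in data if ti >= t)          # at risk, both groups
        n1 = sum(1 for ti, _, g in data if ti >= t and g)   # at risk, altered group
        d = sum(1 for ti, e, _ in data if ti == t and e)    # deaths at t, both groups
        d1 = sum(1 for ti, e, g in data if ti == t and e and g)
        observed += d1
        expected += d * n1 / n
        if n > 1:  # hypergeometric variance; zero when only one patient remains
            variance += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    if variance == 0:
        raise ValueError("logrank statistic is undefined for these data")
    chi2 = (observed - expected) ** 2 / variance
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    return chi2, erfc(sqrt(chi2 / 2))


# Hypothetical toy data: months of follow-up, death indicator, alteration status.
chi2, p = logrank_test([5, 8, 12, 20, 25, 30], [1, 1, 0, 1, 0, 0], [1, 1, 1, 0, 0, 0])
print(f"chi2 = {chi2:.3f}, P = {p:.3f}")
```

The sketch recomputes the risk sets at every event time for readability; for larger cohorts a single sorted pass would be more efficient.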
For continuous numerical clinical features, a two-tailed Student's t test with unequal variance (Lehmann and Romano 2005) was applied to compare the clinical values between tumors with and without gene cnvs using the 't.test' function in R.
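
As an illustration of the unequal-variance (Welch) t test, the Python sketch below computes the t statistic, the Welch-Satterthwaite degrees of freedom, and a two-tailed P value. It is a sketch under stated assumptions rather than the report's code: the P value is obtained by numerically integrating the Student t density with Simpson's rule, whereas R's 't.test' uses its own distribution routines, and the function names and age values are hypothetical.

```python
from math import exp, lgamma, log, log1p, pi, sqrt
from statistics import mean, variance


def two_sided_t_pvalue(t, df, steps=2000):
    """Two-tailed P value of Student's t, by Simpson integration of the density."""
    if t == 0:
        return 1.0
    log_c = lgamma((df + 1) / 2) - lgamma(df / 2) - 0.5 * log(df * pi)

    def pdf(u):
        return exp(log_c - (df + 1) / 2 * log1p(u * u / df))

    upper = abs(t)
    h = upper / steps  # steps must be even for Simpson's rule
    total = pdf(0.0) + pdf(upper)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * pdf(k * h)
    central_area = total * h / 3          # probability mass between 0 and |t|
    return max(0.0, 1.0 - 2.0 * central_area)


def welch_t_test(x, y):
    """Welch's t test; returns (t, degrees of freedom, two-tailed P value)."""
    vx, vy = variance(x) / len(x), variance(y) / len(y)
    t = (mean(x) - mean(y)) / sqrt(vx + vy)
    df = (vx + vy) ** 2 / (vx ** 2 / (len(x) - 1) + vy ** 2 / (len(y) - 1))
    return t, df, two_sided_t_pvalue(t, df)


# Hypothetical ages for tumors with and without a given alteration.
altered = [61, 70, 58, 66, 73]
wild_type = [55, 62, 49, 68, 57, 60, 52]
t, df, p = welch_t_test(altered, wild_type)
print(f"t = {t:.3f}, df = {df:.2f}, P = {p:.3f}")
```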
For binary or multi-class clinical features (nominal or ordinal), two-tailed Fisher's exact tests (Fisher 1922) were used to estimate the P values using the 'fisher.test' function in R.
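
For the binary case, the two-tailed Fisher's exact test reduces to summing hypergeometric probabilities over all 2x2 tables that are no more likely than the observed one. The Python sketch below shows that computation for a single 2x2 table only; the multi-class tables handled by R's 'fisher.test' are outside its scope. The function name `fisher_exact_2x2` and the counts are hypothetical.

```python
from math import comb


def fisher_exact_2x2(a, b, c, d):
    """Two-tailed Fisher's exact test for the table [[a, b], [c, d]].

    Rows could be altered / wild-type tumors and columns the two classes
    of a binary clinical feature. Returns the P value.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell
        # when the row and column totals are fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Sum every table at most as probable as the observed one; the small
    # relative tolerance guards against floating-point ties.
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


# Hypothetical counts: 10 male / 5 female altered, 50 male / 36 female wild-type.
print(f"P = {fisher_exact_2x2(10, 5, 50, 36):.3f}")
```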
For multiple hypothesis correction, the Q value is the False Discovery Rate (FDR) analogue of the P value (Benjamini and Hochberg 1995), defined as the minimum FDR at which the test may be called significant. We used the 'Benjamini and Hochberg' method of the 'p.adjust' function in R to convert P values into Q values.
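
The Benjamini and Hochberg adjustment itself is short enough to sketch. The Python function below follows the same rule as R's `p.adjust(p, method = "BH")`: each P value is scaled by n / rank, and a running minimum taken from the largest P value downward keeps the adjusted values monotone. The function name and the P values are hypothetical; they are not taken from this report, and the sketch is not intended to reproduce the Q values listed in it.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted values in the original input order."""
    n = len(p_values)
    # Indices sorted from the largest P value to the smallest.
    order = sorted(range(n), key=lambda i: p_values[i], reverse=True)
    adjusted = [0.0] * n
    running_min = 1.0  # also caps adjusted values at 1
    for position, i in enumerate(order):
        rank = n - position  # 1-based rank in ascending order
        running_min = min(running_min, p_values[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical P values from four tests.
p_values = [0.01, 0.04, 0.03, 0.005]
for p, q in zip(p_values, benjamini_hochberg(p_values)):
    print(f"P = {p:.3f}  Q = {q:.3f}")
```

A finding would then be called significant when its adjusted value falls below the chosen threshold, which is 0.25 in this report.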
This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.