This pipeline computes the correlation between cancer subtypes identified by different molecular patterns and selected clinical features.
Testing the association between subtypes identified by 7 different clustering approaches and 8 clinical features across 500 patients, 26 significant findings detected with P value < 0.05.
-
CNMF clustering analysis on array-based mRNA expression data identified 3 subtypes that correlate to 'PATHOLOGY.T'.
-
Consensus hierarchical clustering analysis on array-based mRNA expression data identified 3 subtypes that correlate to 'PATHOLOGY.T'.
-
3 subtypes identified in current cancer cohort by 'METHLYATION CNMF'. These subtypes correlate to 'Time to Death', 'GENDER', 'PATHOLOGY.T', 'PATHOLOGY.N', and 'PATHOLOGICSPREAD(M)'.
-
CNMF clustering analysis on sequencing-based mRNA expression data identified 3 subtypes that correlate to 'Time to Death', 'GENDER', 'PATHOLOGY.T', 'PATHOLOGY.N', and 'PATHOLOGICSPREAD(M)'.
-
Consensus hierarchical clustering analysis on sequencing-based mRNA expression data identified 3 subtypes that correlate to 'Time to Death', 'GENDER', 'PATHOLOGY.T', 'PATHOLOGY.N', and 'PATHOLOGICSPREAD(M)'.
-
CNMF clustering analysis on sequencing-based miR expression data identified 3 subtypes that correlate to 'Time to Death', 'GENDER', 'PATHOLOGY.T', 'PATHOLOGY.N', and 'PATHOLOGICSPREAD(M)'.
-
Consensus hierarchical clustering analysis on sequencing-based miR expression data identified 3 subtypes that correlate to 'Time to Death', 'PATHOLOGY.T', 'PATHOLOGY.N', and 'NEOADJUVANT.THERAPY'.
Clinical Features |
Statistical Tests |
mRNA CNMF subtypes |
mRNA cHierClus subtypes |
METHLYATION CNMF |
RNAseq CNMF subtypes |
RNAseq cHierClus subtypes |
MIRseq CNMF subtypes |
MIRseq cHierClus subtypes |
Time to Death | logrank test | 0.0868 | 0.189 | 0.000454 | 1.14e-07 | 2.35e-09 | 3.83e-06 | 0.000319 |
AGE | ANOVA | 0.795 | 0.607 | 0.256 | 0.099 | 0.0589 | 0.0827 | 0.108 |
GENDER | Fisher's exact test | 0.634 | 0.82 | 0.00307 | 2.33e-05 | 0.00775 | 0.0076 | 0.0901 |
KARNOFSKY PERFORMANCE SCORE | ANOVA | 0.709 | 0.338 | 0.201 | 0.926 | |||
PATHOLOGY T | Chi-square test | 0.00911 | 0.00779 | 0.000248 | 4.98e-06 | 3.6e-12 | 0.000289 | 0.000543 |
PATHOLOGY N | Fisher's exact test | 0.0704 | 0.124 | 0.0104 | 0.0102 | 0.00115 | 0.0158 | 0.0333 |
PATHOLOGICSPREAD(M) | Fisher's exact test | 0.12 | 0.0988 | 0.000576 | 0.000382 | 1.75e-06 | 0.00365 | 0.844 |
NEOADJUVANT THERAPY | Fisher's exact test | 0.194 | 0.208 | 1 | 0.37 | 0.186 | 0.54 | 0.00445 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 34 | 24 | 14 |
P value = 0.0868 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 71 | 13 | 0.5 - 101.1 (32.6) |
subtype1 | 33 | 4 | 0.5 - 101.1 (31.0) |
subtype2 | 24 | 8 | 0.5 - 93.3 (36.7) |
subtype3 | 14 | 1 | 1.3 - 84.4 (25.0) |
P value = 0.795 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 71 | 60.5 (12.4) |
subtype1 | 33 | 60.2 (13.8) |
subtype2 | 24 | 59.9 (11.1) |
subtype3 | 14 | 62.6 (11.3) |
P value = 0.634 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 43 | 29 |
subtype1 | 19 | 15 |
subtype2 | 14 | 10 |
subtype3 | 10 | 4 |
P value = 0.00911 (Chi-square test)
nPatients | T1 | T2 | T3 |
---|---|---|---|
ALL | 41 | 14 | 17 |
subtype1 | 23 | 4 | 7 |
subtype2 | 10 | 4 | 10 |
subtype3 | 8 | 6 | 0 |
P value = 0.0704 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 35 | 3 |
subtype1 | 18 | 0 |
subtype2 | 10 | 3 |
subtype3 | 7 | 0 |
P value = 0.12 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 67 | 5 |
subtype1 | 33 | 1 |
subtype2 | 20 | 4 |
subtype3 | 14 | 0 |
P value = 0.194 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 71 | 1 |
subtype1 | 34 | 0 |
subtype2 | 24 | 0 |
subtype3 | 13 | 1 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 15 | 23 | 34 |
P value = 0.189 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 71 | 13 | 0.5 - 101.1 (32.6) |
subtype1 | 15 | 2 | 1.3 - 84.4 (24.2) |
subtype2 | 23 | 7 | 0.5 - 93.3 (36.8) |
subtype3 | 33 | 4 | 0.5 - 101.1 (31.0) |
P value = 0.607 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 71 | 60.5 (12.4) |
subtype1 | 15 | 63.2 (11.2) |
subtype2 | 23 | 59.1 (10.7) |
subtype3 | 33 | 60.4 (14.0) |
P value = 0.82 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 43 | 29 |
subtype1 | 10 | 5 |
subtype2 | 14 | 9 |
subtype3 | 19 | 15 |
P value = 0.00779 (Chi-square test)
nPatients | T1 | T2 | T3 |
---|---|---|---|
ALL | 41 | 14 | 17 |
subtype1 | 9 | 6 | 0 |
subtype2 | 9 | 4 | 10 |
subtype3 | 23 | 4 | 7 |
P value = 0.124 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 35 | 3 |
subtype1 | 7 | 0 |
subtype2 | 11 | 3 |
subtype3 | 17 | 0 |
P value = 0.0988 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 67 | 5 |
subtype1 | 15 | 0 |
subtype2 | 19 | 4 |
subtype3 | 33 | 1 |
P value = 0.208 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 71 | 1 |
subtype1 | 14 | 1 |
subtype2 | 23 | 0 |
subtype3 | 34 | 0 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 77 | 91 | 51 |
P value = 0.000454 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 218 | 65 | 0.1 - 111.0 (39.7) |
subtype1 | 77 | 32 | 0.5 - 90.3 (36.3) |
subtype2 | 90 | 16 | 1.3 - 111.0 (45.3) |
subtype3 | 51 | 17 | 0.1 - 101.1 (42.4) |
P value = 0.256 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 218 | 59.5 (12.5) |
subtype1 | 76 | 61.3 (12.0) |
subtype2 | 91 | 59.0 (13.4) |
subtype3 | 51 | 57.7 (11.5) |
P value = 0.00307 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 142 | 77 |
subtype1 | 57 | 20 |
subtype2 | 47 | 44 |
subtype3 | 38 | 13 |
P value = 0.000248 (Chi-square test)
nPatients | T1 | T2 | T3 | T4 |
---|---|---|---|---|
ALL | 117 | 29 | 70 | 3 |
subtype1 | 30 | 8 | 36 | 3 |
subtype2 | 63 | 12 | 16 | 0 |
subtype3 | 24 | 9 | 18 | 0 |
P value = 0.0104 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 106 | 9 |
subtype1 | 35 | 7 |
subtype2 | 41 | 0 |
subtype3 | 30 | 2 |
P value = 0.000576 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 193 | 26 |
subtype1 | 59 | 18 |
subtype2 | 87 | 4 |
subtype3 | 47 | 4 |
P value = 1 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 217 | 2 |
subtype1 | 76 | 1 |
subtype2 | 90 | 1 |
subtype3 | 51 | 0 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 200 | 102 | 167 |
P value = 1.14e-07 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 467 | 155 | 0.1 - 111.0 (34.3) |
subtype1 | 199 | 45 | 0.1 - 111.0 (37.5) |
subtype2 | 101 | 31 | 0.1 - 93.3 (35.2) |
subtype3 | 167 | 79 | 0.1 - 90.3 (29.1) |
P value = 0.099 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 468 | 60.6 (12.2) |
subtype1 | 199 | 61.8 (12.4) |
subtype2 | 102 | 58.7 (12.3) |
subtype3 | 167 | 60.3 (11.8) |
P value = 2.33e-05 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 307 | 162 |
subtype1 | 109 | 91 |
subtype2 | 69 | 33 |
subtype3 | 129 | 38 |
P value = 0.709 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 31 | 91.0 (18.7) |
subtype1 | 14 | 92.1 (8.9) |
subtype2 | 7 | 94.3 (7.9) |
subtype3 | 10 | 87.0 (31.3) |
P value = 4.98e-06 (Chi-square test)
nPatients | T1 | T2 | T3 | T4 |
---|---|---|---|---|
ALL | 228 | 59 | 171 | 11 |
subtype1 | 117 | 24 | 57 | 2 |
subtype2 | 59 | 11 | 29 | 3 |
subtype3 | 52 | 24 | 85 | 6 |
P value = 0.0102 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 223 | 17 |
subtype1 | 96 | 2 |
subtype2 | 48 | 3 |
subtype3 | 79 | 12 |
P value = 0.000382 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 392 | 77 |
subtype1 | 178 | 22 |
subtype2 | 90 | 12 |
subtype3 | 124 | 43 |
P value = 0.37 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 464 | 5 |
subtype1 | 199 | 1 |
subtype2 | 100 | 2 |
subtype3 | 165 | 2 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 70 | 187 | 212 |
P value = 2.35e-09 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 467 | 155 | 0.1 - 111.0 (34.3) |
subtype1 | 69 | 13 | 0.2 - 92.0 (34.3) |
subtype2 | 187 | 92 | 0.1 - 93.3 (29.1) |
subtype3 | 211 | 50 | 0.1 - 111.0 (37.5) |
P value = 0.0589 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 468 | 60.6 (12.2) |
subtype1 | 70 | 57.4 (12.9) |
subtype2 | 187 | 61.2 (11.6) |
subtype3 | 211 | 61.1 (12.3) |
P value = 0.00775 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 307 | 162 |
subtype1 | 48 | 22 |
subtype2 | 136 | 51 |
subtype3 | 123 | 89 |
P value = 0.338 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 31 | 91.0 (18.7) |
subtype1 | 11 | 95.5 (9.3) |
subtype2 | 9 | 83.3 (32.4) |
subtype3 | 11 | 92.7 (6.5) |
P value = 3.6e-12 (Chi-square test)
nPatients | T1 | T2 | T3 | T4 |
---|---|---|---|---|
ALL | 228 | 59 | 171 | 11 |
subtype1 | 54 | 6 | 9 | 1 |
subtype2 | 53 | 26 | 100 | 8 |
subtype3 | 121 | 27 | 62 | 2 |
P value = 0.00115 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 223 | 17 |
subtype1 | 28 | 2 |
subtype2 | 92 | 14 |
subtype3 | 103 | 1 |
P value = 1.75e-06 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 392 | 77 |
subtype1 | 67 | 3 |
subtype2 | 137 | 50 |
subtype3 | 188 | 24 |
P value = 0.186 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 464 | 5 |
subtype1 | 68 | 2 |
subtype2 | 185 | 2 |
subtype3 | 211 | 1 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 195 | 102 | 166 |
P value = 3.83e-06 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 461 | 149 | 0.1 - 111.0 (33.5) |
subtype1 | 195 | 44 | 0.1 - 111.0 (34.6) |
subtype2 | 102 | 30 | 0.1 - 109.9 (37.2) |
subtype3 | 164 | 75 | 0.2 - 93.3 (29.5) |
P value = 0.0827 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 463 | 60.7 (12.2) |
subtype1 | 195 | 62.1 (12.2) |
subtype2 | 102 | 58.9 (12.0) |
subtype3 | 166 | 60.3 (12.1) |
P value = 0.0076 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 304 | 159 |
subtype1 | 113 | 82 |
subtype2 | 69 | 33 |
subtype3 | 122 | 44 |
P value = 0.201 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 35 | 88.0 (23.2) |
subtype1 | 15 | 92.0 (8.6) |
subtype2 | 8 | 95.0 (7.6) |
subtype3 | 12 | 78.3 (37.1) |
P value = 0.000289 (Chi-square test)
nPatients | T1 | T2 | T3 | T4 |
---|---|---|---|---|
ALL | 225 | 59 | 169 | 10 |
subtype1 | 106 | 26 | 61 | 2 |
subtype2 | 61 | 6 | 31 | 4 |
subtype3 | 58 | 27 | 77 | 4 |
P value = 0.0158 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 213 | 16 |
subtype1 | 91 | 2 |
subtype2 | 47 | 3 |
subtype3 | 75 | 11 |
P value = 0.00365 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 391 | 72 |
subtype1 | 170 | 25 |
subtype2 | 93 | 9 |
subtype3 | 128 | 38 |
P value = 0.54 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 458 | 5 |
subtype1 | 194 | 1 |
subtype2 | 101 | 1 |
subtype3 | 163 | 3 |
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 46 | 154 | 263 |
P value = 0.000319 (logrank test)
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 461 | 149 | 0.1 - 111.0 (33.5) |
subtype1 | 45 | 15 | 0.6 - 93.3 (36.4) |
subtype2 | 154 | 67 | 0.1 - 109.9 (30.9) |
subtype3 | 262 | 67 | 0.1 - 111.0 (34.5) |
P value = 0.108 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 463 | 60.7 (12.2) |
subtype1 | 46 | 57.5 (11.8) |
subtype2 | 154 | 60.4 (12.8) |
subtype3 | 263 | 61.5 (11.8) |
P value = 0.0901 (Fisher's exact test)
nPatients | FEMALE | MALE |
---|---|---|
ALL | 304 | 159 |
subtype1 | 31 | 15 |
subtype2 | 111 | 43 |
subtype3 | 162 | 101 |
P value = 0.926 (ANOVA)
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 35 | 88.0 (23.2) |
subtype1 | 1 | 90.0 (NA) |
subtype2 | 13 | 88.5 (27.3) |
subtype3 | 21 | 87.6 (21.7) |
P value = 0.000543 (Chi-square test)
nPatients | T1 | T2 | T3 | T4 |
---|---|---|---|---|
ALL | 225 | 59 | 169 | 10 |
subtype1 | 21 | 11 | 13 | 1 |
subtype2 | 60 | 15 | 72 | 7 |
subtype3 | 144 | 33 | 84 | 2 |
P value = 0.0333 (Fisher's exact test)
nPatients | 0 | 1 |
---|---|---|
ALL | 213 | 16 |
subtype1 | 21 | 3 |
subtype2 | 71 | 9 |
subtype3 | 121 | 4 |
P value = 0.844 (Fisher's exact test)
nPatients | M0 | M1 |
---|---|---|
ALL | 391 | 72 |
subtype1 | 39 | 7 |
subtype2 | 128 | 26 |
subtype3 | 224 | 39 |
P value = 0.00445 (Fisher's exact test)
nPatients | NO | YES |
---|---|---|
ALL | 458 | 5 |
subtype1 | 43 | 3 |
subtype2 | 154 | 0 |
subtype3 | 261 | 2 |
-
Cluster data file = KIRC.mergedcluster.txt
-
Clinical data file = KIRC.clin.merged.picked.txt
-
Number of patients = 500
-
Number of clustering approaches = 7
-
Number of selected clinical features = 8
-
Exclude small clusters that include fewer than K patients, K = 3
consensus non-negative matrix factorization clustering approach (Brunet et al. 2004)
Resampling-based clustering method (Monti et al. 2003)
For survival clinical features, the Kaplan-Meier survival curves of tumors with and without gene mutations were plotted and the statistical significance P values were estimated by logrank test (Bland and Altman 2004) using the 'survdiff' function in R
For continuous numerical clinical features, one-way analysis of variance (Howell 2002) was applied to compare the clinical values between tumor subtypes using 'anova' function in R
For binary clinical features, two-tailed Fisher's exact tests (Fisher 1922) were used to estimate the P values using the 'fisher.test' function in R
For multi-class clinical features (nominal or ordinal), Chi-square tests (Greenwood and Nikulin 1996) were used to estimate the P values using the 'chisq.test' function in R
This is an experimental feature. Location of data archives could not be determined.