This pipeline computes the correlation between cancer subtypes identified by different molecular patterns and selected clinical features.
Testing the association between subtypes identified by 4 different clustering approaches and 12 clinical features across 23 patients, 12 significant findings detected with P value < 0.05 and Q value < 0.25.
-
6 subtypes identified in current cancer cohort by 'MIRSEQ CNMF'. These subtypes correlate to 'Time to Death', 'TUMOR_TISSUE_SITE', 'GENDER', and 'HISTOLOGICAL_TYPE'.
-
5 subtypes identified in current cancer cohort by 'MIRSEQ CHIERARCHICAL'. These subtypes correlate to 'TUMOR_TISSUE_SITE', 'GENDER', and 'HISTOLOGICAL_TYPE'.
-
3 subtypes identified in current cancer cohort by 'MIRseq Mature CNMF subtypes'. These subtypes correlate to 'TUMOR_TISSUE_SITE' and 'HISTOLOGICAL_TYPE'.
-
5 subtypes identified in current cancer cohort by 'MIRseq Mature cHierClus subtypes'. These subtypes correlate to 'TUMOR_TISSUE_SITE', 'GENDER', and 'HISTOLOGICAL_TYPE'.
Table 1. Get Full Table Overview of the association between subtypes identified by 4 different clustering approaches and 12 clinical features. Shown in the table are P values (Q values). Thresholded by P value < 0.05 and Q value < 0.25, 12 significant findings detected.
Clinical Features |
Statistical Tests |
MIRSEQ CNMF |
MIRSEQ CHIERARCHICAL |
MIRseq Mature CNMF subtypes |
MIRseq Mature cHierClus subtypes |
Time to Death | logrank test |
0.0268 (0.107) |
0.0672 (0.215) |
0.548 (0.844) |
100 (1.00) |
YEARS TO BIRTH | Kruskal-Wallis (anova) |
0.924 (1.00) |
0.841 (1.00) |
0.118 (0.355) |
0.914 (1.00) |
TUMOR TISSUE SITE | Fisher's exact test |
1e-05 (0.00016) |
1e-05 (0.00016) |
0.00024 (0.00165) |
1e-05 (0.00016) |
PATHOLOGIC STAGE | Fisher's exact test |
0.186 (0.449) |
0.35 (0.668) |
0.215 (0.449) |
0.38 (0.668) |
PATHOLOGY T STAGE | Fisher's exact test |
0.13 (0.368) |
0.403 (0.668) |
0.0664 (0.215) |
0.147 (0.391) |
PATHOLOGY N STAGE | Fisher's exact test |
1 (1.00) |
1 (1.00) |
0.208 (0.449) |
1 (1.00) |
GENDER | Fisher's exact test |
0.00176 (0.00939) |
0.00812 (0.0354) |
0.0642 (0.215) |
0.00087 (0.00522) |
RADIATION THERAPY | Fisher's exact test |
0.458 (0.733) |
0.368 (0.668) |
0.193 (0.449) |
0.261 (0.522) |
HISTOLOGICAL TYPE | Fisher's exact test |
0.00011 (0.00088) |
2e-05 (0.00024) |
0.00407 (0.0195) |
6e-05 (0.000576) |
NUMBER PACK YEARS SMOKED | Kruskal-Wallis (anova) | ||||
NUMBER OF LYMPH NODES | Fisher's exact test |
0.835 (1.00) |
0.728 (1.00) |
0.205 (0.449) |
0.563 (0.844) |
RACE | Fisher's exact test |
0.758 (1.00) |
0.785 (1.00) |
0.596 (0.867) |
0.39 (0.668) |
Table S1. Description of clustering approach #1: 'MIRSEQ CNMF'
Cluster Labels | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
---|---|---|---|---|---|---|---|
Number of samples | 5 | 4 | 3 | 3 | 4 | 1 | 3 |
P value = 0.0268 (logrank test), Q value = 0.11
Table S2. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #1: 'Time to Death'
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 22 | 3 | 2.7 - 40.1 (21.9) |
subtype1 | 5 | 0 | 9.9 - 32.9 (20.1) |
subtype2 | 4 | 0 | 20.6 - 33.1 (29.6) |
subtype3 | 3 | 0 | 23.2 - 35.0 (24.1) |
subtype4 | 3 | 1 | 10.2 - 31.7 (20.2) |
subtype5 | 4 | 0 | 10.4 - 35.9 (22.2) |
subtype7 | 3 | 2 | 2.7 - 40.1 (7.3) |
Figure S1. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #1: 'Time to Death'

P value = 0.924 (Kruskal-Wallis (anova)), Q value = 1
Table S3. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #2: 'YEARS_TO_BIRTH'
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 22 | 64.1 (9.7) |
subtype1 | 5 | 64.8 (12.9) |
subtype2 | 4 | 60.8 (10.1) |
subtype3 | 3 | 68.0 (6.1) |
subtype4 | 3 | 60.7 (14.0) |
subtype5 | 4 | 65.5 (9.3) |
subtype7 | 3 | 65.0 (8.0) |
Figure S2. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #2: 'YEARS_TO_BIRTH'

P value = 1e-05 (Fisher's exact test), Q value = 0.00016
Table S4. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'
nPatients | BLADDER | BREAST | COLON | ENDOMETRIAL | KIDNEY | LUNG |
---|---|---|---|---|---|---|
ALL | 3 | 5 | 4 | 3 | 4 | 3 |
subtype1 | 0 | 0 | 4 | 0 | 0 | 1 |
subtype2 | 0 | 3 | 0 | 0 | 0 | 1 |
subtype3 | 0 | 0 | 0 | 2 | 0 | 1 |
subtype4 | 0 | 2 | 0 | 1 | 0 | 0 |
subtype5 | 0 | 0 | 0 | 0 | 4 | 0 |
subtype7 | 3 | 0 | 0 | 0 | 0 | 0 |
Figure S3. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'

P value = 0.186 (Fisher's exact test), Q value = 0.45
Table S5. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'
nPatients | STAGE I | STAGE IA | STAGE IB | STAGE II | STAGE IIA | STAGE IIB | STAGE III | STAGE IIIA | STAGE IIIC | STAGE IV |
---|---|---|---|---|---|---|---|---|---|---|
ALL | 5 | 1 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | 1 |
subtype1 | 1 | 0 | 0 | 0 | 2 | 1 | 0 | 0 | 1 | 0 |
subtype2 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 1 | 0 | 0 |
subtype3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
subtype4 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
subtype5 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
subtype7 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
Figure S4. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'

P value = 0.13 (Fisher's exact test), Q value = 0.37
Table S6. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'
nPatients | T1 | T2 | T3+T4 |
---|---|---|---|
ALL | 5 | 7 | 7 |
subtype1 | 0 | 1 | 4 |
subtype2 | 1 | 2 | 1 |
subtype3 | 0 | 1 | 0 |
subtype4 | 0 | 2 | 0 |
subtype5 | 3 | 1 | 0 |
subtype7 | 1 | 0 | 2 |
Figure S5. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'

P value = 1 (Fisher's exact test), Q value = 1
Table S7. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'
nPatients | N0 | N1+N2 |
---|---|---|
ALL | 10 | 4 |
subtype1 | 4 | 1 |
subtype2 | 2 | 1 |
subtype3 | 1 | 0 |
subtype4 | 1 | 1 |
subtype5 | 1 | 0 |
subtype7 | 1 | 1 |
Figure S6. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'

P value = 0.00176 (Fisher's exact test), Q value = 0.0094
Table S8. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #7: 'GENDER'
nPatients | FEMALE | MALE |
---|---|---|
ALL | 12 | 10 |
subtype1 | 1 | 4 |
subtype2 | 4 | 0 |
subtype3 | 3 | 0 |
subtype4 | 3 | 0 |
subtype5 | 0 | 4 |
subtype7 | 1 | 2 |
Figure S7. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #7: 'GENDER'

P value = 0.458 (Fisher's exact test), Q value = 0.73
Table S9. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #8: 'RADIATION_THERAPY'
nPatients | NO | YES |
---|---|---|
ALL | 14 | 4 |
subtype1 | 5 | 0 |
subtype2 | 2 | 2 |
subtype3 | 2 | 1 |
subtype4 | 2 | 1 |
subtype5 | 2 | 0 |
subtype7 | 1 | 0 |
Figure S8. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #8: 'RADIATION_THERAPY'

P value = 0.00011 (Fisher's exact test), Q value = 0.00088
Table S10. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'
nPatients | COLON ADENOCARCINOMA | COLON MUCINOUS ADENOCARCINOMA | ENDOMETRIOID ENDOMETRIAL ADENOCARCINOMA | INFILTRATING DUCTAL CARCINOMA | INFILTRATING LOBULAR CARCINOMA | KIDNEY CLEAR CELL RENAL CARCINOMA | LUNG ADENOCARCINOMA- NOT OTHERWISE SPECIFIED (NOS) | LUNG MUCINOUS ADENOCARCINOMA | MIXED SEROUS AND ENDOMETRIOID | MUSCLE INVASIVE UROTHELIAL CARCINOMA (PT2 OR ABOVE) | SEROUS ENDOMETRIAL ADENOCARCINOMA |
---|---|---|---|---|---|---|---|---|---|---|---|
ALL | 2 | 2 | 1 | 4 | 1 | 4 | 2 | 1 | 1 | 2 | 1 |
subtype1 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
subtype2 | 0 | 0 | 0 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 0 |
subtype3 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
subtype4 | 0 | 0 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
subtype5 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 |
subtype7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 |
Figure S9. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'

P value = 0.835 (Fisher's exact test), Q value = 1
Table S11. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'
nPatients | 0 | 2 |
---|---|---|
ALL | 9 | 3 |
subtype1 | 3 | 1 |
subtype2 | 3 | 0 |
subtype3 | 0 | 0 |
subtype4 | 1 | 1 |
subtype5 | 0 | 0 |
subtype7 | 2 | 1 |
Figure S10. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'

P value = 0.758 (Fisher's exact test), Q value = 1
Table S12. Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #12: 'RACE'
nPatients | BLACK OR AFRICAN AMERICAN | WHITE |
---|---|---|
ALL | 2 | 20 |
subtype1 | 1 | 4 |
subtype2 | 0 | 4 |
subtype3 | 0 | 3 |
subtype4 | 1 | 2 |
subtype5 | 0 | 4 |
subtype7 | 0 | 3 |
Figure S11. Get High-res Image Clustering Approach #1: 'MIRSEQ CNMF' versus Clinical Feature #12: 'RACE'

Table S13. Description of clustering approach #2: 'MIRSEQ CHIERARCHICAL'
Cluster Labels | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|
Number of samples | 2 | 4 | 4 | 6 | 4 | 3 |
P value = 0.0672 (logrank test), Q value = 0.22
Table S14. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #1: 'Time to Death'
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 21 | 3 | 2.7 - 40.1 (20.6) |
subtype2 | 4 | 0 | 20.6 - 33.1 (29.6) |
subtype3 | 4 | 0 | 19.7 - 32.9 (20.4) |
subtype4 | 6 | 3 | 2.7 - 40.1 (15.2) |
subtype5 | 4 | 0 | 10.4 - 35.9 (22.2) |
subtype6 | 3 | 0 | 10.2 - 35.0 (24.1) |
Figure S12. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #1: 'Time to Death'

P value = 0.841 (Kruskal-Wallis (anova)), Q value = 1
Table S15. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #2: 'YEARS_TO_BIRTH'
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 21 | 64.0 (9.8) |
subtype2 | 4 | 60.8 (10.1) |
subtype3 | 4 | 65.0 (14.9) |
subtype4 | 6 | 62.8 (10.5) |
subtype5 | 4 | 65.5 (9.3) |
subtype6 | 3 | 67.0 (5.3) |
Figure S13. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #2: 'YEARS_TO_BIRTH'

P value = 1e-05 (Fisher's exact test), Q value = 0.00016
Table S16. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'
nPatients | BLADDER | BREAST | COLON | ENDOMETRIAL | KIDNEY | LUNG |
---|---|---|---|---|---|---|
ALL | 3 | 5 | 4 | 4 | 4 | 1 |
subtype2 | 0 | 3 | 0 | 0 | 0 | 1 |
subtype3 | 0 | 0 | 4 | 0 | 0 | 0 |
subtype4 | 3 | 2 | 0 | 1 | 0 | 0 |
subtype5 | 0 | 0 | 0 | 0 | 4 | 0 |
subtype6 | 0 | 0 | 0 | 3 | 0 | 0 |
Figure S14. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'

P value = 0.35 (Fisher's exact test), Q value = 0.67
Table S17. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'
nPatients | STAGE I | STAGE IA | STAGE II | STAGE IIA | STAGE IIB | STAGE III | STAGE IIIA | STAGE IIIC | STAGE IV |
---|---|---|---|---|---|---|---|---|---|
ALL | 5 | 1 | 1 | 5 | 1 | 1 | 1 | 1 | 1 |
subtype2 | 0 | 1 | 0 | 2 | 0 | 0 | 1 | 0 | 0 |
subtype3 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 0 |
subtype4 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 |
subtype5 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
subtype6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Figure S15. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'

P value = 0.403 (Fisher's exact test), Q value = 0.67
Table S18. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'
nPatients | T1 | T2 | T3+T4 |
---|---|---|---|
ALL | 5 | 6 | 6 |
subtype2 | 1 | 2 | 1 |
subtype3 | 0 | 1 | 3 |
subtype4 | 1 | 2 | 2 |
subtype5 | 3 | 1 | 0 |
subtype6 | 0 | 0 | 0 |
Figure S16. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'

P value = 1 (Fisher's exact test), Q value = 1
Table S19. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'
nPatients | N0 | N1+N2 |
---|---|---|
ALL | 8 | 4 |
subtype2 | 2 | 1 |
subtype3 | 3 | 1 |
subtype4 | 2 | 2 |
subtype5 | 1 | 0 |
subtype6 | 0 | 0 |
Figure S17. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'

P value = 0.00812 (Fisher's exact test), Q value = 0.035
Table S20. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #7: 'GENDER'
nPatients | FEMALE | MALE |
---|---|---|
ALL | 12 | 9 |
subtype2 | 4 | 0 |
subtype3 | 1 | 3 |
subtype4 | 4 | 2 |
subtype5 | 0 | 4 |
subtype6 | 3 | 0 |
Figure S18. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #7: 'GENDER'

P value = 0.368 (Fisher's exact test), Q value = 0.67
Table S21. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #8: 'RADIATION_THERAPY'
nPatients | NO | YES |
---|---|---|
ALL | 12 | 5 |
subtype2 | 2 | 2 |
subtype3 | 4 | 0 |
subtype4 | 3 | 1 |
subtype5 | 2 | 0 |
subtype6 | 1 | 2 |
Figure S19. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #8: 'RADIATION_THERAPY'

P value = 2e-05 (Fisher's exact test), Q value = 0.00024
Table S22. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'
nPatients | COLON ADENOCARCINOMA | COLON MUCINOUS ADENOCARCINOMA | ENDOMETRIOID ENDOMETRIAL ADENOCARCINOMA | INFILTRATING DUCTAL CARCINOMA | INFILTRATING LOBULAR CARCINOMA | KIDNEY CLEAR CELL RENAL CARCINOMA | LUNG ADENOCARCINOMA- NOT OTHERWISE SPECIFIED (NOS) | MIXED SEROUS AND ENDOMETRIOID | MUSCLE INVASIVE UROTHELIAL CARCINOMA (PT2 OR ABOVE) | SEROUS ENDOMETRIAL ADENOCARCINOMA |
---|---|---|---|---|---|---|---|---|---|---|
ALL | 2 | 2 | 1 | 4 | 1 | 4 | 1 | 1 | 2 | 2 |
subtype2 | 0 | 0 | 0 | 2 | 1 | 0 | 1 | 0 | 0 | 0 |
subtype3 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
subtype4 | 0 | 0 | 1 | 2 | 0 | 0 | 0 | 0 | 2 | 0 |
subtype5 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 |
subtype6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 2 |
Figure S20. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'

P value = 0.728 (Fisher's exact test), Q value = 1
Table S23. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'
nPatients | 0 | 2 |
---|---|---|
ALL | 9 | 3 |
subtype2 | 3 | 0 |
subtype3 | 3 | 1 |
subtype4 | 3 | 2 |
subtype5 | 0 | 0 |
subtype6 | 0 | 0 |
Figure S21. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'

P value = 0.785 (Fisher's exact test), Q value = 1
Table S24. Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #12: 'RACE'
nPatients | BLACK OR AFRICAN AMERICAN | WHITE |
---|---|---|
ALL | 3 | 18 |
subtype2 | 0 | 4 |
subtype3 | 1 | 3 |
subtype4 | 1 | 5 |
subtype5 | 0 | 4 |
subtype6 | 1 | 2 |
Figure S22. Get High-res Image Clustering Approach #2: 'MIRSEQ CHIERARCHICAL' versus Clinical Feature #12: 'RACE'

Table S25. Description of clustering approach #3: 'MIRseq Mature CNMF subtypes'
Cluster Labels | 1 | 2 | 3 |
---|---|---|---|
Number of samples | 6 | 9 | 8 |
P value = 0.548 (logrank test), Q value = 0.84
Table S26. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #1: 'Time to Death'
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 23 | 3 | 2.7 - 40.1 (20.6) |
subtype1 | 6 | 0 | 9.9 - 35.9 (19.8) |
subtype2 | 9 | 2 | 2.7 - 33.1 (23.2) |
subtype3 | 8 | 1 | 7.3 - 40.1 (20.4) |
Figure S23. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #1: 'Time to Death'

P value = 0.118 (Kruskal-Wallis (anova)), Q value = 0.35
Table S27. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #2: 'YEARS_TO_BIRTH'
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 23 | 64.3 (9.5) |
subtype1 | 6 | 65.5 (7.3) |
subtype2 | 9 | 59.4 (11.4) |
subtype3 | 8 | 68.9 (6.5) |
Figure S24. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #2: 'YEARS_TO_BIRTH'

P value = 0.00024 (Fisher's exact test), Q value = 0.0016
Table S28. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'
nPatients | BLADDER | BREAST | COLON | ENDOMETRIAL | KIDNEY | LUNG |
---|---|---|---|---|---|---|
ALL | 3 | 5 | 4 | 4 | 4 | 3 |
subtype1 | 0 | 0 | 0 | 0 | 4 | 2 |
subtype2 | 1 | 5 | 1 | 1 | 0 | 1 |
subtype3 | 2 | 0 | 3 | 3 | 0 | 0 |
Figure S25. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'

P value = 0.215 (Fisher's exact test), Q value = 0.45
Table S29. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'
nPatients | STAGE I | STAGE IA | STAGE IB | STAGE II | STAGE IIA | STAGE IIB | STAGE III | STAGE IIIA | STAGE IIIC | STAGE IV |
---|---|---|---|---|---|---|---|---|---|---|
ALL | 5 | 1 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | 1 |
subtype1 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 |
subtype2 | 0 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 1 | 1 |
subtype3 | 2 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0 | 0 |
Figure S26. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'

P value = 0.0664 (Fisher's exact test), Q value = 0.22
Table S30. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'
nPatients | T1 | T2 | T3+T4 |
---|---|---|---|
ALL | 5 | 7 | 7 |
subtype1 | 4 | 1 | 1 |
subtype2 | 0 | 5 | 3 |
subtype3 | 1 | 1 | 3 |
Figure S27. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'

P value = 0.208 (Fisher's exact test), Q value = 0.45
Table S31. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'
nPatients | N0 | N1+N2 |
---|---|---|
ALL | 10 | 4 |
subtype1 | 2 | 0 |
subtype2 | 4 | 4 |
subtype3 | 4 | 0 |
Figure S28. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'

P value = 0.0642 (Fisher's exact test), Q value = 0.22
Table S32. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #7: 'GENDER'
nPatients | FEMALE | MALE |
---|---|---|
ALL | 13 | 10 |
subtype1 | 1 | 5 |
subtype2 | 7 | 2 |
subtype3 | 5 | 3 |
Figure S29. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #7: 'GENDER'

P value = 0.193 (Fisher's exact test), Q value = 0.45
Table S33. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #8: 'RADIATION_THERAPY'
nPatients | NO | YES |
---|---|---|
ALL | 14 | 5 |
subtype1 | 4 | 0 |
subtype2 | 4 | 4 |
subtype3 | 6 | 1 |
Figure S30. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #8: 'RADIATION_THERAPY'

P value = 0.00407 (Fisher's exact test), Q value = 0.02
Table S34. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'
nPatients | COLON ADENOCARCINOMA | COLON MUCINOUS ADENOCARCINOMA | ENDOMETRIOID ENDOMETRIAL ADENOCARCINOMA | INFILTRATING DUCTAL CARCINOMA | INFILTRATING LOBULAR CARCINOMA | KIDNEY CLEAR CELL RENAL CARCINOMA | LUNG ADENOCARCINOMA- NOT OTHERWISE SPECIFIED (NOS) | LUNG MUCINOUS ADENOCARCINOMA | MIXED SEROUS AND ENDOMETRIOID | MUSCLE INVASIVE UROTHELIAL CARCINOMA (PT2 OR ABOVE) | SEROUS ENDOMETRIAL ADENOCARCINOMA |
---|---|---|---|---|---|---|---|---|---|---|---|
ALL | 2 | 2 | 1 | 4 | 1 | 4 | 2 | 1 | 1 | 2 | 2 |
subtype1 | 0 | 0 | 0 | 0 | 0 | 4 | 1 | 1 | 0 | 0 | 0 |
subtype2 | 0 | 1 | 0 | 4 | 1 | 0 | 1 | 0 | 0 | 1 | 1 |
subtype3 | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 |
Figure S31. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'

P value = 0.205 (Fisher's exact test), Q value = 0.45
Table S35. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'
nPatients | 0 | 2 |
---|---|---|
ALL | 9 | 3 |
subtype1 | 0 | 0 |
subtype2 | 4 | 3 |
subtype3 | 5 | 0 |
Figure S32. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'

P value = 0.596 (Fisher's exact test), Q value = 0.87
Table S36. Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #12: 'RACE'
nPatients | BLACK OR AFRICAN AMERICAN | WHITE |
---|---|---|
ALL | 3 | 20 |
subtype1 | 0 | 6 |
subtype2 | 1 | 8 |
subtype3 | 2 | 6 |
Figure S33. Get High-res Image Clustering Approach #3: 'MIRseq Mature CNMF subtypes' versus Clinical Feature #12: 'RACE'

Table S37. Description of clustering approach #4: 'MIRseq Mature cHierClus subtypes'
Cluster Labels | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|
Number of samples | 5 | 6 | 3 | 3 | 4 | 2 |
P value = 100 (logrank test), Q value = 1
Table S38. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #1: 'Time to Death'
nPatients | nDeath | Duration Range (Median), Month | |
---|---|---|---|
ALL | 21 | 2 | 2.7 - 40.1 (23.2) |
subtype1 | 5 | 0 | 9.9 - 32.9 (20.1) |
subtype2 | 6 | 0 | 20.6 - 35.0 (29.6) |
subtype3 | 3 | 1 | 2.7 - 40.1 (31.7) |
subtype4 | 3 | 1 | 10.2 - 24.1 (20.2) |
subtype5 | 4 | 0 | 10.4 - 35.9 (22.2) |
Figure S34. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #1: 'Time to Death'

P value = 0.914 (Kruskal-Wallis (anova)), Q value = 1
Table S39. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #2: 'YEARS_TO_BIRTH'
nPatients | Mean (Std.Dev) | |
---|---|---|
ALL | 21 | 64.2 (9.6) |
subtype1 | 5 | 64.8 (12.9) |
subtype2 | 6 | 62.7 (9.0) |
subtype3 | 3 | 61.3 (13.9) |
subtype4 | 3 | 67.3 (4.7) |
subtype5 | 4 | 65.5 (9.3) |
Figure S35. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #2: 'YEARS_TO_BIRTH'

P value = 1e-05 (Fisher's exact test), Q value = 0.00016
Table S40. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'
nPatients | BLADDER | BREAST | COLON | ENDOMETRIAL | KIDNEY | LUNG |
---|---|---|---|---|---|---|
ALL | 2 | 5 | 4 | 3 | 4 | 3 |
subtype1 | 0 | 0 | 4 | 0 | 0 | 1 |
subtype2 | 0 | 3 | 0 | 1 | 0 | 2 |
subtype3 | 2 | 1 | 0 | 0 | 0 | 0 |
subtype4 | 0 | 1 | 0 | 2 | 0 | 0 |
subtype5 | 0 | 0 | 0 | 0 | 4 | 0 |
Figure S36. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #3: 'TUMOR_TISSUE_SITE'

P value = 0.38 (Fisher's exact test), Q value = 0.67
Table S41. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'
nPatients | STAGE I | STAGE IA | STAGE IB | STAGE II | STAGE IIA | STAGE IIB | STAGE IIIA | STAGE IIIC | STAGE IV |
---|---|---|---|---|---|---|---|---|---|
ALL | 5 | 1 | 1 | 1 | 5 | 2 | 1 | 1 | 1 |
subtype1 | 1 | 0 | 0 | 0 | 2 | 1 | 0 | 1 | 0 |
subtype2 | 0 | 1 | 1 | 0 | 2 | 0 | 1 | 0 | 0 |
subtype3 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 |
subtype4 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
subtype5 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
Figure S37. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #4: 'PATHOLOGIC_STAGE'

P value = 0.147 (Fisher's exact test), Q value = 0.39
Table S42. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'
nPatients | T1 | T2 | T3+T4 |
---|---|---|---|
ALL | 5 | 7 | 6 |
subtype1 | 0 | 1 | 4 |
subtype2 | 1 | 3 | 1 |
subtype3 | 1 | 1 | 1 |
subtype4 | 0 | 1 | 0 |
subtype5 | 3 | 1 | 0 |
Figure S38. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #5: 'PATHOLOGY_T_STAGE'

P value = 1 (Fisher's exact test), Q value = 1
Table S43. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'
nPatients | N0 | N1+N2 |
---|---|---|
ALL | 9 | 4 |
subtype1 | 4 | 1 |
subtype2 | 3 | 1 |
subtype3 | 1 | 1 |
subtype4 | 0 | 1 |
subtype5 | 1 | 0 |
Figure S39. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #6: 'PATHOLOGY_N_STAGE'

P value = 0.00087 (Fisher's exact test), Q value = 0.0052
Table S44. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #7: 'GENDER'
nPatients | FEMALE | MALE |
---|---|---|
ALL | 11 | 10 |
subtype1 | 1 | 4 |
subtype2 | 6 | 0 |
subtype3 | 1 | 2 |
subtype4 | 3 | 0 |
subtype5 | 0 | 4 |
Figure S40. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #7: 'GENDER'

P value = 0.261 (Fisher's exact test), Q value = 0.52
Table S45. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #8: 'RADIATION_THERAPY'
nPatients | NO | YES |
---|---|---|
ALL | 13 | 5 |
subtype1 | 5 | 0 |
subtype2 | 4 | 2 |
subtype3 | 1 | 1 |
subtype4 | 1 | 2 |
subtype5 | 2 | 0 |
Figure S41. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #8: 'RADIATION_THERAPY'

P value = 6e-05 (Fisher's exact test), Q value = 0.00058
Table S46. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'
nPatients | COLON ADENOCARCINOMA | COLON MUCINOUS ADENOCARCINOMA | INFILTRATING DUCTAL CARCINOMA | INFILTRATING LOBULAR CARCINOMA | KIDNEY CLEAR CELL RENAL CARCINOMA | LUNG ADENOCARCINOMA- NOT OTHERWISE SPECIFIED (NOS) | LUNG MUCINOUS ADENOCARCINOMA | MIXED SEROUS AND ENDOMETRIOID | MUSCLE INVASIVE UROTHELIAL CARCINOMA (PT2 OR ABOVE) | SEROUS ENDOMETRIAL ADENOCARCINOMA |
---|---|---|---|---|---|---|---|---|---|---|
ALL | 2 | 2 | 4 | 1 | 4 | 2 | 1 | 1 | 1 | 2 |
subtype1 | 2 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
subtype2 | 0 | 0 | 2 | 1 | 0 | 2 | 0 | 1 | 0 | 0 |
subtype3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
subtype4 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
subtype5 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 |
Figure S42. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #9: 'HISTOLOGICAL_TYPE'

P value = 0.563 (Fisher's exact test), Q value = 0.84
Table S47. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'
nPatients | 0 | 2 |
---|---|---|
ALL | 8 | 3 |
subtype1 | 3 | 1 |
subtype2 | 3 | 0 |
subtype3 | 2 | 1 |
subtype4 | 0 | 1 |
subtype5 | 0 | 0 |
Figure S43. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #11: 'NUMBER_OF_LYMPH_NODES'

P value = 0.39 (Fisher's exact test), Q value = 0.67
Table S48. Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #12: 'RACE'
nPatients | BLACK OR AFRICAN AMERICAN | WHITE |
---|---|---|
ALL | 3 | 18 |
subtype1 | 1 | 4 |
subtype2 | 0 | 6 |
subtype3 | 1 | 2 |
subtype4 | 1 | 2 |
subtype5 | 0 | 4 |
Figure S44. Get High-res Image Clustering Approach #4: 'MIRseq Mature cHierClus subtypes' versus Clinical Feature #12: 'RACE'

-
Cluster data file = /xchip/cga/gdac-prod/tcga-gdac/jobResults/GDAC_mergedClustering/FPPP-TP/22538209/FPPP-TP.mergedcluster.txt
-
Clinical data file = /xchip/cga/gdac-prod/tcga-gdac/jobResults/Append_Data/FPPP-TP/22506420/FPPP-TP.merged_data.txt
-
Number of patients = 23
-
Number of clustering approaches = 4
-
Number of selected clinical features = 12
-
Exclude small clusters that include fewer than K patients, K = 3
For survival clinical features, the Kaplan-Meier survival curves of tumors with and without gene mutations were plotted and the statistical significance P values were estimated by logrank test (Bland and Altman 2004) using the 'survdiff' function in R
For binary clinical features, two-tailed Fisher's exact tests (Fisher 1922) were used to estimate the P values using the 'fisher.test' function in R
For multiple hypothesis correction, Q value is the False Discovery Rate (FDR) analogue of the P value (Benjamini and Hochberg 1995), defined as the minimum FDR at which the test may be called significant. We used the 'Benjamini and Hochberg' method of 'p.adjust' function in R to convert P values into Q values.