Correlation between copy number variations of arm-level result and selected clinical features
Glioblastoma Multiforme (Primary solid tumor)
22 February 2013  |  analyses__2013_02_22
Maintainer Information
Citation Information
Maintained by TCGA GDAC Team (Broad Institute/MD Anderson Cancer Center/Harvard Medical School)
Cite as Broad Institute TCGA Genome Data Analysis Center (2013): Correlation between copy number variations of arm-level result and selected clinical features. Broad Institute of MIT and Harvard. doi:10.7908/C1BP010G
Overview
Introduction

This pipeline computes the correlation between significant arm-level copy number variations (cnvs) and selected clinical features.

Summary

Testing the association between subtypes identified by 79 different clustering approaches and 5 clinical features across 544 patients, 10 significant findings detected with Q value < 0.25.

  • 2 subtypes identified in current cancer cohort by '1p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '1q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '2p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '2q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '3p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '3q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '4p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '4q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '5p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '5q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '6p gain mutation analysis'. These subtypes correlate to 'KARNOFSKY.PERFORMANCE.SCORE'.

  • 2 subtypes identified in current cancer cohort by '6q gain mutation analysis'. These subtypes correlate to 'KARNOFSKY.PERFORMANCE.SCORE'.

  • 2 subtypes identified in current cancer cohort by '7p gain mutation analysis'. These subtypes correlate to 'AGE'.

  • 2 subtypes identified in current cancer cohort by '7q gain mutation analysis'. These subtypes correlate to 'AGE'.

  • 2 subtypes identified in current cancer cohort by '8p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '8q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '9p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '9q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '10p gain mutation analysis'. These subtypes correlate to 'AGE' and 'KARNOFSKY.PERFORMANCE.SCORE'.

  • 2 subtypes identified in current cancer cohort by '11p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '11q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '12p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '12q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '13q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '14q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '15q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '16p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '16q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '17p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '17q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '18p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '18q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '19p gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '19q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '20p gain mutation analysis'. These subtypes correlate to 'AGE'.

  • 2 subtypes identified in current cancer cohort by '20q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '21q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '22q gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by 'Xq gain mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '1p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '1q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '2p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '2q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '3p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '3q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '4p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '4q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '5p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '5q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '6p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '6q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '7p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '7q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '8p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '8q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '9p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '9q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '10p loss mutation analysis'. These subtypes correlate to 'Time to Death' and 'AGE'.

  • 2 subtypes identified in current cancer cohort by '10q loss mutation analysis'. These subtypes correlate to 'AGE'.

  • 2 subtypes identified in current cancer cohort by '11p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '11q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '12p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '12q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '13q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '14q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '15q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '16p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '16q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '17p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '17q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '18p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '18q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '19p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '19q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '20p loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '20q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '21q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by '22q loss mutation analysis'. These subtypes do not correlate to any clinical features.

  • 2 subtypes identified in current cancer cohort by 'Xq loss mutation analysis'. These subtypes do not correlate to any clinical features.

Results
Overview of the results

Table 1.  Get Full Table Overview of the association between subtypes identified by 79 different clustering approaches and 5 clinical features. Shown in the table are P values (Q values). Thresholded by Q value < 0.25, 10 significant findings detected.

Clinical
Features
Time
to
Death
AGE GENDER KARNOFSKY
PERFORMANCE
SCORE
RADIATIONS
RADIATION
REGIMENINDICATION
Statistical Tests logrank test t-test Fisher's exact test t-test Fisher's exact test
1p gain 0.607
(1.00)
0.617
(1.00)
0.606
(1.00)
0.868
(1.00)
0.476
(1.00)
1q gain 0.987
(1.00)
0.503
(1.00)
1
(1.00)
0.979
(1.00)
0.612
(1.00)
2p gain 0.577
(1.00)
0.196
(1.00)
0.219
(1.00)
0.301
(1.00)
0.448
(1.00)
2q gain 0.772
(1.00)
0.501
(1.00)
0.44
(1.00)
0.737
(1.00)
0.29
(1.00)
3p gain 0.124
(1.00)
0.532
(1.00)
1
(1.00)
0.837
(1.00)
0.322
(1.00)
3q gain 0.126
(1.00)
0.437
(1.00)
1
(1.00)
0.892
(1.00)
0.322
(1.00)
4p gain 0.0811
(1.00)
0.0695
(1.00)
0.791
(1.00)
0.127
(1.00)
0.407
(1.00)
4q gain 0.031
(1.00)
0.342
(1.00)
0.789
(1.00)
0.146
(1.00)
0.245
(1.00)
5p gain 0.916
(1.00)
0.206
(1.00)
0.697
(1.00)
0.0435
(1.00)
0.144
(1.00)
5q gain 0.711
(1.00)
0.52
(1.00)
0.497
(1.00)
0.301
(1.00)
0.238
(1.00)
6p gain 0.1
(1.00)
0.0989
(1.00)
1
(1.00)
0.000254
(0.098)
0.438
(1.00)
6q gain 0.134
(1.00)
0.0449
(1.00)
0.745
(1.00)
0.000254
(0.098)
0.725
(1.00)
7p gain 0.00488
(1.00)
5.79e-06
(0.00226)
1
(1.00)
0.53
(1.00)
0.0976
(1.00)
7q gain 0.00601
(1.00)
7.78e-06
(0.00303)
0.594
(1.00)
0.584
(1.00)
0.148
(1.00)
8p gain 0.765
(1.00)
0.553
(1.00)
0.539
(1.00)
0.402
(1.00)
0.831
(1.00)
8q gain 0.833
(1.00)
0.205
(1.00)
0.345
(1.00)
0.389
(1.00)
1
(1.00)
9p gain 0.24
(1.00)
0.0378
(1.00)
0.314
(1.00)
0.00622
(1.00)
0.6
(1.00)
9q gain 0.0339
(1.00)
0.0482
(1.00)
0.0612
(1.00)
0.0256
(1.00)
0.439
(1.00)
10p gain 0.00302
(1.00)
9.55e-05
(0.0371)
0.0296
(1.00)
0.000254
(0.098)
1
(1.00)
11p gain 0.568
(1.00)
0.383
(1.00)
0.65
(1.00)
0.841
(1.00)
1
(1.00)
11q gain 0.152
(1.00)
0.211
(1.00)
0.254
(1.00)
0.0913
(1.00)
0.438
(1.00)
12p gain 0.928
(1.00)
0.15
(1.00)
1
(1.00)
0.93
(1.00)
0.6
(1.00)
12q gain 0.498
(1.00)
0.667
(1.00)
0.445
(1.00)
0.554
(1.00)
0.322
(1.00)
13q gain 0.991
(1.00)
0.104
(1.00)
0.0612
(1.00)
0.0329
(1.00)
14q gain 0.0771
(1.00)
0.816
(1.00)
0.685
(1.00)
0.514
(1.00)
1
(1.00)
15q gain 0.427
(1.00)
0.19
(1.00)
1
(1.00)
0.292
(1.00)
0.659
(1.00)
16p gain 0.044
(1.00)
0.0244
(1.00)
0.13
(1.00)
0.71
(1.00)
0.11
(1.00)
16q gain 0.0449
(1.00)
0.0676
(1.00)
0.0422
(1.00)
0.971
(1.00)
0.291
(1.00)
17p gain 0.0168
(1.00)
0.0354
(1.00)
0.747
(1.00)
0.0913
(1.00)
1
(1.00)
17q gain 0.0339
(1.00)
0.123
(1.00)
1
(1.00)
0.0318
(1.00)
0.801
(1.00)
18p gain 0.848
(1.00)
0.929
(1.00)
1
(1.00)
0.0964
(1.00)
0.831
(1.00)
18q gain 0.82
(1.00)
0.938
(1.00)
0.697
(1.00)
0.301
(1.00)
0.836
(1.00)
19p gain 0.197
(1.00)
0.452
(1.00)
0.85
(1.00)
0.795
(1.00)
0.277
(1.00)
19q gain 0.141
(1.00)
0.246
(1.00)
0.694
(1.00)
0.773
(1.00)
0.837
(1.00)
20p gain 0.888
(1.00)
0.000315
(0.121)
0.707
(1.00)
0.371
(1.00)
0.279
(1.00)
20q gain 0.651
(1.00)
0.000657
(0.251)
0.573
(1.00)
0.406
(1.00)
0.167
(1.00)
21q gain 0.137
(1.00)
0.856
(1.00)
1
(1.00)
0.634
(1.00)
0.439
(1.00)
22q gain 0.97
(1.00)
0.391
(1.00)
1
(1.00)
0.968
(1.00)
0.516
(1.00)
Xq gain 0.128
(1.00)
0.593
(1.00)
1
(1.00)
1
(1.00)
1p loss 0.0316
(1.00)
0.707
(1.00)
0.653
(1.00)
0.868
(1.00)
1
(1.00)
1q loss 0.721
(1.00)
0.0658
(1.00)
0.65
(1.00)
0.0873
(1.00)
1
(1.00)
2p loss 0.179
(1.00)
0.516
(1.00)
0.204
(1.00)
0.76
(1.00)
1
(1.00)
2q loss 0.235
(1.00)
0.743
(1.00)
0.328
(1.00)
0.76
(1.00)
0.725
(1.00)
3p loss 0.293
(1.00)
0.0224
(1.00)
1
(1.00)
0.846
(1.00)
0.0184
(1.00)
3q loss 0.774
(1.00)
0.231
(1.00)
0.44
(1.00)
0.708
(1.00)
0.29
(1.00)
4p loss 0.14
(1.00)
0.297
(1.00)
0.142
(1.00)
0.825
(1.00)
0.083
(1.00)
4q loss 0.21
(1.00)
0.229
(1.00)
0.67
(1.00)
0.167
(1.00)
0.51
(1.00)
5p loss 0.0545
(1.00)
0.277
(1.00)
0.821
(1.00)
0.503
(1.00)
0.238
(1.00)
5q loss 0.0597
(1.00)
0.294
(1.00)
0.645
(1.00)
0.786
(1.00)
0.33
(1.00)
6p loss 0.008
(1.00)
0.761
(1.00)
0.878
(1.00)
0.711
(1.00)
0.747
(1.00)
6q loss 0.277
(1.00)
0.656
(1.00)
0.534
(1.00)
0.581
(1.00)
0.897
(1.00)
7p loss 0.514
(1.00)
0.658
(1.00)
1
(1.00)
0.61
(1.00)
1
(1.00)
7q loss 0.617
(1.00)
0.362
(1.00)
1
(1.00)
1
(1.00)
8p loss 0.972
(1.00)
0.422
(1.00)
0.209
(1.00)
0.541
(1.00)
0.0233
(1.00)
8q loss 0.995
(1.00)
0.435
(1.00)
0.497
(1.00)
0.133
(1.00)
0.00375
(1.00)
9p loss 0.748
(1.00)
0.41
(1.00)
0.773
(1.00)
0.427
(1.00)
0.42
(1.00)
9q loss 0.396
(1.00)
0.575
(1.00)
0.0139
(1.00)
0.384
(1.00)
0.776
(1.00)
10p loss 0.000152
(0.0586)
1.26e-09
(4.92e-07)
0.395
(1.00)
0.358
(1.00)
0.316
(1.00)
10q loss 0.00068
(0.259)
2.4e-07
(9.39e-05)
0.58
(1.00)
0.0968
(1.00)
0.562
(1.00)
11p loss 0.312
(1.00)
0.222
(1.00)
0.221
(1.00)
0.89
(1.00)
0.254
(1.00)
11q loss 0.96
(1.00)
0.814
(1.00)
0.772
(1.00)
0.934
(1.00)
0.0146
(1.00)
12p loss 0.598
(1.00)
0.913
(1.00)
0.864
(1.00)
0.89
(1.00)
0.586
(1.00)
12q loss 0.578
(1.00)
0.992
(1.00)
1
(1.00)
0.61
(1.00)
0.706
(1.00)
13q loss 0.978
(1.00)
0.39
(1.00)
0.417
(1.00)
0.538
(1.00)
0.749
(1.00)
14q loss 0.995
(1.00)
0.891
(1.00)
0.752
(1.00)
0.769
(1.00)
1
(1.00)
15q loss 0.433
(1.00)
0.181
(1.00)
0.132
(1.00)
0.486
(1.00)
0.25
(1.00)
16p loss 0.155
(1.00)
0.119
(1.00)
0.514
(1.00)
0.0929
(1.00)
1
(1.00)
16q loss 0.0807
(1.00)
0.927
(1.00)
1
(1.00)
0.204
(1.00)
1
(1.00)
17p loss 0.466
(1.00)
0.285
(1.00)
0.722
(1.00)
0.643
(1.00)
1
(1.00)
17q loss 0.936
(1.00)
0.315
(1.00)
0.815
(1.00)
0.62
(1.00)
0.626
(1.00)
18p loss 0.302
(1.00)
0.768
(1.00)
0.871
(1.00)
0.625
(1.00)
0.612
(1.00)
18q loss 0.384
(1.00)
0.432
(1.00)
0.497
(1.00)
0.349
(1.00)
0.589
(1.00)
19p loss 0.635
(1.00)
0.187
(1.00)
0.775
(1.00)
0.58
(1.00)
1
(1.00)
19q loss 0.194
(1.00)
0.667
(1.00)
1
(1.00)
0.276
(1.00)
0.801
(1.00)
20p loss 0.456
(1.00)
0.14
(1.00)
0.357
(1.00)
0.7
(1.00)
0.752
(1.00)
20q loss 0.473
(1.00)
0.468
(1.00)
0.527
(1.00)
0.612
(1.00)
0.304
(1.00)
21q loss 0.849
(1.00)
0.235
(1.00)
0.212
(1.00)
0.629
(1.00)
0.666
(1.00)
22q loss 0.647
(1.00)
0.0288
(1.00)
0.764
(1.00)
0.613
(1.00)
1
(1.00)
Xq loss 0.23
(1.00)
0.0255
(1.00)
0.759
(1.00)
0.914
(1.00)
0.516
(1.00)
Clustering Approach #1: '1p gain mutation analysis'

Table S1.  Get Full Table Description of clustering approach #1: '1p gain mutation analysis'

Cluster Labels 1P GAIN MUTATED 1P GAIN WILD-TYPE
Number of samples 38 506
Clustering Approach #2: '1q gain mutation analysis'

Table S2.  Get Full Table Description of clustering approach #2: '1q gain mutation analysis'

Cluster Labels 1Q GAIN MUTATED 1Q GAIN WILD-TYPE
Number of samples 43 501
Clustering Approach #3: '2p gain mutation analysis'

Table S3.  Get Full Table Description of clustering approach #3: '2p gain mutation analysis'

Cluster Labels 2P GAIN MUTATED 2P GAIN WILD-TYPE
Number of samples 18 526
Clustering Approach #4: '2q gain mutation analysis'

Table S4.  Get Full Table Description of clustering approach #4: '2q gain mutation analysis'

Cluster Labels 2Q GAIN MUTATED 2Q GAIN WILD-TYPE
Number of samples 16 528
Clustering Approach #5: '3p gain mutation analysis'

Table S5.  Get Full Table Description of clustering approach #5: '3p gain mutation analysis'

Cluster Labels 3P GAIN MUTATED 3P GAIN WILD-TYPE
Number of samples 30 514
Clustering Approach #6: '3q gain mutation analysis'

Table S6.  Get Full Table Description of clustering approach #6: '3q gain mutation analysis'

Cluster Labels 3Q GAIN MUTATED 3Q GAIN WILD-TYPE
Number of samples 30 514
Clustering Approach #7: '4p gain mutation analysis'

Table S7.  Get Full Table Description of clustering approach #7: '4p gain mutation analysis'

Cluster Labels 4P GAIN MUTATED 4P GAIN WILD-TYPE
Number of samples 15 529
Clustering Approach #8: '4q gain mutation analysis'

Table S8.  Get Full Table Description of clustering approach #8: '4q gain mutation analysis'

Cluster Labels 4Q GAIN MUTATED 4Q GAIN WILD-TYPE
Number of samples 14 530
Clustering Approach #9: '5p gain mutation analysis'

Table S9.  Get Full Table Description of clustering approach #9: '5p gain mutation analysis'

Cluster Labels 5P GAIN MUTATED 5P GAIN WILD-TYPE
Number of samples 28 516
Clustering Approach #10: '5q gain mutation analysis'

Table S10.  Get Full Table Description of clustering approach #10: '5q gain mutation analysis'

Cluster Labels 5Q GAIN MUTATED 5Q GAIN WILD-TYPE
Number of samples 21 523
Clustering Approach #11: '6p gain mutation analysis'

Table S11.  Get Full Table Description of clustering approach #11: '6p gain mutation analysis'

Cluster Labels 6P GAIN MUTATED 6P GAIN WILD-TYPE
Number of samples 7 537
'6p gain mutation analysis' versus 'KARNOFSKY.PERFORMANCE.SCORE'

P value = 0.000254 (t-test), Q value = 0.098

Table S12.  Clustering Approach #11: '6p gain mutation analysis' versus Clinical Feature #4: 'KARNOFSKY.PERFORMANCE.SCORE'

nPatients Mean (Std.Dev)
ALL 409 77.3 (14.7)
6P GAIN MUTATED 4 80.0 (0.0)
6P GAIN WILD-TYPE 405 77.3 (14.8)

Figure S1.  Get High-res Image Clustering Approach #11: '6p gain mutation analysis' versus Clinical Feature #4: 'KARNOFSKY.PERFORMANCE.SCORE'

Clustering Approach #12: '6q gain mutation analysis'

Table S13.  Get Full Table Description of clustering approach #12: '6q gain mutation analysis'

Cluster Labels 6Q GAIN MUTATED 6Q GAIN WILD-TYPE
Number of samples 9 535
'6q gain mutation analysis' versus 'KARNOFSKY.PERFORMANCE.SCORE'

P value = 0.000254 (t-test), Q value = 0.098

Table S14.  Clustering Approach #12: '6q gain mutation analysis' versus Clinical Feature #4: 'KARNOFSKY.PERFORMANCE.SCORE'

nPatients Mean (Std.Dev)
ALL 409 77.3 (14.7)
6Q GAIN MUTATED 5 80.0 (0.0)
6Q GAIN WILD-TYPE 404 77.3 (14.8)

Figure S2.  Get High-res Image Clustering Approach #12: '6q gain mutation analysis' versus Clinical Feature #4: 'KARNOFSKY.PERFORMANCE.SCORE'

Clustering Approach #13: '7p gain mutation analysis'

Table S15.  Get Full Table Description of clustering approach #13: '7p gain mutation analysis'

Cluster Labels 7P GAIN MUTATED 7P GAIN WILD-TYPE
Number of samples 423 121
'7p gain mutation analysis' versus 'AGE'

P value = 5.79e-06 (t-test), Q value = 0.0023

Table S16.  Clustering Approach #13: '7p gain mutation analysis' versus Clinical Feature #2: 'AGE'

nPatients Mean (Std.Dev)
ALL 544 57.8 (14.3)
7P GAIN MUTATED 423 59.7 (12.2)
7P GAIN WILD-TYPE 121 51.3 (18.5)

Figure S3.  Get High-res Image Clustering Approach #13: '7p gain mutation analysis' versus Clinical Feature #2: 'AGE'

Clustering Approach #14: '7q gain mutation analysis'

Table S17.  Get Full Table Description of clustering approach #14: '7q gain mutation analysis'

Cluster Labels 7Q GAIN MUTATED 7Q GAIN WILD-TYPE
Number of samples 427 117
'7q gain mutation analysis' versus 'AGE'

P value = 7.78e-06 (t-test), Q value = 0.003

Table S18.  Clustering Approach #14: '7q gain mutation analysis' versus Clinical Feature #2: 'AGE'

nPatients Mean (Std.Dev)
ALL 544 57.8 (14.3)
7Q GAIN MUTATED 427 59.7 (12.1)
7Q GAIN WILD-TYPE 117 51.1 (19.1)

Figure S4.  Get High-res Image Clustering Approach #14: '7q gain mutation analysis' versus Clinical Feature #2: 'AGE'

Clustering Approach #15: '8p gain mutation analysis'

Table S19.  Get Full Table Description of clustering approach #15: '8p gain mutation analysis'

Cluster Labels 8P GAIN MUTATED 8P GAIN WILD-TYPE
Number of samples 26 518
Clustering Approach #16: '8q gain mutation analysis'

Table S20.  Get Full Table Description of clustering approach #16: '8q gain mutation analysis'

Cluster Labels 8Q GAIN MUTATED 8Q GAIN WILD-TYPE
Number of samples 31 513
Clustering Approach #17: '9p gain mutation analysis'

Table S21.  Get Full Table Description of clustering approach #17: '9p gain mutation analysis'

Cluster Labels 9P GAIN MUTATED 9P GAIN WILD-TYPE
Number of samples 17 527
Clustering Approach #18: '9q gain mutation analysis'

Table S22.  Get Full Table Description of clustering approach #18: '9q gain mutation analysis'

Cluster Labels 9Q GAIN MUTATED 9Q GAIN WILD-TYPE
Number of samples 32 512
Clustering Approach #19: '10p gain mutation analysis'

Table S23.  Get Full Table Description of clustering approach #19: '10p gain mutation analysis'

Cluster Labels 10P GAIN MUTATED 10P GAIN WILD-TYPE
Number of samples 11 533
'10p gain mutation analysis' versus 'AGE'

P value = 9.55e-05 (t-test), Q value = 0.037

Table S24.  Clustering Approach #19: '10p gain mutation analysis' versus Clinical Feature #2: 'AGE'

nPatients Mean (Std.Dev)
ALL 544 57.8 (14.3)
10P GAIN MUTATED 11 34.3 (12.9)
10P GAIN WILD-TYPE 533 58.3 (13.9)

Figure S5.  Get High-res Image Clustering Approach #19: '10p gain mutation analysis' versus Clinical Feature #2: 'AGE'

'10p gain mutation analysis' versus 'KARNOFSKY.PERFORMANCE.SCORE'

P value = 0.000254 (t-test), Q value = 0.098

Table S25.  Clustering Approach #19: '10p gain mutation analysis' versus Clinical Feature #4: 'KARNOFSKY.PERFORMANCE.SCORE'

nPatients Mean (Std.Dev)
ALL 409 77.3 (14.7)
10P GAIN MUTATED 11 80.0 (0.0)
10P GAIN WILD-TYPE 398 77.2 (14.9)

Figure S6.  Get High-res Image Clustering Approach #19: '10p gain mutation analysis' versus Clinical Feature #4: 'KARNOFSKY.PERFORMANCE.SCORE'

Clustering Approach #20: '11p gain mutation analysis'

Table S26.  Get Full Table Description of clustering approach #20: '11p gain mutation analysis'

Cluster Labels 11P GAIN MUTATED 11P GAIN WILD-TYPE
Number of samples 4 540
Clustering Approach #21: '11q gain mutation analysis'

Table S27.  Get Full Table Description of clustering approach #21: '11q gain mutation analysis'

Cluster Labels 11Q GAIN MUTATED 11Q GAIN WILD-TYPE
Number of samples 7 537
Clustering Approach #22: '12p gain mutation analysis'

Table S28.  Get Full Table Description of clustering approach #22: '12p gain mutation analysis'

Cluster Labels 12P GAIN MUTATED 12P GAIN WILD-TYPE
Number of samples 40 504
Clustering Approach #23: '12q gain mutation analysis'

Table S29.  Get Full Table Description of clustering approach #23: '12q gain mutation analysis'

Cluster Labels 12Q GAIN MUTATED 12Q GAIN WILD-TYPE
Number of samples 30 514
Clustering Approach #24: '13q gain mutation analysis'

Table S30.  Get Full Table Description of clustering approach #24: '13q gain mutation analysis'

Cluster Labels 13Q GAIN MUTATED 13Q GAIN WILD-TYPE
Number of samples 3 541
Clustering Approach #25: '14q gain mutation analysis'

Table S31.  Get Full Table Description of clustering approach #25: '14q gain mutation analysis'

Cluster Labels 14Q GAIN MUTATED 14Q GAIN WILD-TYPE
Number of samples 6 538
Clustering Approach #26: '15q gain mutation analysis'

Table S32.  Get Full Table Description of clustering approach #26: '15q gain mutation analysis'

Cluster Labels 15Q GAIN MUTATED 15Q GAIN WILD-TYPE
Number of samples 5 539
Clustering Approach #27: '16p gain mutation analysis'

Table S33.  Get Full Table Description of clustering approach #27: '16p gain mutation analysis'

Cluster Labels 16P GAIN MUTATED 16P GAIN WILD-TYPE
Number of samples 17 527
Clustering Approach #28: '16q gain mutation analysis'

Table S34.  Get Full Table Description of clustering approach #28: '16q gain mutation analysis'

Cluster Labels 16Q GAIN MUTATED 16Q GAIN WILD-TYPE
Number of samples 17 527
Clustering Approach #29: '17p gain mutation analysis'

Table S35.  Get Full Table Description of clustering approach #29: '17p gain mutation analysis'

Cluster Labels 17P GAIN MUTATED 17P GAIN WILD-TYPE
Number of samples 10 534
Clustering Approach #30: '17q gain mutation analysis'

Table S36.  Get Full Table Description of clustering approach #30: '17q gain mutation analysis'

Cluster Labels 17Q GAIN MUTATED 17Q GAIN WILD-TYPE
Number of samples 18 526
Clustering Approach #31: '18p gain mutation analysis'

Table S37.  Get Full Table Description of clustering approach #31: '18p gain mutation analysis'

Cluster Labels 18P GAIN MUTATED 18P GAIN WILD-TYPE
Number of samples 26 518
Clustering Approach #32: '18q gain mutation analysis'

Table S38.  Get Full Table Description of clustering approach #32: '18q gain mutation analysis'

Cluster Labels 18Q GAIN MUTATED 18Q GAIN WILD-TYPE
Number of samples 28 516
Clustering Approach #33: '19p gain mutation analysis'

Table S39.  Get Full Table Description of clustering approach #33: '19p gain mutation analysis'

Cluster Labels 19P GAIN MUTATED 19P GAIN WILD-TYPE
Number of samples 170 374
Clustering Approach #34: '19q gain mutation analysis'

Table S40.  Get Full Table Description of clustering approach #34: '19q gain mutation analysis'

Cluster Labels 19Q GAIN MUTATED 19Q GAIN WILD-TYPE
Number of samples 147 397
Clustering Approach #35: '20p gain mutation analysis'

Table S41.  Get Full Table Description of clustering approach #35: '20p gain mutation analysis'

Cluster Labels 20P GAIN MUTATED 20P GAIN WILD-TYPE
Number of samples 174 370
'20p gain mutation analysis' versus 'AGE'

P value = 0.000315 (t-test), Q value = 0.12

Table S42.  Clustering Approach #35: '20p gain mutation analysis' versus Clinical Feature #2: 'AGE'

nPatients Mean (Std.Dev)
ALL 544 57.8 (14.3)
20P GAIN MUTATED 174 60.9 (13.1)
20P GAIN WILD-TYPE 370 56.4 (14.6)

Figure S7.  Get High-res Image Clustering Approach #35: '20p gain mutation analysis' versus Clinical Feature #2: 'AGE'

Clustering Approach #36: '20q gain mutation analysis'

Table S43.  Get Full Table Description of clustering approach #36: '20q gain mutation analysis'

Cluster Labels 20Q GAIN MUTATED 20Q GAIN WILD-TYPE
Number of samples 172 372
Clustering Approach #37: '21q gain mutation analysis'

Table S44.  Get Full Table Description of clustering approach #37: '21q gain mutation analysis'

Cluster Labels 21Q GAIN MUTATED 21Q GAIN WILD-TYPE
Number of samples 32 512
Clustering Approach #38: '22q gain mutation analysis'

Table S45.  Get Full Table Description of clustering approach #38: '22q gain mutation analysis'

Cluster Labels 22Q GAIN MUTATED 22Q GAIN WILD-TYPE
Number of samples 11 533
Clustering Approach #39: 'Xq gain mutation analysis'

Table S46.  Get Full Table Description of clustering approach #39: 'Xq gain mutation analysis'

Cluster Labels XQ GAIN MUTATED XQ GAIN WILD-TYPE
Number of samples 3 541
Clustering Approach #40: '1p loss mutation analysis'

Table S47.  Get Full Table Description of clustering approach #40: '1p loss mutation analysis'

Cluster Labels 1P LOSS MUTATED 1P LOSS WILD-TYPE
Number of samples 5 539
Clustering Approach #41: '1q loss mutation analysis'

Table S48.  Get Full Table Description of clustering approach #41: '1q loss mutation analysis'

Cluster Labels 1Q LOSS MUTATED 1Q LOSS WILD-TYPE
Number of samples 4 540
Clustering Approach #42: '2p loss mutation analysis'

Table S49.  Get Full Table Description of clustering approach #42: '2p loss mutation analysis'

Cluster Labels 2P LOSS MUTATED 2P LOSS WILD-TYPE
Number of samples 10 534
Clustering Approach #43: '2q loss mutation analysis'

Table S50.  Get Full Table Description of clustering approach #43: '2q loss mutation analysis'

Cluster Labels 2Q LOSS MUTATED 2Q LOSS WILD-TYPE
Number of samples 9 535
Clustering Approach #44: '3p loss mutation analysis'

Table S51.  Get Full Table Description of clustering approach #44: '3p loss mutation analysis'

Cluster Labels 3P LOSS MUTATED 3P LOSS WILD-TYPE
Number of samples 22 522
Clustering Approach #45: '3q loss mutation analysis'

Table S52.  Get Full Table Description of clustering approach #45: '3q loss mutation analysis'

Cluster Labels 3Q LOSS MUTATED 3Q LOSS WILD-TYPE
Number of samples 16 528
Clustering Approach #46: '4p loss mutation analysis'

Table S53.  Get Full Table Description of clustering approach #46: '4p loss mutation analysis'

Cluster Labels 4P LOSS MUTATED 4P LOSS WILD-TYPE
Number of samples 25 519
Clustering Approach #47: '4q loss mutation analysis'

Table S54.  Get Full Table Description of clustering approach #47: '4q loss mutation analysis'

Cluster Labels 4Q LOSS MUTATED 4Q LOSS WILD-TYPE
Number of samples 24 520
Clustering Approach #48: '5p loss mutation analysis'

Table S55.  Get Full Table Description of clustering approach #48: '5p loss mutation analysis'

Cluster Labels 5P LOSS MUTATED 5P LOSS WILD-TYPE
Number of samples 21 523
Clustering Approach #49: '5q loss mutation analysis'

Table S56.  Get Full Table Description of clustering approach #49: '5q loss mutation analysis'

Cluster Labels 5Q LOSS MUTATED 5Q LOSS WILD-TYPE
Number of samples 20 524
Clustering Approach #50: '6p loss mutation analysis'

Table S57.  Get Full Table Description of clustering approach #50: '6p loss mutation analysis'

Cluster Labels 6P LOSS MUTATED 6P LOSS WILD-TYPE
Number of samples 47 497
Clustering Approach #51: '6q loss mutation analysis'

Table S58.  Get Full Table Description of clustering approach #51: '6q loss mutation analysis'

Cluster Labels 6Q LOSS MUTATED 6Q LOSS WILD-TYPE
Number of samples 79 465
Clustering Approach #52: '7p loss mutation analysis'

Table S59.  Get Full Table Description of clustering approach #52: '7p loss mutation analysis'

Cluster Labels 7P LOSS MUTATED 7P LOSS WILD-TYPE
Number of samples 5 539
Clustering Approach #53: '7q loss mutation analysis'

Table S60.  Get Full Table Description of clustering approach #53: '7q loss mutation analysis'

Cluster Labels 7Q LOSS MUTATED 7Q LOSS WILD-TYPE
Number of samples 4 540
Clustering Approach #54: '8p loss mutation analysis'

Table S61.  Get Full Table Description of clustering approach #54: '8p loss mutation analysis'

Cluster Labels 8P LOSS MUTATED 8P LOSS WILD-TYPE
Number of samples 34 510
Clustering Approach #55: '8q loss mutation analysis'

Table S62.  Get Full Table Description of clustering approach #55: '8q loss mutation analysis'

Cluster Labels 8Q LOSS MUTATED 8Q LOSS WILD-TYPE
Number of samples 21 523
Clustering Approach #56: '9p loss mutation analysis'

Table S63.  Get Full Table Description of clustering approach #56: '9p loss mutation analysis'

Cluster Labels 9P LOSS MUTATED 9P LOSS WILD-TYPE
Number of samples 160 384
Clustering Approach #57: '9q loss mutation analysis'

Table S64.  Get Full Table Description of clustering approach #57: '9q loss mutation analysis'

Cluster Labels 9Q LOSS MUTATED 9Q LOSS WILD-TYPE
Number of samples 63 481
Clustering Approach #58: '10p loss mutation analysis'

Table S65.  Get Full Table Description of clustering approach #58: '10p loss mutation analysis'

Cluster Labels 10P LOSS MUTATED 10P LOSS WILD-TYPE
Number of samples 426 118
'10p loss mutation analysis' versus 'Time to Death'

P value = 0.000152 (logrank test), Q value = 0.059

Table S66.  Clustering Approach #58: '10p loss mutation analysis' versus Clinical Feature #1: 'Time to Death'

nPatients nDeath Duration Range (Median), Month
ALL 544 411 0.1 - 127.6 (9.6)
10P LOSS MUTATED 426 323 0.1 - 127.6 (9.3)
10P LOSS WILD-TYPE 118 88 0.2 - 108.8 (10.8)

Figure S8.  Get High-res Image Clustering Approach #58: '10p loss mutation analysis' versus Clinical Feature #1: 'Time to Death'

'10p loss mutation analysis' versus 'AGE'

P value = 1.26e-09 (t-test), Q value = 4.9e-07

Table S67.  Clustering Approach #58: '10p loss mutation analysis' versus Clinical Feature #2: 'AGE'

nPatients Mean (Std.Dev)
ALL 544 57.8 (14.3)
10P LOSS MUTATED 426 60.3 (11.8)
10P LOSS WILD-TYPE 118 48.8 (18.3)

Figure S9.  Get High-res Image Clustering Approach #58: '10p loss mutation analysis' versus Clinical Feature #2: 'AGE'

Clustering Approach #59: '10q loss mutation analysis'

Table S68.  Get Full Table Description of clustering approach #59: '10q loss mutation analysis'

Cluster Labels 10Q LOSS MUTATED 10Q LOSS WILD-TYPE
Number of samples 439 105
'10q loss mutation analysis' versus 'AGE'

P value = 2.4e-07 (t-test), Q value = 9.4e-05

Table S69.  Clustering Approach #59: '10q loss mutation analysis' versus Clinical Feature #2: 'AGE'

nPatients Mean (Std.Dev)
ALL 544 57.8 (14.3)
10Q LOSS MUTATED 439 59.8 (12.4)
10Q LOSS WILD-TYPE 105 49.6 (18.2)

Figure S10.  Get High-res Image Clustering Approach #59: '10q loss mutation analysis' versus Clinical Feature #2: 'AGE'

Clustering Approach #60: '11p loss mutation analysis'

Table S70.  Get Full Table Description of clustering approach #60: '11p loss mutation analysis'

Cluster Labels 11P LOSS MUTATED 11P LOSS WILD-TYPE
Number of samples 64 480
Clustering Approach #61: '11q loss mutation analysis'

Table S71.  Get Full Table Description of clustering approach #61: '11q loss mutation analysis'

Cluster Labels 11Q LOSS MUTATED 11Q LOSS WILD-TYPE
Number of samples 55 489
Clustering Approach #62: '12p loss mutation analysis'

Table S72.  Get Full Table Description of clustering approach #62: '12p loss mutation analysis'

Cluster Labels 12P LOSS MUTATED 12P LOSS WILD-TYPE
Number of samples 37 507
Clustering Approach #63: '12q loss mutation analysis'

Table S73.  Get Full Table Description of clustering approach #63: '12q loss mutation analysis'

Cluster Labels 12Q LOSS MUTATED 12Q LOSS WILD-TYPE
Number of samples 34 510
Clustering Approach #64: '13q loss mutation analysis'

Table S74.  Get Full Table Description of clustering approach #64: '13q loss mutation analysis'

Cluster Labels 13Q LOSS MUTATED 13Q LOSS WILD-TYPE
Number of samples 134 410
Clustering Approach #65: '14q loss mutation analysis'

Table S75.  Get Full Table Description of clustering approach #65: '14q loss mutation analysis'

Cluster Labels 14Q LOSS MUTATED 14Q LOSS WILD-TYPE
Number of samples 120 424
Clustering Approach #66: '15q loss mutation analysis'

Table S76.  Get Full Table Description of clustering approach #66: '15q loss mutation analysis'

Cluster Labels 15Q LOSS MUTATED 15Q LOSS WILD-TYPE
Number of samples 62 482
Clustering Approach #67: '16p loss mutation analysis'

Table S77.  Get Full Table Description of clustering approach #67: '16p loss mutation analysis'

Cluster Labels 16P LOSS MUTATED 16P LOSS WILD-TYPE
Number of samples 23 521
Clustering Approach #68: '16q loss mutation analysis'

Table S78.  Get Full Table Description of clustering approach #68: '16q loss mutation analysis'

Cluster Labels 16Q LOSS MUTATED 16Q LOSS WILD-TYPE
Number of samples 40 504
Clustering Approach #69: '17p loss mutation analysis'

Table S79.  Get Full Table Description of clustering approach #69: '17p loss mutation analysis'

Cluster Labels 17P LOSS MUTATED 17P LOSS WILD-TYPE
Number of samples 35 509
Clustering Approach #70: '17q loss mutation analysis'

Table S80.  Get Full Table Description of clustering approach #70: '17q loss mutation analysis'

Cluster Labels 17Q LOSS MUTATED 17Q LOSS WILD-TYPE
Number of samples 19 525
Clustering Approach #71: '18p loss mutation analysis'

Table S81.  Get Full Table Description of clustering approach #71: '18p loss mutation analysis'

Cluster Labels 18P LOSS MUTATED 18P LOSS WILD-TYPE
Number of samples 43 501
Clustering Approach #72: '18q loss mutation analysis'

Table S82.  Get Full Table Description of clustering approach #72: '18q loss mutation analysis'

Cluster Labels 18Q LOSS MUTATED 18Q LOSS WILD-TYPE
Number of samples 38 506
Clustering Approach #73: '19p loss mutation analysis'

Table S83.  Get Full Table Description of clustering approach #73: '19p loss mutation analysis'

Cluster Labels 19P LOSS MUTATED 19P LOSS WILD-TYPE
Number of samples 13 531
Clustering Approach #74: '19q loss mutation analysis'

Table S84.  Get Full Table Description of clustering approach #74: '19q loss mutation analysis'

Cluster Labels 19Q LOSS MUTATED 19Q LOSS WILD-TYPE
Number of samples 18 526
Clustering Approach #75: '20p loss mutation analysis'

Table S85.  Get Full Table Description of clustering approach #75: '20p loss mutation analysis'

Cluster Labels 20P LOSS MUTATED 20P LOSS WILD-TYPE
Number of samples 11 533
Clustering Approach #76: '20q loss mutation analysis'

Table S86.  Get Full Table Description of clustering approach #76: '20q loss mutation analysis'

Cluster Labels 20Q LOSS MUTATED 20Q LOSS WILD-TYPE
Number of samples 10 534
Clustering Approach #77: '21q loss mutation analysis'

Table S87.  Get Full Table Description of clustering approach #77: '21q loss mutation analysis'

Cluster Labels 21Q LOSS MUTATED 21Q LOSS WILD-TYPE
Number of samples 25 519
Clustering Approach #78: '22q loss mutation analysis'

Table S88.  Get Full Table Description of clustering approach #78: '22q loss mutation analysis'

Cluster Labels 22Q LOSS MUTATED 22Q LOSS WILD-TYPE
Number of samples 138 406
Clustering Approach #79: 'Xq loss mutation analysis'

Table S89.  Get Full Table Description of clustering approach #79: 'Xq loss mutation analysis'

Cluster Labels XQ LOSS MUTATED XQ LOSS WILD-TYPE
Number of samples 11 533
Methods & Data
Input
  • Cluster data file = broad_values_by_arm.mutsig.cluster.txt

  • Clinical data file = GBM-TP.clin.merged.picked.txt

  • Number of patients = 544

  • Number of clustering approaches = 79

  • Number of selected clinical features = 5

  • Exclude small clusters that include fewer than K patients, K = 3

Survival analysis

For survival clinical features, the Kaplan-Meier survival curves of tumors with and without gene mutations were plotted and the statistical significance P values were estimated by logrank test (Bland and Altman 2004) using the 'survdiff' function in R

Student's t-test analysis

For continuous numerical clinical features, two-tailed Student's t test with unequal variance (Lehmann and Romano 2005) was applied to compare the clinical values between two tumor subtypes using 't.test' function in R

Fisher's exact test

For binary clinical features, two-tailed Fisher's exact tests (Fisher 1922) were used to estimate the P values using the 'fisher.test' function in R

Q value calculation

For multiple hypothesis correction, Q value is the False Discovery Rate (FDR) analogue of the P value (Benjamini and Hochberg 1995), defined as the minimum FDR at which the test may be called significant. We used the 'Benjamini and Hochberg' method of 'p.adjust' function in R to convert P values into Q values.

Download Results

This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.

References
[1] Bland and Altman, Statistics notes: The logrank test, BMJ 328(7447):1073 (2004)
[2] Lehmann and Romano, Testing Statistical Hypotheses (3E ed.), New York: Springer. ISBN 0387988645 (2005)
[3] Fisher, R.A., On the interpretation of chi-square from contingency tables, and the calculation of P, Journal of the Royal Statistical Society 85(1):87-94 (1922)
[4] Benjamini and Hochberg, Controlling the false discovery rate: a practical and powerful approach to multiple testing, Journal of the Royal Statistical Society Series B 59:289-300 (1995)