LowPass Copy number analysis (GISTIC2)
PANCANCER cohort with 12 disease types (Primary solid tumor)
15 January 2014  |  analyses__2014_01_15
Maintainer Information
Citation Information
Maintained by Spring Yingchun Liu (Broad Institute)
Cite as Broad Institute TCGA Genome Data Analysis Center (2014): LowPass Copy number analysis (GISTIC2). Broad Institute of MIT and Harvard. doi:10.7908/C1TQ6001
Overview
Introduction

GISTIC identifies genomic regions that are significantly gained or lost across a set of tumors. The pipeline first filters out normal samples from the segmented copy-number data by inspecting the TCGA barcodes and then executes GISTIC version 2.0.20 (Firehose task version: 126).

Summary

There were 568 tumor samples used in this analysis: 24 significant arm-level results, 48 significant focal amplifications, and 15 significant focal deletions were found.

Results
Focal results

Figure 1.  Genomic positions of amplified regions: the X-axis represents the normalized amplification signals (top) and significance by Q value (bottom). The green line represents the significance cutoff at Q value=0.25.

Table 1.  Get Full Table Amplifications Table - 48 significant amplifications found. Click the link in the last column to view a comprehensive list of candidate genes. If no genes were identified within the peak, the nearest gene appears in brackets.

Cytoband Q value Residual Q value Wide Peak Boundaries # Genes in Wide Peak
11q13.3 6.1722e-53 6.1722e-53 chr11:69433939-69496923 2
12q15 4.6818e-20 4.6818e-20 chr12:69164280-69256976 2
7p11.2 2.1778e-22 9.286e-20 chr7:54966828-55398999 1
5q31.1 3.7367e-12 3.7367e-12 chr5:134286002-134291999 1
8p11.23 2.8085e-19 1.2584e-11 chr8:38149140-38471626 5
1q23.3 1.3989e-15 1.1694e-10 chr1:160838101-161113771 11
8q22.2 1.1099e-10 2.7991e-09 chr8:101104702-101384142 5
17q12 2.8846e-11 5.1256e-09 chr17:37812050-37914706 8
8q24.21 2.0888e-13 7.6826e-07 chr8:128549594-128938392 4
6p22.3 5.4572e-06 5.4572e-06 chr6:18655955-23840353 9
3p25.2 9.3185e-06 9.3185e-06 chr3:11715002-12801999 10
1q21.3 6.9336e-12 9.8258e-06 chr1:150290149-152317480 62
12p12.1 1.0283e-05 1.0283e-05 chr12:24883086-25757762 7
19q13.2 1.4362e-06 2.5728e-05 chr19:39270019-39498815 11
3q26.2 3.0833e-05 3.0833e-05 chr3:169286264-170136608 14
20q11.21 7.3965e-08 6.9585e-05 chr20:30057002-30329931 11
14q11.2 1.8357e-06 8.2395e-05 chr14:21712002-22029999 10
14q13.2 4.0015e-05 0.00022668 chr14:34919665-37209104 21
5p15.33 8.1416e-05 0.0004807 chr5:1-2076204 34
4p16.1 0.00097068 0.00097068 chr4:10187381-10379633 0 [ZNF518B]
7q22.1 0.002451 0.002451 chr7:96277028-103609234 146
1p34.2 0.0028647 0.0028647 chr1:39897002-40544364 16
20q13.2 7.2771e-05 0.0034007 chr20:52038600-52580999 4
7p14.1 0.00029371 0.0037367 chr7:38261002-38435240 4
9p22.3 0.0044349 0.0044349 chr9:13765347-15939999 10
8p11.21 2.5685e-07 0.0059057 chr8:41291218-42811666 22
2q11.2 0.0021888 0.0084813 chr2:97488549-98277749 11
18q11.2 0.022619 0.022619 chr18:20042442-21246928 8
11p15.5 0.025874 0.025874 chr11:2062002-2088574 0 [H19]
16p12.1 0.030514 0.030514 chr16:23878118-25399726 11
5p13.1 0.0074373 0.030987 chr5:40661747-40848719 7
8p11.23 3.2749e-15 0.034725 chr8:36675464-37768616 8
19q13.11 2.5728e-05 0.034725 chr19:24048931-32817092 18
6q12 0.043602 0.043602 chr6:63851432-64384999 3
17q11.2 0.01793 0.04659 chr17:25738351-26390443 6
20q11.22 3.4227e-06 0.049583 chr20:32677940-33534814 15
8q11.1 6.2447e-05 0.057112 chr8:42933758-47889868 5
13q22.1 0.048069 0.060809 chr13:73619539-73702962 1
10p14 0.061731 0.061731 chr10:1-15219269 97
2p25.1 0.084643 0.084643 chr2:11393819-11483999 1
17q24.3 0.03954 0.097398 chr17:59360795-81195210 338
18p11.32 0.097398 0.097398 chr18:2548123-4323320 15
2q31.1 0.048877 0.11471 chr2:170614261-170765285 3
Xp11.1 0.1166 0.1166 chrX:58103757-58310388 0 [ZXDA]
7p22.1 0.02715 0.12734 chr7:1-20660267 128
13q34 0.095809 0.21304 chr13:92199320-115169878 109
4q13.3 0.22454 0.22454 chr4:72965948-75039107 19
22q11.23 0.23376 0.23376 chr22:1-27189932 197
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 11q13.3.

Table S1.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
CCND1
ORAOV1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 12q15.

Table S2.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
MDM2
CPM
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 7p11.2.

Table S3.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
EGFR
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 5q31.1.

Table S4.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
PCBD2
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8p11.23.

Table S5.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
FGFR1
WHSC1L1
LETM2
RNF5P1
C8orf86
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 1q23.3.

Table S6.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
NIT1
PFDN2
USF1
DEDD
F11R
ITLN1
PVRL4
KLHDC9
ITLN2
ARHGAP30
TSTD1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8q22.2.

Table S7.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
POLR2K
SPAG1
RNF19A
RGS22
FBXO43
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 17q12.

Table S8.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ERBB2
GRB7
PNMT
TCAP
STARD3
MIEN1
PGAP3
MIR4728
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8q24.21.

Table S9.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
MYC
hsa-mir-1204
PVT1
MIR1204
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 6p22.3.

Table S10.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
E2F3
ID4
PRL
SOX4
CDKAL1
MBOAT1
HDGFL1
LINC00340
LOC729177
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 3p25.2.

Table S11.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
PPARG
RAF1
SYN2
TIMP4
VGLL4
MKRN2
TMEM40
TSEN2
TAMM41
LOC100129480
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 1q21.3.

Table S12.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ARNT
hsa-mir-554
hsa-mir-4257
CTSK
CTSS
ECM1
ENSA
FLG
MCL1
PI4KB
PSMB4
PSMD4
RFX5
RORC
S100A10
S100A11
VPS72
TCHH
TUFT1
PIP5K1A
ANXA9
SELENBP1
PRPF3
SETDB1
SEMA6C
MLLT11
TDRKH
CELF3
POGZ
RPRD2
TMOD4
CERS2
OAZ3
ADAMTSL4
C1orf56
GOLPH3L
FAM63A
CDC42SE1
CGN
ZNF687
PRUNE
MRPL9
SCNM1
TNFAIP8L2
TARS2
SNX27
HORMAD1
THEM4
GABPB2
TCHHL1
RPTN
BNIPL
RIIAD1
THEM5
LINGO4
LYSMD1
HRNR
MIR554
LOC100132111
C2CD4D
MIR4257
TNFAIP8L2-SCNM1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 12p12.1.

Table S13.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
KRAS
BCAT1
LRMP
CASC1
LYRM5
IFLTD1
C12orf77
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 19q13.2.

Table S14.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ECH1
HNRNPL
LGALS4
NFKBIB
MRPS12
SIRT2
SARS2
FBXO17
RINL
LOC643669
LGALS7B
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 3q26.2.

Table S15.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
MECOM
PRKCI
SKIL
TERC
SEC62
GPR160
MYNN
LRRC31
PHC3
ARPM1
LRRC34
LRRIQ4
SAMD7
LOC100128164
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 20q11.21.

Table S16.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
hsa-mir-3193
BCL2L1
ID1
TPX2
REM1
HM13
COX4I2
LINC00028
DEFB124
PSIMCT-1
MIR3193
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 14q11.2.

Table S17.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
HNRNPC
SALL2
TOX4
SUPT16H
METTL3
RPGRIP1
CHD8
RAB2B
SNORD8
SNORD9
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 14q13.2.

Table S18.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
NKX2-1
CFL2
NFKBIA
PAX9
PSMA6
SRP54
KIAA0391
BAZ1A
NKX2-8
MBIP
PPP2R3C
EAPP
SNX6
BRMS1L
INSM2
SLC25A21
SPTSSA
RALGAPA1
SFTA3
IGBP1P1
FAM177A1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 5p15.33.

Table S19.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
hsa-mir-4277
NDUFS6
SDHA
SLC6A3
SLC9A3
TERT
TRIP13
PDCD6
SLC12A7
TPPP
EXOC3
PP7080
IRX4
CEP72
AHRR
MRPL36
BRD9
ZDHHC11
LPCAT1
CLPTM1L
NKD2
C5orf55
CCDC127
PLEKHG4B
SLC6A19
SLC6A18
LRRC14B
SDHAP3
LOC728613
MIR4277
LOC100506688
MIR4457
MIR4456
MIR4635
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 7q22.1.

Table S20.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
hsa-mir-548o
hsa-mir-4285
hsa-mir-106b
ACHE
ASNS
AZGP1
AP1S1
CUX1
CYP3A7
CYP3A4
CYP3A5
DLX5
DLX6
EPHB4
EPO
GNB2
AGFG2
LRCH4
MCM7
NPTX2
OCM2
SERPINE1
PCOLCE
PMS2P1
POLR2J
RELN
PSMC2
TAC1
TAF6
TFR2
TRIP6
VGF
ZAN
ZNF3
ZKSCAN1
ZSCAN21
SHFM1
TRRAP
BUD31
PLOD3
AP4M1
PMPCB
ATP5J2
MUC12
ARPC1B
RASA4
LRRC17
POP7
ZNHIT1
ARPC1A
SH2B2
STAG3
CPSF4
COPS6
PDAP1
LMTK2
ZKSCAN5
CLDN15
BRI3
TECPR1
PTCD1
FBXO24
DNAJC2
PILRB
PILRA
FIS1
ACTL6B
SRRT
ALKBH4
ZCWPW1
C7orf43
BAIAP2L1
MEPCE
SLC12A9
ACN9
SMURF1
MOSPD3
GIGYF1
RABL5
CYP3A43
ZNF655
PVRIG
GAL3ST4
PRKRIP1
ORAI2
OR2AE1
TSC22D4
TRIM56
ARMC10
ZNF394
MYH16
TRIM4
MYL10
EMID2
MUC17
BHLHA15
ZNF498
FAM200A
PPP1R35
GPC2
LRWD1
FAM185A
FBXL13
NAPEPLD
TMEM130
NYAP1
CNPY4
POLR2J2
MBLAC1
DLX6-AS1
ZNF789
MOGAT3
GJC3
DPY19L2P2
GATS
NAT16
SLC26A5
MGC72080
C7orf59
KPNA7
C7orf61
UFSP1
MIR106B
MIR25
MIR93
SPDYE3
SPDYE2
POLR2J3
AZGP1P1
SPDYE6
RPL19P12
LOC100129845
UPK3BL
LOC100289187
LOC100289561
SPDYE2L
SAP25
MIR4285
MIR3609
LOC100506136
ATP5J2-PTCD1
MIR4653
MIR4467
MIR4658
LOC100630923
CYP3A7-CYP3AP1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 1p34.2.

Table S21.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
MYCL1
BMP8B
PPT1
PABPC4
PPIE
CAP1
MACF1
HEYL
HPCAL4
TRIT1
OXCT2
NT5C1A
MFSD2A
BMP8A
SNORA55
PPIEL
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 20q13.2.

Table S22.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ZNF217
BCAS1
TSHZ2
SUMO1P1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 7p14.1.

Table S23.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
AMPH
STARD3NL
TARP
LOC100506776
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 9p22.3.

Table S24.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
NFIB
SNAPC3
CER1
PSIP1
TTC39B
FREM1
C9orf93
ZDHHC21
LOC389705
C9orf146
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8p11.21.

Table S25.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
HOOK3
hsa-mir-486
ANK1
CHRNB3
IKBKB
PLAT
POLB
SLC20A2
VDAC3
KAT6A
CHRNA6
AP3M2
DKK4
GOLGA7
THAP1
RNF170
GINS4
C8orf40
AGPAT6
NKX6-3
MIR486
MIR4469
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 2q11.2.

Table S26.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
COX5B
ACTR1B
CNNM3
ANKRD39
FAM178B
SEMA4C
ANKRD36B
FAHD2B
ANKRD23
ANKRD36
LOC100506123
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 18q11.2.

Table S27.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
NPC1
RBBP8
RIOK3
C18orf8
TMEM241
CABLES1
ANKRD29
MIR4741
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 16p12.1.

Table S28.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
AQP8
PRKCB
RBBP6
CACNG3
TNRC6A
LCMT1
ARHGAP17
SLC5A11
ZKSCAN2
LOC554206
LOC100506655
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 5p13.1.

Table S29.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
PRKAA1
PTGER4
RPL37
TTC33
CARD6
SNORD72
LOC100506548
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8p11.23.

Table S30.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ERLIN2
PROSC
GPR124
BRF2
ZNF703
RAB11FIP1
KCNU1
LOC728024
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 19q13.11.

Table S31.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
CCNE1
UQCRFS1
URI1
ZNF254
ZNF536
POP4
TSHZ3
PLEKHF1
C19orf12
DKFZp566F0947
LOC148145
LOC148189
LOC284395
VSTM2B
ZNF726
LOC100101266
LOC100505835
THEG5
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 6q12.

Table S32.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
PTP4A1
PHF3
LGSN
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 17q11.2.

Table S33.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
LGALS9
NOS2
KSR1
NLK
C17orf108
TBC1D3P5
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 20q11.22.

Table S34.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
hsa-mir-644
AHCY
ASIP
GGT7
GSS
EIF2S2
NCOA6
ACSS2
TP53INP2
DYNLRB1
ITCH
MAP1LC3A
PIGU
HMGB3P1
MIR644A
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8q11.1.

Table S35.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
FNTA
SGK196
HGSNAT
POTEA
LINC00293
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 13q22.1.

Table S36.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
KLF5
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 10p14.

Table S37.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
GATA3
hsa-mir-1265
hsa-mir-548q
hsa-mir-3155
ADARB2
ATP5C1
CALML3
AKR1C4
KLF6
AKR1C1
AKR1C2
GDI2
IDI1
IL2RA
IL15RA
ITIH2
PFKFB3
PFKP
PHYH
PRKCQ
PRPF18
AKR1C3
CDC123
NMT2
USP6NL
OPTN
NET1
PITRM1
RPP38
CELF2
ZMYND11
NUDT5
WDR37
SEPHS1
KIN
DIP2C
LARP4B
GTPBP4
UPF2
HSPA14
CALML5
ANKRD16
FAM208B
SEC61A2
OLAH
MCM10
DHTKD1
FRMD4A
IDI2-AS1
CAMK1D
SFMBT2
DCLRE1C
SUV39H2
ECHDC3
ASB13
TUBAL3
ITIH5
AKR1E2
FAM107B
CCDC3
TAF3
FBXO18
RBM17
IDI2
UCN3
SFTA1P
LOC219731
UCMA
C10orf111
BEND7
LOC254312
C10orf47
LOC282980
LOC283070
LOC338588
AKR1CL1
TUBB8
tAKR
LINC00200
LOC399708
LOC399715
FLJ45983
ACBD7
C10orf108
LOC439949
LOC439950
CDNF
ADARB2-AS1
MEIG1
LOC100192204
LOC100216001
MIR1265
MIR3155A
LOC100507034
LOC100507127
MIR4480
MIR3155B
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 2p25.1.

Table S38.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ROCK2
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 17q24.3.

Table S39.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
CD79B
DDX5
PRKAR1A
ASPSCR1
BRIP1
CANT1
hsa-mir-3186
hsa-mir-1250
hsa-mir-4316
hsa-mir-636
hsa-mir-635
hsa-mir-548d-2
hsa-mir-634
hsa-mir-4315-2
hsa-mir-633
AANAT
ACOX1
ACTG1
BIRC5
APOH
ARHGDIA
CACNG1
CD7
CDK3
CSH1
CSH2
CSHL1
CSNK1D
SLC25A10
CYB561
ACE
ERN1
EVPL
BPTF
FASN
FDXR
FOXJ1
GAA
GALK1
GCGR
GH1
GH2
UTS2R
GPS1
GRB2
GRIN2C
H3F3B
ICAM2
ICT1
FOXK2
ITGB4
KCNJ2
KCNJ16
KPNA2
LGALS3BP
LLGL2
MAFG
MAP3K3
NPTX1
P4HB
PDE6G
PECAM1
PRKCA
MAP2K6
PRPSAP1
PSMC5
PSMD12
PYCR1
PCYT2
RAC3
RFNG
RPL38
MRPL12
SCN4A
SEC14L1
SECTM1
SRSF2
SGSH
SMARCD2
SUMO2
SOX9
SRP68
SSTR2
TBCD
TBX2
TIMP2
TK1
AXIN2
CBX4
DNAH17
RGS9
GALR2
SPHK1
SOCS3
SLC16A6
SLC16A5
SLC16A3
SYNGR2
HGS
CYTH1
SLC9A3R1
COG1
RECQL5
PGS1
TBX4
AATK
KIAA0195
EIF4A3
MRC2
HELZ
MED13
ALYREF
DCAF7
ABCA10
ABCA9
ABCA8
BAIAP2
ATP5H
ST6GALNAC2
GNA13
SEPT9
CD300C
RAB40B
TLK2
POLG2
CD300A
TMC6
DDX42
ARSG
AZI1
GGA3
JMJD6
EXOC7
ABCA6
ABCA5
KCTD2
WBP2
CDC42EP4
FSCN2
NOL11
TANC2
NAT9
PITPNC1
NARF
CACNG5
CACNG4
SAP30BP
NT5C
CDR2L
MRPS7
HN1
DCXR
TACO1
AMZ2
ANAPC11
SIRT7
SDK2
FAM20A
BCAS3
TMEM104
C17orf80
CCDC40
WIPI1
NPLOC4
ST6GALNAC1
TEX2
GPRC5C
WDR45L
CCDC47
CBX8
MIF4GD
INTS2
CASKIN2
RPTOR
BAHCC1
USP36
RNF213
TNRC6C
SLC25A19
UBE2O
DUS1L
FN3K
DNAI2
SMURF2
ENGASE
MRPL38
CARD14
MFSD11
C17orf62
ARMC7
CHMP6
RHBDF2
FN3KRP
C17orf101
ZNF750
NUP85
MYO15B
C17orf70
LIMD2
KCNH6
TSPAN10
QRICH2
CBX2
FAM104A
MGC16275
FBF1
UNK
CEP95
TRIM47
STRADA
C17orf72
MAFG-AS1
OTOP2
TTYH2
CYGB
RNF157
C1QTNF1
PPP1R27
FTSJ3
METTL23
SLC38A10
USH1G
CD300LB
KIF19
TBC1D16
AFMID
MGAT5B
C17orf56
B3GNTL1
RBFOX3
CD300LF
C17orf77
TCAM1P
EFCAB3
LOC146880
NOTUM
TMC8
MARCH10
CEP112
STRA13
LRRC45
SLC39A11
AMZ2P1
TRIM65
UNC13D
MYADML2
NPB
LINC00469
FADS6
C17orf28
TSEN54
FAM100B
TMEM235
CCDC57
HEXDC
C17orf58
MILR1
SLC26A11
ENDOV
C17orf89
LINC00482
TMEM105
METRNL
RAB37
METTL2A
ENPP7
C17orf90
CCDC137
ARL16
CD300E
NACA2
OTOP3
FAM195B
GPR142
ZACN
LRRC37A3
C17orf82
BTBD17
AATK-AS1
FLJ43681
KCNJ2-AS1
LINC00511
LOC400620
FLJ45079
TEX19
MXRA7
TBC1D3P2
PLEKHM1P
LOC440461
FLJ90757
MIR338
CPSF4L
C17orf109
LINC00338
SCARNA16
SNORA76
SNORD1A
SNORD1B
SNORD1C
SNORD104
MIR634
MIR635
MIR636
MIR657
LOC729683
PRCD
SNORA38B
C17orf110
LOC100131096
CD300LD
TEN1
C17orf99
LOC100287042
LOC100294362
MIR1250
MIR4316
MIR3065
MIR548W
MIR3186
MIR4315-2
MIR4315-1
LOC100499466
LOC100499467
MIR3678
MIR3615
LOC100507218
LOC100507246
LOC100507351
LOC100507410
TEN1-CDK3
MIR4739
MIR4738
MIR4740
MIR4524A
MIR4730
MIR3064
MIR5047
LOC100653515
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 18p11.32.

Table S40.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
TGIF1
MYOM1
DLGAP1
LPIN2
NDC80
MYL12A
SMCHD1
METTL4
EMILIN2
MYL12B
LOC201477
LOC284215
CBX3P2
FLJ35776
LOC727896
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 2q31.1.

Table S41.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
SSB
METTL5
UBR3
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 7p22.1.

Table S42.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ETV1
PMS2
CARD11
hsa-mir-3146
hsa-mir-1302-6
hsa-mir-589
hsa-mir-339
ACTB
AHR
DGKB
GNA12
GPER
ICA1
ITGB8
LFNG
MEOX2
NUDT1
NDUFA4
PDGFA
PRKAR1B
RAC1
RPA3
FSCN1
TWIST1
ZNF12
AIMP2
MAFK
MAD1L1
EIF3B
CYTH3
PHF14
HDAC9
KIAA0415
ARL4A
AGR2
KDELR2
ADAP1
SNX13
IQCE
SUN1
SOSTDC1
WIPI2
INTS1
TSPAN13
EIF2AK1
BZW2
SNX8
FTSJ2
NXPH1
GET4
CCZ1
MIOS
RNF216
TMEM106B
ZNF853
CYP2W1
HEATR2
ZDHHC4
CHST12
RADIL
PAPOLB
C1GALT1
FAM20C
ANKMY2
RBAK
C7orf26
MICALL2
FBXL18
TTYH3
USP42
PSMG3
C7orf50
TNRC18
C7orf70
SCIN
ZFAND2A
COX19
GLCCI1
KIAA1908
GPR146
AMZ1
AGR3
PER4
TMEM184A
VWDE
PRPS1L1
TWISTNB
BRAT1
SDK1
FOXK1
MMD2
DAGLB
CCZ1B
THSD7A
FERD3L
SLC29A4
RSPH10B
TMEM196
TFAMP1
UNCX
COL28A1
ABCB5
MACC1
LOC389458
ELFN1
AGMO
GRID2IP
ZNF815
RNF216P1
PMS2CL
FLJ44511
LOC442497
MIR339
ZNF890P
OCM
MIR589
RSPH10B2
LOC729852
ISPD
LOC100131257
LOC100288524
MIR3146
LOC100506025
LRRC72
RBAK-LOC389458
MIR4648
MIR4655
MIR4656
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 13q34.

Table S43.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ERCC5
hsa-mir-1267
hsa-mir-4306
hsa-mir-623
hsa-mir-3170
ATP4B
COL4A1
COL4A2
DCT
GPR183
EFNB2
F7
F10
FGF14
GPC5
GAS6
GPR18
ING1
IPO5
LAMP1
LIG4
PCCA
DNAJC3
RAP2A
GRK1
SLC10A2
SLC15A1
SOX1
TFDP1
TPP2
ZIC2
STK24
CUL4A
IRS2
PROZ
ARHGEF7
CDC16
CLDN10
ITGBL1
TM9SF2
GPC6
MBNL2
FARP1
ABCC4
TUBGCP3
TNFSF13B
SOX21
RASA3
DZIP1
MYO16
ATP11A
MCF2L
DOCK9
TGDS
OXGR1
BIVM
TMCO3
ARGLU1
DCUN1D2
ANKRD10
RAB20
CARKD
UGGT2
PCID2
UPF3A
KDELC1
CARS2
GRTP1
TMTC4
ABHD13
ZIC5
A2LD1
TEX30
ADPRHL1
TEX29
METTL21CP1
SPACA7
RNF113B
GPR180
CLYBL
METTL21C
NALCN
HS6ST3
DAOA
DAOA-AS1
FGF14-IT1
LOC283481
LINC00346
CHAMP1
UBAC2
FAM70B
C13orf35
FLJ44054
CCDC168
FLJ41484
MIR623
LINC00460
FAM155A
LINC00552
FKSG29
UBAC2-AS1
MCF2L-AS1
MIR4306
MIR3170
LOC100506394
BIVM-ERCC5
MIR2681
MIR548AN
MIR4705
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 4q13.3.

Table S44.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
AFM
AFP
ALB
CXCL1
CXCL2
CXCL3
IL8
PF4
PF4V1
PPBP
CXCL6
CXCL5
ADAMTS3
NPFFR2
PPBPL2
ANKRD17
RASSF6
COX18
MTHFD2L
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 22q11.23.

Table S45.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
BCR
SMARCB1
CLTCL1
hsa-mir-548j
hsa-mir-650
hsa-mir-130b
hsa-mir-649
hsa-mir-1286
hsa-mir-1306
hsa-mir-185
hsa-mir-648
hsa-mir-3198
ADORA2A
ADRBK2
ARVCF
ATP6V1E1
BID
COMT
CRKL
CRYBA4
CRYBB1
CRYBB2
CRYBB2P1
CRYBB3
DDT
GGT1
GGT3P
GGT5
GNAZ
GP1BB
GSC2
GSTT1
GSTT2
SERPIND1
IGLL1
MIF
MMP11
PI4KA
SEPT5
MAPK1
PRODH
RANBP1
SLC7A4
SLC25A1
SNRPD3
TBX1
CLDN5
TOP1P2
HIRA
UBE2L3
UFD1L
VPREB1
ZNF70
ZNF74
DGCR6
LZTR1
DGCR14
CDC45
TPST2
TOP3B
P2RX6
SNAP29
RAB36
PPM1F
DGCR2
TXNRD2
USP18
HIC2
SPECC1L
CABIN1
PRAME
SEZ6L
TSSK2
SDF2L1
PPIL2
IL17RA
ANKRD62P1-PARP4P3
POTEH
BCL2L13
TFIP11
GSTTP1
DGCR11
DGCR9
POM121L1P
DGCR5
DGCR10
FBXW4P1
TRMT2A
RTDR1
HSFY1P1
CECR6
CECR5
CECR3
CECR2
POM121L9P
POM121L8P
YPEL1
ZDHHC8
VPREB3
C22orf43
MED15
UPB1
TUBA8
CECR1
DGCR8
GNB1L
PEX26
SUSD2
ASPHD2
MICAL3
MRPL40
RTN4R
SLC2A11
C22orf29
THAP7
OR11H1
C22orf13
SLC25A18
TMEM191A
MYO18B
KLHL22
DGCR6L
RIMBP3
KIAA1671
HPS4
SCARF2
GGTLC2
GUSBP11
DERL3
IGLL3P
LRP5L
LOC96610
MGC16703
GAB4
C22orf39
C22orf25
ZNF280A
SGSM1
ZNF280B
CCT8L2
XKR3
LOC150185
LOC150197
AIFM3
RIMBP3C
YDJC
ZDHHC8P1
C22orf15
CCDC116
TMEM211
POM121L4P
RGL4
LOC284865
LOC284889
PI4KAP2
TPTEP1
LOC388849
FAM211B
LOC391322
LOC400891
BCRP2
CHCHD10
SRRD
MIR130B
MIR185
THAP7-AS1
P2RX6P
RIMBP3B
PIWIL3
MIAT
BCRP3
TMEM191C
C22orf45
POM121L10P
LOC648691
CES5AP1
GSTTP2
GSTT2B
MIR648
MIR650
TMEM191B
PI4KAP1
LOC729444
DDTL
MIR301B
LOC100128531
CECR7
CECR5-AS1
FLJ41941
MIR1286
MIR1306
MIR548J
MIR3198-1
IGLL5
MIR3618
SEPT5-GP1BB
MIR4761

Figure 2.  Genomic positions of deleted regions: the X-axis represents the normalized deletion signals (top) and significance by Q value (bottom). The green line represents the significance cutoff at Q value=0.25.

Table 2.  Get Full Table Deletions Table - 15 significant deletions found. Click the link in the last column to view a comprehensive list of candidate genes. If no genes were identified within the peak, the nearest gene appears in brackets.

Cytoband Q value Residual Q value Wide Peak Boundaries # Genes in Wide Peak
9p21.3 7.6904e-63 7.6904e-63 chr9:21970802-21981214 1
8p23.2 0.0011594 0.0011594 chr8:3505239-3737401 1
13q14.2 0.0034854 0.0034854 chr13:49113914-49266161 0 [RCBTB2]
16p13.3 0.0082178 0.0082178 chr16:6476002-7366868 1
2q22.1 0.010987 0.016249 chr2:141787423-141949672 1
19p13.3 0.019679 0.019679 chr19:1-2778999 108
3p14.2 0.052201 0.052201 chr3:59581001-61091910 1
1p36.11 0.11313 0.11313 chr1:25402240-25727000 5
18q21.32 0.14465 0.14465 chr18:58754141-58962000 0 [CDH20]
17p12 0.16492 0.16492 chr17:11721269-11964725 3
2q37.3 0.057713 0.16868 chr2:219464714-243199373 244
4q34.3 0.12038 0.18097 chr4:177612884-191154276 68
4q22.1 0.16868 0.18927 chr4:91476030-92192999 1
5q12.1 0.20034 0.20034 chr5:59237021-59578303 1
11q22.3 0.2218 0.2218 chr11:104210996-124275179 208
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 9p21.3.

Table S46.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
CDKN2A
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 8p23.2.

Table S47.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
CSMD1
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 16p13.3.

Table S48.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
RBFOX1
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 2q22.1.

Table S49.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
LRP1B
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 19p13.3.

Table S50.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
STK11
TCF3
FSTL3
hsa-mir-4321
hsa-mir-1909
hsa-mir-3187
hsa-mir-1302-11
AMH
ATP5D
AZU1
HCN2
BSG
CDC34
CIRBP
CNN2
CSNK1G2
CFD
ARID3A
EFNA2
ELANE
GAMT
GNG7
MKNK2
GPX4
GZMM
GADD45B
OAZ1
PALM
POLR2E
POLRMT
PRTN3
PTBP1
RPS15
SGTA
MADCAM1
SF3A2
PPAP2C
AP3D1
MED16
APC2
ABCA7
UQCR11
SBNO2
HMHA1
SHC2
TIMM13
DAZAP1
FGF22
SLC39A3
THEG
LSM7
MBD3
MIER2
PCSK4
C19orf24
PLEKHJ1
BTBD2
RNF126
SPPL2B
WDR18
REXO1
LPPR3
OR4F17
FAM108A1
KLF16
DOT1L
KISS1R
LMNB2
MUM1
MIDN
R3HDM4
C19orf6
TPGS1
REEP6
IZUMO4
SCAMP4
ADAT3
GRIN3B
JSRP1
MOB3A
C19orf21
PLK5
C2CD4C
CIRBP-AS1
C19orf25
ATP8B3
DIRAS1
C19orf26
CSNK1G2-AS1
ODF3L2
ADAMTSL5
TMPRSS9
NDUFS7
C19orf35
WASH5P
ONECUT3
MEX3D
FLJ45445
PRSS57
FAM138F
LINGO3
FAM138A
LOC100288123
MIR1909
MIR1227
MIR3187
MIR4321
MIR4745
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 3p14.2.

Table S51.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
FHIT
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 1p36.11.

Table S52.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
RHCE
RHD
TMEM50A
SYF2
C1orf63
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 17p12.

Table S53.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
MAP2K4
DNAH9
ZNF18
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 2q37.3.

Table S54.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ACSL3
PAX3
FEV
hsa-mir-3133
hsa-mir-149
hsa-mir-4269
hsa-mir-562
hsa-mir-1471
hsa-mir-1244-1
hsa-mir-4268
hsa-mir-3132
hsa-mir-153-1
hsa-mir-3131
AGXT
ALPI
ALPP
ALPPL2
KIF1A
BCS1L
BOK
CHRND
CHRNG
COL4A3
COL4A4
COL6A3
CRYBA2
CYP27A1
DES
DTYMK
EPHA4
GBX2
GPC1
GPR35
HDLBP
AGFG1
DNAJB2
HTR2B
SP110
IHH
INHA
INPP5D
IRS1
KCNJ13
NCL
NDUFA10
SEPT2
NEU2
NPPC
PDCD1
PDE6D
SERPINE2
PPP1R7
PSMD1
PTMA
PTPRN
SNORD20
SAG
CCL20
SLC4A3
SP100
SPP2
TUBA4A
WNT6
ZNF142
SCG2
CUL3
DGKD
STK16
PER2
CDK5R2
LRRFIP1
GPR55
TRIP12
ECEL1
EIF4E2
TTLL4
HDAC4
FARP2
FARSB
ABCB6
ARL4C
RAMP1
SPEG
NMUR1
STK25
COPS8
CAPN10
SP140
PASK
ATG4B
OBSL1
DNPEP
SH3BP4
KCNE4
NGEF
SNORD82
SNED1
GIGYF2
TRAF3IP1
CNPPD1
STK36
GMPPA
ANO7
PRLH
THAP4
ANKMY1
SCLY
ASB1
CAB39
PRKAG3
UGT1A10
UGT1A8
UGT1A7
UGT1A6
UGT1A5
UGT1A9
UGT1A4
UGT1A1
UGT1A3
PID1
ATG16L1
ANKZF1
USP40
HJURP
HES6
ACCN4
DOCK10
C2orf83
MFF
CXCR7
RNPEPL1
WDFY1
NYAP2
GAL3ST2
RAB17
RNF25
COPS7B
MRPL44
TRPM8
ATG9A
MLPH
FAM134A
GLB1L
CHPF
IQCA1
NHEJ1
FAM124B
TM4SF20
C2orf54
TUBA4B
ARMC9
EFHD1
SPHKAP
WNT10A
SLC19A3
ILKAP
ITM2C
RHBDD1
ING5
PLCD4
MGC16025
DNER
B3GNT7
SP140L
STK11IP
MOGAT1
AGAP1
TWIST2
DIS3L2
NEU4
AP1S3
SGPP2
SPATA3
TMEM198
ZFAND2B
FBXO36
MTERFD2
UBE2F
OTOS
MYEOV2
OR6B3
LOC150935
LOC151171
LOC151174
CCDC140
SLC23A3
LOC151300
SLC16A14
LOC151475
LINC00471
LOC151484
MSL3P1
WDR69
C2orf57
TIGD1
LOC200772
CCDC108
C2orf72
CXXC11
DUSP28
ESPNL
ECEL1P2
LOC348761
RBM44
AQP12A
KLHL30
RESP18
C2orf82
OR6B2
ASB18
FLJ43879
MIR149
MIR153-1
DNAJB3
MIR375
LOC643387
PRR21
LOC646736
PRSS56
AQP12B
SNORA75
SCARNA6
SCARNA5
D2HGDH
LOC728323
LOC100129175
PP14571
LOC100286922
MIR1471
MIR1244-1
BOK-AS1
MIR1244-3
MIR1244-2
MIR3131
MIR4268
MIR3132
MIR4269
UBE2F-SCLY
MIR4439
MIR4777
MIR2467
MIR4440
MIR4786
MIR4441
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 4q34.3.

Table S55.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
DUX4
hsa-mir-1305
AGA
SLC25A4
CASP3
DCTD
F11
ACSL1
FAT1
FRG1
HSP90AA4P
ING2
IRF2
KLKB1
MTNR1A
TLR3
VEGFC
SORBS2
FAM149A
DUX2
PDLIM3
CLDN22
NEIL3
UFSP2
CDKN2AIP
ODZ3
LRP2BP
STOX2
KIAA1430
TRAPPC11
MLF1IP
WWC2
SNX25
MGC45800
ZFP42
ENPP6
C4orf38
RWDD4
CCDC111
TRIML2
CCDC110
CYP4V2
LOC285441
LOC285501
LOC339975
TRIML1
ANKRD37
LOC389247
HELT
LOC401164
FAM92A3
C4orf47
DUX4L4
FRG2
SLED1
FLJ38576
DUX4L6
DUX4L5
DUX4L3
LINC00290
LOC728175
DUX4L2
LOC731424
CLDN24
LOC100288255
MIR1305
MIR3945
LOC100506229
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 4q22.1.

Table S56.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
FAM190A
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 5q12.1.

Table S57.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
PDE4D
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 11q22.3.

Table S58.  Genes in bold are cancer genes as defined by The Sanger Institute's Cancer Gene Census [7].

Genes
ATM
CBL
DDX6
DDX10
MLL
PAFAH1B2
POU2AF1
SDHD
PCSK7
ARHGEF12
hsa-mir-100
hsa-mir-4301
hsa-mir-34c
ACAT1
APOA1
APOA4
APOC3
ARCN1
FXYD2
CXCR5
CASP1
CASP4
CASP5
CD3D
CD3E
CD3G
CRYAB
DLAT
DPAGT1
DRD2
FDX1
SLC37A4
GRIA4
GRIK4
GUCY1A2
H2AFX
HMBS
HSPA8
HSPB2
HTR3A
IL10RA
IL18
VWA5A
MCAM
NCAM1
NNMT
NPAT
PPP2R1B
PTS
PVRL1
RDX
RPS25
SC5DL
SCN2B
SCN4B
SLN
SORL1
TAGLN
TECTA
THY1
UPK2
ZBTB16
ZNF202
CUL5
ZNF259
USP2
HTR3B
ZW10
UBE4A
C2CD2L
RBM7
MPZL2
HYOU1
ATP5L
TREH
CEP164
EXPH5
PHLDB1
SIK2
SIK3
BACE1
TRIM29
CADM1
POU2F3
HINFP
REXO2
OR8G2
OR8G1
TIMM8B
OR8B2
SIDT2
TRAPPC4
FXYD6
C11orf71
SLC35F2
RAB39A
BTG4
FAM55D
TTC12
C11orf57
ELMOD1
SCN3B
VPS11
TEX12
CRTAM
TMPRSS4
IFT46
DSCAML1
GRAMD1B
ARHGAP20
USP28
CARD18
AASDHPPT
ABCG4
C11orf1
RNF26
NLRX1
ALG9
CLMP
PDZD3
C11orf63
TMPRSS5
MFRP
BCO2
TMPRSS13
KIAA1826
BUD13
TMEM25
UBASH3B
DIXDC1
ZC3H12C
ALKBH8
FDXACB1
C11orf52
CARD16
C1QTNF5
APOA5
C11orf93
PIH1D2
FAM55A
FAM55B
AMICA1
KBTBD3
CWF19L2
KDELC2
LAYN
TTC36
C11orf65
MPZL3
OR8G5
OR10G8
OR10G9
OR10S1
OR6T1
OR4D5
TBCEL
TMEM136
OAF
ANKK1
RNF214
LOC283143
BCL9L
FOXR1
CCDC153
OR8D1
OR8D2
CCDC84
TMEM225
OR8D4
C11orf53
LOC341056
C11orf34
BSX
OR6X1
OR6M1
OR10G4
OR10G7
OR8B3
C11orf87
C11orf92
C11orf88
MIR100HG
MIRLET7A2
MIR100
MIR125B1
MIR34B
MIR34C
BLID
CARD17
LOC643733
LOC643923
CLDN25
LOC649133
RPL23AP64
LOC100132078
LOC100288346
BACE1-AS
MIR4301
LOC100499227
MIR3656
CASP12
LOC100526771
HSPB2-C11orf52
FXYD6-FXYD2
MIR4493
MIR4491
MIR4492
LOC100652768
Arm-level results

Table 3.  Get Full Table Arm-level significance table - 24 significant results found. The significance cutoff is at Q value=0.25.

Arm # Genes Amp Frequency Amp Z score Amp Q value Del Frequency Del Z score Del Q value
1p 2121 0.09 -0.609 1 0.12 1.1 0.361
1q 1955 0.31 14.6 0 0.09 -1.1 1
2p 924 0.13 -1.28 1 0.05 -6.28 1
2q 1556 0.09 -2.53 1 0.06 -4.3 1
3p 1062 0.12 -1.85 1 0.21 4.06 0.000121
3q 1139 0.22 5.03 1.22e-06 0.09 -3.26 1
4p 489 0.06 -6.54 1 0.18 0.562 0.574
4q 1049 0.03 -7.16 1 0.17 1.51 0.202
5p 270 0.26 4.58 1.05e-05 0.14 -2.23 1
5q 1427 0.09 -2.7 1 0.21 5.08 1.24e-06
6p 1173 0.11 -2.07 1 0.09 -3.17 1
6q 839 0.08 -4.91 1 0.16 0.225 0.783
7p 641 0.32 9.74 0 0.05 -6.29 1
7q 1277 0.26 7.94 7.99e-15 0.06 -4.68 1
8p 580 0.25 3.98 0.000136 0.39 13 0
8q 859 0.41 16.2 0 0.11 -2.35 1
9p 422 0.14 -2.04 1 0.27 5.58 1.22e-07
9q 1113 0.11 -1.95 1 0.23 5.42 2.43e-07
10p 409 0.16 -1.33 1 0.13 -2.61 1
10q 1268 0.09 -2.88 1 0.15 0.81 0.522
11p 862 0.08 -4.82 1 0.17 0.649 0.544
11q 1515 0.10 -1.68 1 0.17 2.97 0.00596
12p 575 0.18 0.616 0.769 0.10 -4.12 1
12q 1447 0.12 -0.475 1 0.07 -3.94 1
13q 654 0.20 1.77 0.141 0.18 0.757 0.528
14q 1341 0.10 -2.47 1 0.15 0.691 0.544
15q 1355 0.04 -5.78 1 0.19 3.99 0.000146
16p 872 0.12 -2.05 1 0.10 -3.32 1
16q 702 0.10 -3.98 1 0.15 -0.896 1
17p 683 0.11 -2.99 1 0.33 10.4 0
17q 1592 0.15 1.55 0.201 0.10 -1.4 1
18p 143 0.14 -2.53 1 0.27 4.73 6.27e-06
18q 446 0.07 -5.63 1 0.35 10.5 0
19p 995 0.10 -3 1 0.17 1.28 0.287
19q 1709 0.14 1.43 0.235 0.12 0.105 0.833
20p 355 0.31 7.9 8.88e-15 0.14 -2.24 1
20q 753 0.35 12.1 0 0.08 -4.09 1
21q 509 0.10 -4.52 1 0.15 -1.42 1
22q 921 0.06 -5.38 1 0.19 2.36 0.0307
Xq 1312 0.22 5.17 6.69e-07 0.18 2.39 0.0307
Methods & Data
Input
Description
  • Segmentation File: The segmentation file contains the segmented data for all the samples identified by GLAD, CBS, or some other segmentation algorithm. (See GLAD file format in the Genepattern file formats documentation.) It is a six column, tab-delimited file with an optional first line identifying the columns. Positions are in base pair units.The column headers are: (1) Sample (sample name), (2) Chromosome (chromosome number), (3) Start Position (segment start position, in bases), (4) End Position (segment end position, in bases), (5) Num markers (number of markers in segment), (6) Seg.CN (log2() -1 of copy number).

  • Markers File: The markers file identifies the marker names and positions of the markers in the original dataset (before segmentation). It is a three column, tab-delimited file with an optional header. The column headers are: (1) Marker Name, (2) Chromosome, (3) Marker Position (in bases).

  • Reference Genome: The reference genome file contains information about the location of genes and cytobands on a given build of the genome. Reference genome files are created in Matlab and are not viewable with a text editor.

  • CNV Files: There are two options for the cnv file. The first option allows CNVs to be identified by marker name. The second option allows the CNVs to be identified by genomic location. Option #1: A two column, tab-delimited file with an optional header row. The marker names given in this file must match the marker names given in the markers file. The CNV identifiers are for user use and can be arbitrary. The column headers are: (1) Marker Name, (2) CNV Identifier. Option #2: A 6 column, tab-delimited file with an optional header row. The 'CNV Identifier' is for user use and can be arbitrary. 'Narrow Region Start' and 'Narrow Region End' are also not used. The column headers are: (1) CNV Identifier, (2) Chromosome, (3) Narrow Region Start, (4) Narrow Region End, (5) Wide Region Start, (6) Wide Region End

  • Amplification Threshold: Threshold for copy number amplifications. Regions with a log2 ratio above this value are considered amplified.

  • Deletion Threshold: Threshold for copy number deletions. Regions with a log2 ratio below the negative of this value are considered deletions.

  • Cap Values: Minimum and maximum cap values on analyzed data. Regions with a log2 ratio greater than the cap are set to the cap value; regions with a log2 ratio less than -cap value are set to -cap. Values must be positive.

  • Broad Length Cutoff: Threshold used to distinguish broad from focal events, given in units of fraction of chromosome arm.

  • Remove X-Chromosome: Flag indicating whether to remove data from the X-chromosome before analysis. Allowed values= {1,0} (1: Remove X-Chromosome, 0: Do not remove X-Chromosome.

  • Confidence Level: Confidence level used to calculate the region containing a driver.

  • Join Segment Size: Smallest number of markers to allow in segments from the segmented data. Segments that contain fewer than this number of markers are joined to the neighboring segment that is closest in copy number.

  • Arm Level Peel Off: Flag set to enable arm-level peel-off of events during peak definition. The arm-level peel-off enhancement to the arbitrated peel-off method assigns all events in the same chromosome arm of the same sample to a single peak. It is useful when peaks are split by noise or chromothripsis. Allowed values= {1,0} (1: Use arm level peel off, 0: Use normal arbitrated peel-off).

  • Maximum Sample Segments: Maximum number of segments allowed for a sample in the input data. Samples with more segments than this threshold are excluded from the analysis.

  • Gene GISTIC: When enabled (value = 1), this option causes GISTIC to analyze deletions using genes instead of array markers to locate the lesion. In this mode, the copy number assigned to a gene is the lowest copy number among the markers that represent the gene.

Values

List of inputs used for this run of GISTIC2. All files listed should be included in the archived results.

  • Segmentation File = /xchip/cga/gdac-prod/tcga-gdac/jobResults/PrepareGisticDNASeq/PANCAN12-TP/6154890/segmentationfile.txt

  • Markers File = /xchip/cga/gdac-prod/tcga-gdac/jobResults/PrepareGisticDNASeq/PANCAN12-TP/6154890/markersfile.txt

  • Reference Genome = /xchip/cga/reference/gistic2/hg19_with_miR_20120227.mat

  • CNV Files = /xchip/cga/reference/gistic2/CNV.hg19.bypos.111213.txt

  • Amplification Threshold = 0.3

  • Deletion Threshold = 0.3

  • Cap Values = 2

  • Broad Length Cutoff = 0.5

  • Remove X-Chromosome = 0

  • Confidence Level = 0.99

  • Join Segment Size = 10

  • Arm Level Peel Off = 1

  • Maximum Sample Segments = 10000

  • Gene GISTIC = 0

Table 4.  Get Full Table First 10 out of 568 Input Tumor Samples.

Tumor Sample Names
TCGA-05-4249-01A-01D-1103-02
TCGA-05-4250-01A-01D-1103-02
TCGA-05-4382-01A-01D-1203-02
TCGA-05-4384-01A-01D-1751-02
TCGA-05-4389-01A-01D-1203-02
TCGA-05-4390-01A-02D-1751-02
TCGA-05-4395-01A-01D-1203-02
TCGA-05-4396-01A-21D-1853-02
TCGA-05-4397-01A-01D-1203-02
TCGA-05-4398-01A-01D-1203-02

Figure 3.  Segmented copy number profiles in the input data

Output
All Lesions File (all_lesions.conf_##.txt, where ## is the confidence level)

The all lesions file summarizes the results from the GISTIC run. It contains data about the significant regions of amplification and deletion as well as which samples are amplified or deleted in each of these regions. The identified regions are listed down the first column, and the samples are listed across the first row, starting in column 10.

Region Data

Columns 1-9 present the data about the significant regions as follows:

  1. Unique Name: A name assigned to identify the region.

  2. Descriptor: The genomic descriptor of that region.

  3. Wide Peak Limits: The 'wide peak' boundaries most likely to contain the targeted genes. These are listed in genomic coordinates and marker (or probe) indices.

  4. Peak Limits: The boundaries of the region of maximal amplification or deletion.

  5. Region Limits: The boundaries of the entire significant region of amplification or deletion.

  6. Q values: The Q value of the peak region.

  7. Residual Q values: The Q value of the peak region after removing ('peeling off') amplifications or deletions that overlap other, more significant peak regions in the same chromosome.

  8. Broad or Focal: Identifies whether the region reaches significance due primarily to broad events (called 'broad'), focal events (called 'focal'), or independently significant broad and focal events (called 'both').

  9. Amplitude Threshold: Key giving the meaning of values in the subsequent columns associated with each sample.

Sample Data

Each of the analyzed samples is represented in one of the columns following the lesion data (columns 10 through end). The data contained in these columns varies slightly by section of the file. The first section can be identified by the key given in column 9 - it starts in row 2 and continues until the row that reads 'Actual Copy Change Given.' This section contains summarized data for each sample. A '0' indicates that the copy number of the sample was not amplified or deleted beyond the threshold amount in that peak region. A '1' indicates that the sample had low-level copy number aberrations (exceeding the low threshold indicated in column 9), and a '2' indicates that the sample had high-level copy number aberrations (exceeding the high threshold indicated in column 9).The second section can be identified the rows in which column 9 reads 'Actual Copy Change Given.' The second section exactly reproduces the first section, except that here the actual changes in copy number are provided rather than zeroes, ones, and twos.The final section is similar to the first section, except that here only broad events are included. A 1 in the samples columns (columns 10+) indicates that the median copy number of the sample across the entire significant region exceeded the threshold given in column 9. That is, it indicates whether the sample had a geographically extended event, rather than a focal amplification or deletion covering little more than the peak region.

Amplification Genes File (amp_genes.conf_##.txt, where ## is the confidence level)

The amp genes file contains one column for each amplification peak identified in the GISTIC analysis. The first four rows are:

  1. Cytoband

  2. Q value

  3. Residual Q value

  4. Wide Peak Boundaries

These rows identify the lesion in the same way as the all lesions file.The remaining rows list the genes contained in each wide peak. For peaks that contain no genes, the nearest gene is listed in brackets.

Deletion Genes File (del_genes.conf_##.txt, where ## is the confidence level)

The del genes file contains one column for each deletion peak identified in the GISTIC analysis. The file format for the del genes file is identical to the format for the amp genes file.

Gistic Scores File (scores.gistic)

The scores file lists the Q values [presented as -log10(q)], G scores, average amplitudes among aberrant samples, and frequency of aberration, across the genome for both amplifications and deletions. The scores file is viewable with the Genepattern SNPViewer module and may be imported into the Integrated Genomics Viewer (IGV).

Segmented Copy Number (raw_copy_number.{fig|pdf|png} )

The segmented copy number is a pdf file containing a colormap image of the segmented copy number profiles in the input data.

Amplification Score GISTIC plot (amp_qplot.{fig|pdf|png|v2.pdf})

The amplification pdf is a plot of the G scores (top) and Q values (bottom) with respect to amplifications for all markers over the entire region analyzed.

Deletion Score GISTIC plot (del_qplot.{fig|pdf|png|v2.pdf})

The deletion pdf is a plot of the G scores (top) and Q values (bottom) with respect to deletions for all markers over the entire region analyzed.

Tables (table_{amp|del}.conf_##.txt, where ## is the confidence level)

Tables of basic information about the genomic regions (peaks) that GISTIC determined to be significantly amplified or deleted. These describe three kinds of peak boundaries, and list the genes contained in two of them. The region start and region end columns (along with the chromosome column) delimit the entire area containing the peak that is above the significance level. The region may be the same for multiple peaks. The peak start and end delimit the maximum value of the peak. The extended peak is the peak determined by robust, and is contained within the wide peak reported in {amp|del}_genes.txt by one marker.

Broad Significance Results (broad_significance_results.txt)

A table of per-arm statistical results for the data set. Each arm is a row in the table. The first column specifies the arm and the second column counts the number of genes known to be on the arm. For both amplification and deletion, the table has columns for the frequency of amplification or deletion of the arm, and a Z score and Q value.

Broad Values By Arm (broad_values_by_arm.txt)

A table of chromosome arm amplification levels for each sample. Each row is a chromosome arm, and each column a sample. The data are in units of absolute copy number -2.

All Data By Genes (all_data_by_genes.txt)

A gene-level table of copy number values for all samples. Each row is the data for a gene. The first three columns name the gene, its NIH locus ID, and its cytoband - the remaining columns are the samples. The copy number values in the table are in units of (copy number -2), so that no amplification or deletion is 0, genes with amplifications have positive values, and genes with deletions are negative values. The data are converted from marker level to gene level using the extreme method: a gene is assigned the greatest amplification or the least deletion value among the markers it covers.

Broad Data By Genes (broad_data_by_genes.txt)

A gene-level table of copy number data similar to the all_data_by_genes.txt output, but using only broad events with lengths greater than the broad length cutoff. The structure of the file and the methods and units used for the data analysis are otherwise identical to all_data_by_genes.txt.

Focal Data By Genes (focal_data_by_genes.txt)

A gene-level table of copy number data similar to the all_data_by_genes.txt output, but using only focal events with lengths greater than the focal length cutoff. The structure of the file and the methods and units used for the data analysis are otherwise identical to all_data_by_genes.txt.

All Thresholded By Genes (all_thresholded.by_genes.txt)

A gene-level table of discrete amplification and deletion indicators at for all samples. There is a row for each gene. The first three columns name the gene, its NIH locus ID, and its cytoband - the remaining columns are the samples. A table value of 0 means no amplification or deletion above the threshold. Amplifications are positive numbers: 1 means amplification above the amplification threshold; 2 means amplifications larger to the arm level amplifications observed for the sample. Deletions are represented by negative table values: -1 represents deletion beyond the threshold; -2 means deletions greater than the minimum arm-level deletion observed for the sample.

Sample Cutoffs (sample_cutoffs.txt)

A table of the per-sample threshold cutoffs (in units of absolute copy number -2) used to distinguish the high level amplifications (+/-2) from ordinary amplifications (+/-1) in the all_thresholded.by_genes.txt output file. The table contains three columns: the sample identifier followed by the low (deletion) and high (amplification) cutoff values. The cutoffs are calculated as the minimum arm-level amplification level less the deletion threshold for deletions and the maximum arm-level amplification plus the amplification threshold for amplifications.

Focal Input To Gistic (focal_input.seg.txt)

A list of copy number segments describing just the focal events present in the data. The segment amplification/deletion levels are in units of (copy number -2), with amplifications positive and deletions negative numbers. This file may be viewed with IGV.

Gene Counts vs. Copy Number Alteration Frequency (freqarms_vs_ngenes.{fig|pdf})

An image showing the correlation between gene counts and frequency of copy number alterations.

Confidence Intervals (regions_track.conf_##.bed, where ## is the confidence level)

A file indicating the position of the confidence intervals around GISTIC peaks that can be loaded as a track in a compatible viewer browser such as IGV or the UCSC genome browser.

GISTIC

GISTIC identifies genomic regions that are significantly gained or lost across a set of tumors. It takes segmented copy number ratios as input, separates arm-level events from focal events, and then performs two tests: (i) identifies significantly amplified/deleted chromosome arms; and (ii) identifies regions that are significantly focally amplified or deleted. For the focal analysis, the significance levels (Q values) are calculated by comparing the observed gains/losses at each locus to those obtained by randomly permuting the events along the genome to reflect the null hypothesis that they are all 'passengers' and could have occurred anywhere. The locus-specific significance levels are then corrected for multiple hypothesis testing. The arm-level significance is calculated by comparing the frequency of gains/losses of each arm to the expected rate given its size. The method outputs genomic views of significantly amplified and deleted regions, as well as a table of genes with gain or loss scores. A more in depth discussion of the GISTIC algorithm and its utility is given in [1], [3], and [5].

CNV Description

Regions of the genome that are prone to germ line variations in copy number are excluded from the GISTIC analysis using a list of germ line copy number variations (CNVs). A CNV is a DNA sequence that may be found at different copy numbers in the germ line of two different individuals. Such germ line variations can confound a GISTIC analysis, which finds significant somatic copy number variations in cancer. A more in depth discussion is provided in [6]. GISTIC currently uses two CNV exclusion lists. One is based on the literature describing copy number variation, and a second one comes from an analysis of significant variations among the blood normals in the TCGA data set.

Download Results

In addition to the links below, the full results of the analysis summarized in this report can also be downloaded programmatically using firehose_get, or interactively from either the Broad GDAC website or TCGA Data Coordination Center Portal.

References
[1] Beroukhim et al, Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma, Proc Natl Acad Sci U S A. Vol. 104:50 (2007)
[3] Mermel et al, GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers, Genome Biology Vol. 12:4 (2011)
[5] Beroukhim et al., The landscape of somatic copy-number alteration across human cancers, Nature Vol. 463:7283 (2010)
[6] McCarroll, S. A. et al., Integrated detection and population-genetic analysis of SNPs and copy number variation, Nat Genet Vol. 40(10):1166-1174 (2008)