LowPass Copy number analysis (GISTIC2)
Stomach and Esophageal carcinoma (Primary solid tumor)
28 January 2016  |  analyses__2016_01_28
Maintainer Information
Citation Information
Maintained by Spring Yingchun Liu (Broad Institute)
Cite as Broad Institute TCGA Genome Data Analysis Center (2016): LowPass Copy number analysis (GISTIC2). Broad Institute of MIT and Harvard. doi:10.7908/C1SB457C
Overview
Introduction

GISTIC identifies genomic regions that are significantly gained or lost across a set of tumors. The pipeline first filters out normal samples from the segmented copy-number data by inspecting the TCGA barcodes and then executes GISTIC version 2.0.21 (Firehose task version: 127).

Summary

There were 158 tumor samples used in this analysis: 18 significant arm-level results, 18 significant focal amplifications, and 14 significant focal deletions were found.

Results
Focal results

Figure 1.  Genomic positions of amplified regions: the X-axis represents the normalized amplification signals (top) and significance by Q value (bottom). The green line represents the significance cutoff at Q value=0.25.

Table 1.  Get Full Table Amplifications Table - 18 significant amplifications found. Click the link in the last column to view a comprehensive list of candidate genes. If no genes were identified within the peak, the nearest gene appears in brackets.

Cytoband Q value Residual Q value Wide Peak Boundaries # Genes in Wide Peak
11q13.3 8.2836e-29 8.2836e-29 chr11:69380205-70286916 10
8q24.21 2.4045e-10 2.4045e-10 chr8:128086473-128865434 6
19q12 1.4237e-09 1.4237e-09 chr19:30134333-30461073 4
7p11.2 2.2362e-08 2.2362e-08 chr7:55036385-55544134 3
7q21.2 2.245e-06 2.245e-06 chr7:91087401-96704915 44
12q15 9.0283e-06 9.0283e-06 chr12:68032120-70935076 24
8p23.1 3.5087e-06 1.5703e-05 chr8:10975654-12619253 30
12p12.1 1.6753e-05 1.6753e-05 chr12:24832433-25872659 8
20q13.2 0.00051045 0.00051045 chr20:51594458-53180923 8
18q11.2 0.0011216 0.0011216 chr18:19444890-20134873 6
17q21.2 0.0017298 0.0017298 chr17:37156335-40008247 89
6p21.1 0.0042083 0.0042083 chr6:43508397-44052299 9
1q42.3 0.014416 0.014416 chr1:234848273-235264530 2
11p13 0.04582 0.04582 chr11:34942364-35672424 6
10q26.13 0.10255 0.10255 chr10:122709888-123893564 5
13q22.1 0.11748 0.11748 chr13:73591888-74325003 5
15q26.1 0.17568 0.17568 chr15:87285740-102531392 116
8p11.21 0.19643 0.22461 chr8:40751970-42819369 28
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 11q13.3.

Table S1.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
CCND1
MIR548K
FGF3
CTTN
FGF4
PPFIA1
FADD
FGF19
ANO1
ORAOV1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8q24.21.

Table S2.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
MYC
POU5F1B
CASC8
CCAT1
PCAT2
PVT1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 19q12.

Table S3.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
C19orf12
CCNE1
URI1
PLEKHF1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 7p11.2.

Table S4.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
EGFR
LANCL2
VOPP1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 7q21.2.

Table S5.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
CDK6
AKAP9
RN7SL252P
SHFM1
MIR591
PON1
snoU13|ENSG00000238384.1
RN7SKP129
GNG11
MIR489
MIR653
RN7SL7P
GATAD1
CALCR
KRIT1
COL1A2
CYP51A1
DLX5
DLX6
DYNC1I1
GNGT1
PDK4
PEX1
PON2
PON3
MTERF
TFPI2
SGCE
SLC25A13
BET1
PEG10
ASB4
ANKIB1
SAMD9
PPP1R9A
CCDC132
CASD1
RBM48
SAMD9L
HEPACAM2
FAM133B
LRRD1
C7orf76
MIR4652
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 12q15.

Table S6.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
MDM2
MYRFL
RN7SL804P
SLC35E3
SNORA70G
DYRK2
CPM
IFNG
LYZ
CNOT2
PTPRB
RAP1B
YEATS4
CCT2
FRS2
CPSF6
KCNMB4
IL22
IL26
MDM1
NUP107
RAB3IP
BEST3
LRRC10
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8p23.1.

Table S7.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
FAM86B2
FAM66A
RNA5SP254
FAM66D
RNA5SP253
DEFB130|ENSG00000233050.1
DEFB134
C8orf49
LINC00208
RN7SL293P
C8orf12
LINC00529
BLK
CTSB
FDFT1
GATA4
MTMR9
FAM167A
SLC35G5
FAM86B1
LONRF1
TDH
DEFB130|ENSG00000232948.1
NEIL2
XKR6
USP17L2
DEFB135
DEFB136
ZNF705D
MIR5692A2
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 12p12.1.

Table S8.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
KRAS
RN7SL38P
BCAT1
LRMP
CASC1
LYRM5
IFLTD1
C12orf77
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 20q13.2.

Table S9.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
RN7SKP184
CYP24A1
PFDN4
ZNF217
BCAS1
DOK5
TSHZ2
MIR4756
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 18q11.2.

Table S10.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
snoU13|ENSG00000238907.1
RNU6ATAC20P
RNA5SP451
GATA6
MIB1
CTAGE1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 17q21.2.

Table S11.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
RARA
ERBB2
RN7SL871P
RN7SL399P
RNA5SP442
EIF1
KRT42P
LINC00974
KRT223P
KRT222
KRT222
RNA5SP441
GJD3
RNY4P8
SNORD124
IKZF3
MIR4728
PNMT
TCAP
NEUROD2
ARL5C
CACNB1
CDC6
CCR7
CSF3
GAST
GRB7
IGFBP4
JUP
KRT9
KRT10
KRT12
KRT13
KRT14
KRT15
KRT16
KRT17
KRT19
KRT31
KRT32
KRT33A
KRT33B
KRT34
KRT35
MED1
PSMD3
RPL19
SMARCE1
THRA
TOP2A
KRT38
KRT37
KRT36
HAP1
NR1D1
MED24
LEPREL4
STARD3
CASC3
KRT23
RAPGEFL1
CDK12
KRT20
GSDMB
PLXDC1
FKBP10
PPP1R1B
MIEN1
TNS4
FBXL20
PGAP3
ORMDL3
NT5C3B
ZPBP2
KRT40
WIPF2
KRT25
TMEM99
KRT28
KRT24
GSDMA
KLHL10
MSL1
KRT27
LRRC37A11P
STAC2
KRT26
KRT39
LRRC3C
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 6p21.1.

Table S12.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
RSPH9
SCARNA15|ENSG00000252218.1
POLH
VEGFA
MAD2L1BP
GTPBP2
MRPS18A
XPO5
C6orf223
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 1q42.3.

Table S13.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
RN7SL668P
RNY4P16
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 11p13.

Table S14.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
MIR1343
CD44
SLC1A2
PDHX
FJX1
PAMR1
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 10q26.13.

Table S15.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
FGFR2
RN7SKP167
TACC2
ATE1
NSMCE4A
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 13q22.1.

Table S16.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
LINC00392
LINC00393
RNY1P8
KLF5
KLF12
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 15q26.1.

Table S17.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
BLM
IDH2
NTRK3
CRTC3
DDX11L9
WASH3P
FAM138E
OR4F13P
RN7SL209P
DNM1P47
snoU13|ENSG00000238502.1
LINS
RNA5SP402
RN7SL484P
DKFZP779J2370
MIR4714
RNA5SP401
RN7SL677P
RN7SKP181
RN7SKP254
MIR1469
LINC00924
MIR3175
RN7SL599P
LINC00930
snoU109|ENSG00000239197.1
snoU13|ENSG00000238981.1
RN7SL363P
SNORD18|ENSG00000200677.1
ZNF774
GABARAPL3
RN7SL736P
CIB1
RN7SL346P
RN7SL755P
C15orf38
MIR5009
MIR5094
MESP1
LINC00928
LINC00925
ISG20
ACAN
ALDH1A3
ANPEP
CHD2
FES
IGF1R
MAN2A2
MEF2A
MFGE8
FURIN
PCSK6
PLIN1
POLG
RLBP1
SNRPA1
NR2F2
ST8SIA2
PEX11A
IQGAP1
PRC1
SV2B
AP3S2
SEMA4B
ABHD2
CHSY1
SYNM
VPS33B
MRPL46
OR4F4
SLCO3A1
NGRN
RHCG
DET1
FANCI
MCTP2
VIMP
UNC45A
RGMA
WDR93
AEN
TTC23
MRPS11
LRRK1
TM2D3
TICRR
RCCD1
ARRDC4
LINC00923
TARSL2
LRRC28
AGBL1
ASB7
LYSMD4
PGPEP1L
C15orf32
HAPLN3
MESP2
SPATA8
LINC00052
ADAMTS17
DNM1P46
CERS3
FAM169B
KIF7
ZNF710
HDDC3
SPATA41
GDPGP1
OR4F6
OR4F15
FAM174B
TTLL13
MIR1179
MIR3174
Genes in Wide Peak

This is the comprehensive list of amplified genes in the wide peak for 8p11.21.

Table S18.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
HOOK3
RN7SL806P
snoU13|ENSG00000238714.1
SNORD112|ENSG00000238966.1
RN7SL149P
MIR486
SNORD65|ENSG00000238936.1
MIR548AO
ANK1
CHRNB3
IKBKB
PLAT
POLB
SFRP1
SLC20A2
VDAC3
KAT6A
CHRNA6
AP3M2
DKK4
GOLGA7
THAP1
ZMAT4
RNF170
GINS4
SMIM19
AGPAT6
MIR4469

Figure 2.  Genomic positions of deleted regions: the X-axis represents the normalized deletion signals (top) and significance by Q value (bottom). The green line represents the significance cutoff at Q value=0.25.

Table 2.  Get Full Table Deletions Table - 14 significant deletions found. Click the link in the last column to view a comprehensive list of candidate genes. If no genes were identified within the peak, the nearest gene appears in brackets.

Cytoband Q value Residual Q value Wide Peak Boundaries # Genes in Wide Peak
9p21.3 5.9721e-38 3.3158e-37 chr9:21957042-21983214 2
16q23.1 0.00018861 0.00018861 chr16:78585111-78667886 1
3p14.2 0.0010953 0.0021153 chr3:61026842-61080715 1
7q36.3 0.0021577 0.0021577 chr7:143818945-159138663 124
5q12.1 0.0029815 0.0029815 chr5:58920396-59451270 2
6p25.3 0.0037452 0.0037452 chr6:1867395-2146300 1
4q34.3 0.0043886 0.011749 chr4:179873082-191154276 64
9p23 0.0034992 0.015499 chr9:9432336-9654796 2
2q37.2 0.023567 0.039437 chr2:236493901-236989888 2
18q21.2 0.066933 0.066933 chr18:39530588-78077248 180
3p11.1 0.049649 0.1603 chr3:90309030-93822733 5
4q22.1 0.024853 0.16266 chr4:91690999-91931333 1
2q22.1 0.093934 0.18602 chr2:141496007-142144972 1
10q23.31 0.19046 0.19046 chr10:89484732-89790832 7
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 9p21.3.

Table S19.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
CDKN2A
C9orf53
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 16q23.1.

Table S20.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
WWOX
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 3p14.2.

Table S21.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
FHIT
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 7q36.3.

Table S22.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
EZH2
LINC00689
MIR595
RN7SL142P
SHH
RN7SKP280
HTR5A
RN7SL845P
RN7SL811P
SNORA26|ENSG00000212590.1
snoU13|ENSG00000238557.1
RNA5SP250
FABP5P3
snoU13|ENSG00000239045.1
RN7SL76P
MIR3907
MIR671
IQCA1P1
CDK5
SSPO
SNORD112|ENSG00000252557.1
RN7SL521P
RNY1
RNY3
RNY4
RNY5
RN7SL569P
RN7SL72P
U3|ENSG00000199370.1
RN7SL456P
RNA5SP249
RN7SL207P
RN7SKP174
RNU6ATAC40P
OR2A9P
OR2A20P
AOC1
DPP6
EN2
GBX1
MNX1
INSIG1
KCNH2
NOS3
PTPRN2
RARRES2
RHEB
SLC4A2
SMARCD3
VIPR2
XRCC2
ARHGEF5
ZNF212
ZNF282
CUL1
ASIC3
PDIA4
UBE3C
DNAJB6
ABCF2
FASTK
ABCB8
PAXIP1
CNTNAP2
GIMAP2
TPK1
ZNF777
TMEM176B
REPIN1
PRKAG2
NUB1
CHPF2
NCAPG2
WDR60
GIMAP4
GIMAP5
TMEM176A
ACTR3B
ESYT2
ZNF398
KMT2C
GALNT11
LMBR1
NOM1
LRRC61
ZNF767
TMUB1
KRBA1
ZBED6CL
AGAP3
C7orf13
NOBOX
OR2A14
ZNF786
ASB10
RNF32
GIMAP8
CRYGN
ZNF425
ZNF746
ATP6V0E2
RBM33
GALNTL5
GIMAP7
ZNF467
GIMAP1
C7orf33
CNPY1
ZNF775
ATG9B
BLACE
OR2A1
WDR86
OR2A7
OR2A42
ARHGEF35
GIMAP6
ZNF862
ACTR3C
CTAGE4
CTAGE8
ZNF783
MIR548F4
MIR5707
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 5q12.1.

Table S23.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
MIR582
PDE4D
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 6p25.3.

Table S24.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
GMDS
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 4q34.3.

Table S25.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
DUX4
RNA5SP175
RNA5SP174
HSP90AA4P
CYP4V2
SNORA31|ENSG00000253013.1
TLR3
snoU13|ENSG00000239034.1
PDLIM3
UFSP2
SNORD79
RN7SL28P
snoU13|ENSG00000239116.1
snoU13|ENSG00000238596.1
snoU13|ENSG00000238319.1
U3|ENSG00000252048.1
FAM92A1P2
RN7SKP67
RN7SKP13
LINC00290
SLC25A4
CASP3
DCTD
F11
ACSL1
FAT1
FRG1
ING2
IRF2
KLKB1
MTNR1A
SORBS2
FAM149A
CLDN22
CDKN2AIP
TENM3
LRP2BP
STOX2
KIAA1430
TRAPPC11
MLF1IP
WWC2
SNX25
ZFP42
ENPP6
RWDD4
CCDC111
TRIML2
CCDC110
TRIML1
ANKRD37
HELT
C4orf47
DUX4L4
FRG2
DUX4L7
DUX4L6
DUX4L5
DUX4L3
DUX4L2
CLDN24
MIR1305
MIR3945
MIR4455
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 9p23.

Table S26.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
RN7SL5P
PTPRD
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 2q37.2.

Table S27.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
RN7SL204P
AGAP1
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 18q21.2.

Table S28.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
BCL2
MALT1
RBFA
RBFADN
SALL3
RNA5SP461
SNORA25|ENSG00000199392.1
GALR1
LINC00683
ZNF516
SMIM21
ZADH2
RN7SL551P
RN7SL401P
MIR548AV
RNA5SP460
RN7SL795P
snoU13|ENSG00000238988.1
RN7SL705P
RNF152
RN7SL342P
OACYLP
U8|ENSG00000199713.1
RN7SL112P
SNORD28|ENSG00000252284.1
FECH
U3|ENSG00000212539.1
TXNL1
SNORA73|ENSG00000201816.1
RNA5SP459
SNORA37
DCC
snoU13|ENSG00000238885.1
RN7SL695P
ME2
RNA5SP458
RN7SL310P
RNA5SP457
SCARNA18|ENSG00000252139.1
SCARNA17|ENSG00000251992.1
SCARNA17|ENSG00000267322.1
SNORD58B
SNORD58A
SNORD58C
C18orf32
MIR4744
MIR4743
RNA5SP456
IER3IP1
RNF165
RN7SKP26
RNA5SP455
RNA5SP454
LINC00907
PIK3C3
ATP5A1
CDH7
CYB5A
KDSR
GRP
LMAN1
SMAD2
SMAD4
SMAD7
MBD1
MBP
MC4R
MYO5B
NARS
NFATC1
SERPINB2
ATP8B1
SERPINB5
SERPINB8
SERPINB10
SERPINB13
PMAIP1
MAPK4
RAB27B
RIT2
RPL17
SERPINB3
SERPINB4
SLC14A1
SYT4
TCF4
ZNF236
SLC14A2
SERPINB7
TNFRSF11A
MBD2
PSTPIP2
PIAS2
CTDP1
SOCS6
LIPG
ONECUT2
VPS4B
CTIF
TSHZ1
ACAA2
CD226
TXNL4A
POLI
ADNP2
PHLPP1
NEDD4L
WDR7
PIGN
RTTN
SETBP1
KCNG2
CDH20
CDH19
TIMM21
ST8SIA5
RAX
CXXC1
ST8SIA3
IER3IP1
TCEB3B
MEX3C
TMX3
DYM
ZCCHC2
ZNF532
ELAC1
ZNF407
CNDP2
KIAA1468
EPG5
CCDC102B
PQLC1
CCDC68
NETO1
KATNAL2
MRO
HDHD2
PARD6G
CNDP1
SERPINB12
SERPINB11
SEC11C
DSEL
HAUS1
ALPK2
LOXHD1
FAM69C
STARD6
C18orf25
CCBE1
CBLN2
C18orf54
TCEB3C
FBXO15
ZBTB7C
SKA1
CCDC11
GTSCR1
DOK6
LINC00305
DYNAP
BOD1L2
SIGLEC15
LINC00908
HMSD
CPLX4
ATP9B
LINC00909
MIR122
HSBP1L1
C18orf63
SKOR2
TCEB3CL
MIR1539
MIR4319
TCEB3CL2
MIR4527
MIR4529
MIR5011
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 3p11.1.

Table S29.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
NSUN3
PROS1
ARL13B
DHFRL1
STX19
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 4q22.1.

Table S30.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
CCSER1
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 2q22.1.

Table S31.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
LRP1B
Genes in Wide Peak

This is the comprehensive list of deleted genes in the wide peak for 10q23.31.

Table S32.  Genes in bold are cancer genes as defined by The Sanger Institute: Cancer Gene Census[7].

Genes
PTEN
SNORD74|ENSG00000200891.1
RN7SL78P
CFL1P1
PAPSS2
ATAD1
KLLN
Arm-level results

Table 3.  Get Full Table Arm-level significance table - 18 significant results found. The significance cutoff is at Q value=0.25.

Arm # Genes Amp Frequency Amp Z score Amp Q value Del Frequency Del Z score Del Q value
1p 1300 0.06 -0.457 1 0.09 1.11 0.384
1q 1195 0.19 5.22 6.11e-07 0.09 0.663 0.534
2p 624 0.12 -0.955 1 0.03 -3.89 1
2q 967 0.09 -0.732 1 0.04 -2.46 1
3p 644 0.03 -3.28 1 0.28 4.9 1.91e-05
3q 733 0.21 2.62 0.0193 0.13 0.0229 0.892
4p 289 0.05 -3.88 1 0.22 1.29 0.33
4q 670 0.02 -4.02 1 0.19 1.95 0.143
5p 183 0.23 0.902 0.565 0.17 -0.862 1
5q 905 0.03 -2.85 1 0.20 3.57 0.00177
6p 710 0.06 -2.51 1 0.09 -1.43 1
6q 556 0.05 -3.38 1 0.11 -1.35 1
7p 389 0.37 6.54 3.11e-10 0.04 -3.52 1
7q 783 0.32 7.29 2.09e-12 0.05 -2.5 1
8p 338 0.34 4.69 7.77e-06 0.26 2.34 0.0643
8q 551 0.48 11.1 0 0.13 -0.676 1
9p 301 0.10 -2.36 1 0.30 3.69 0.00148
9q 700 0.12 -0.373 1 0.15 0.485 0.598
10p 253 0.17 -0.47 1 0.18 -0.289 1
10q 738 0.07 -2.18 1 0.16 1.11 0.384
11p 509 0.07 -2.79 1 0.18 0.797 0.472
11q 975 0.09 -0.582 1 0.14 1.64 0.225
12p 339 0.22 1.36 0.313 0.10 -2.18 1
12q 904 0.10 -0.346 1 0.08 -1.3 1
13q 560 0.20 1.53 0.251 0.17 0.527 0.598
14q 938 0.06 -2 1 0.09 -0.542 1
15q 810 0.04 -2.88 1 0.11 -0.385 1
16p 559 0.08 -2.43 1 0.18 0.819 0.472
16q 455 0.06 -3.31 1 0.21 1.38 0.305
17p 415 0.10 -1.93 1 0.23 1.9 0.143
17q 972 0.12 0.563 0.819 0.09 -0.423 1
18p 104 0.17 -0.875 1 0.19 -0.357 1
18q 275 0.09 -2.57 1 0.27 2.72 0.0259
19p 681 0.10 -1.35 1 0.17 0.991 0.429
19q 935 0.14 1.19 0.391 0.09 -0.5 1
20p 234 0.40 6.4 6.39e-10 0.12 -1.78 1
20q 448 0.49 10.9 0 0.04 -3.13 1
21q 258 0.05 -3.78 1 0.32 4.23 0.000232
22q 564 0.09 -2.08 1 0.20 1.57 0.233
Xq 668 0.27 4.4 2.75e-05 0.17 0.845 0.472
Methods & Data
Input
Description
  • Segmentation File: The segmentation file contains the segmented data for all the samples identified by GLAD, CBS, or some other segmentation algorithm. (See GLAD file format in the Genepattern file formats documentation.) It is a six column, tab-delimited file with an optional first line identifying the columns. Positions are in base pair units.The column headers are: (1) Sample (sample name), (2) Chromosome (chromosome number), (3) Start Position (segment start position, in bases), (4) End Position (segment end position, in bases), (5) Num markers (number of markers in segment), (6) Seg.CN (log2() -1 of copy number).

  • Markers File: The markers file identifies the marker names and positions of the markers in the original dataset (before segmentation). It is a three column, tab-delimited file with an optional header. The column headers are: (1) Marker Name, (2) Chromosome, (3) Marker Position (in bases).

  • Reference Genome: The reference genome file contains information about the location of genes and cytobands on a given build of the genome. Reference genome files are created in Matlab and are not viewable with a text editor.

  • CNV Files: There are two options for the cnv file. The first option allows CNVs to be identified by marker name. The second option allows the CNVs to be identified by genomic location. Option #1: A two column, tab-delimited file with an optional header row. The marker names given in this file must match the marker names given in the markers file. The CNV identifiers are for user use and can be arbitrary. The column headers are: (1) Marker Name, (2) CNV Identifier. Option #2: A 6 column, tab-delimited file with an optional header row. The 'CNV Identifier' is for user use and can be arbitrary. 'Narrow Region Start' and 'Narrow Region End' are also not used. The column headers are: (1) CNV Identifier, (2) Chromosome, (3) Narrow Region Start, (4) Narrow Region End, (5) Wide Region Start, (6) Wide Region End

  • Amplification Threshold: Threshold for copy number amplifications. Regions with a log2 ratio above this value are considered amplified.

  • Deletion Threshold: Threshold for copy number deletions. Regions with a log2 ratio below the negative of this value are considered deletions.

  • Cap Values: Minimum and maximum cap values on analyzed data. Regions with a log2 ratio greater than the cap are set to the cap value; regions with a log2 ratio less than -cap value are set to -cap. Values must be positive.

  • Broad Length Cutoff: Threshold used to distinguish broad from focal events, given in units of fraction of chromosome arm.

  • Remove X-Chromosome: Flag indicating whether to remove data from the X-chromosome before analysis. Allowed values= {1,0} (1: Remove X-Chromosome, 0: Do not remove X-Chromosome.

  • Confidence Level: Confidence level used to calculate the region containing a driver.

  • Join Segment Size: Smallest number of markers to allow in segments from the segmented data. Segments that contain fewer than this number of markers are joined to the neighboring segment that is closest in copy number.

  • Arm Level Peel Off: Flag set to enable arm-level peel-off of events during peak definition. The arm-level peel-off enhancement to the arbitrated peel-off method assigns all events in the same chromosome arm of the same sample to a single peak. It is useful when peaks are split by noise or chromothripsis. Allowed values= {1,0} (1: Use arm level peel off, 0: Use normal arbitrated peel-off).

  • Maximum Sample Segments: Maximum number of segments allowed for a sample in the input data. Samples with more segments than this threshold are excluded from the analysis.

  • Gene GISTIC: When enabled (value = 1), this option causes GISTIC to analyze deletions using genes instead of array markers to locate the lesion. In this mode, the copy number assigned to a gene is the lowest copy number among the markers that represent the gene.

Values

List of inputs used for this run of GISTIC2. All files listed should be included in the archived results.

  • Segmentation File = /xchip/cga/gdac-prod/tcga-gdac/jobResults/PrepareGisticDNASeq/STES-TP/22507538/segmentationfile.txt

  • Markers File = /xchip/cga/gdac-prod/tcga-gdac/jobResults/PrepareGisticDNASeq/STES-TP/22507538/markersfile.txt

  • Reference Genome = /xchip/cga/reference/gistic2/hg19_GENCODE_v18_20140127.mat

  • CNV Files = /xchip/gistic/CNV/SNP6.merged.151117.hg19.CNV.txt

  • Amplification Threshold = 0.3

  • Deletion Threshold = 0.3

  • Cap Values = 2

  • Broad Length Cutoff = 0.5

  • Remove X-Chromosome = 0

  • Confidence Level = 0.99

  • Join Segment Size = 10

  • Arm Level Peel Off = 1

  • Maximum Sample Segments = 10000

  • Gene GISTIC = 0

Table 4.  Get Full Table First 10 out of 158 Input Tumor Samples.

Tumor Sample Names
TCGA-B7-5816-01A-21D-1598-02
TCGA-B7-5818-01A-11D-1598-02
TCGA-BR-4183-01A-02D-1128-02
TCGA-BR-4184-01A-01D-1128-02
TCGA-BR-4187-01A-01D-1128-02
TCGA-BR-4188-01A-01D-1128-02
TCGA-BR-4191-01A-02D-1128-02
TCGA-BR-4201-01A-01D-1128-02
TCGA-BR-4253-01A-01D-1128-02
TCGA-BR-4255-01A-01D-1128-02

Figure 3.  Segmented copy number profiles in the input data

Output
All Lesions File (all_lesions.conf_##.txt, where ## is the confidence level)

The all lesions file summarizes the results from the GISTIC run. It contains data about the significant regions of amplification and deletion as well as which samples are amplified or deleted in each of these regions. The identified regions are listed down the first column, and the samples are listed across the first row, starting in column 10.

Region Data

Columns 1-9 present the data about the significant regions as follows:

  1. Unique Name: A name assigned to identify the region.

  2. Descriptor: The genomic descriptor of that region.

  3. Wide Peak Limits: The 'wide peak' boundaries most likely to contain the targeted genes. These are listed in genomic coordinates and marker (or probe) indices.

  4. Peak Limits: The boundaries of the region of maximal amplification or deletion.

  5. Region Limits: The boundaries of the entire significant region of amplification or deletion.

  6. Q values: The Q value of the peak region.

  7. Residual Q values: The Q value of the peak region after removing ('peeling off') amplifications or deletions that overlap other, more significant peak regions in the same chromosome.

  8. Broad or Focal: Identifies whether the region reaches significance due primarily to broad events (called 'broad'), focal events (called 'focal'), or independently significant broad and focal events (called 'both').

  9. Amplitude Threshold: Key giving the meaning of values in the subsequent columns associated with each sample.

Sample Data

Each of the analyzed samples is represented in one of the columns following the lesion data (columns 10 through end). The data contained in these columns varies slightly by section of the file. The first section can be identified by the key given in column 9 - it starts in row 2 and continues until the row that reads 'Actual Copy Change Given.' This section contains summarized data for each sample. A '0' indicates that the copy number of the sample was not amplified or deleted beyond the threshold amount in that peak region. A '1' indicates that the sample had low-level copy number aberrations (exceeding the low threshold indicated in column 9), and a '2' indicates that the sample had high-level copy number aberrations (exceeding the high threshold indicated in column 9).The second section can be identified the rows in which column 9 reads 'Actual Copy Change Given.' The second section exactly reproduces the first section, except that here the actual changes in copy number are provided rather than zeroes, ones, and twos.The final section is similar to the first section, except that here only broad events are included. A 1 in the samples columns (columns 10+) indicates that the median copy number of the sample across the entire significant region exceeded the threshold given in column 9. That is, it indicates whether the sample had a geographically extended event, rather than a focal amplification or deletion covering little more than the peak region.

Amplification Genes File (amp_genes.conf_##.txt, where ## is the confidence level)

The amp genes file contains one column for each amplification peak identified in the GISTIC analysis. The first four rows are:

  1. Cytoband

  2. Q value

  3. Residual Q value

  4. Wide Peak Boundaries

These rows identify the lesion in the same way as the all lesions file.The remaining rows list the genes contained in each wide peak. For peaks that contain no genes, the nearest gene is listed in brackets.

Deletion Genes File (del_genes.conf_##.txt, where ## is the confidence level)

The del genes file contains one column for each deletion peak identified in the GISTIC analysis. The file format for the del genes file is identical to the format for the amp genes file.

Gistic Scores File (scores.gistic)

The scores file lists the Q values [presented as -log10(q)], G scores, average amplitudes among aberrant samples, and frequency of aberration, across the genome for both amplifications and deletions. The scores file is viewable with the Genepattern SNPViewer module and may be imported into the Integrated Genomics Viewer (IGV).

Segmented Copy Number (raw_copy_number.{fig|pdf|png} )

The segmented copy number is a pdf file containing a colormap image of the segmented copy number profiles in the input data.

Amplification Score GISTIC plot (amp_qplot.{fig|pdf|png|v2.pdf})

The amplification pdf is a plot of the G scores (top) and Q values (bottom) with respect to amplifications for all markers over the entire region analyzed.

Deletion Score GISTIC plot (del_qplot.{fig|pdf|png|v2.pdf})

The deletion pdf is a plot of the G scores (top) and Q values (bottom) with respect to deletions for all markers over the entire region analyzed.

Tables (table_{amp|del}.conf_##.txt, where ## is the confidence level)

Tables of basic information about the genomic regions (peaks) that GISTIC determined to be significantly amplified or deleted. These describe three kinds of peak boundaries, and list the genes contained in two of them. The region start and region end columns (along with the chromosome column) delimit the entire area containing the peak that is above the significance level. The region may be the same for multiple peaks. The peak start and end delimit the maximum value of the peak. The extended peak is the peak determined by robust, and is contained within the wide peak reported in {amp|del}_genes.txt by one marker.

Broad Significance Results (broad_significance_results.txt)

A table of per-arm statistical results for the data set. Each arm is a row in the table. The first column specifies the arm and the second column counts the number of genes known to be on the arm. For both amplification and deletion, the table has columns for the frequency of amplification or deletion of the arm, and a Z score and Q value.

Broad Values By Arm (broad_values_by_arm.txt)

A table of chromosome arm amplification levels for each sample. Each row is a chromosome arm, and each column a sample. The data are in units of absolute copy number -2.

All Data By Genes (all_data_by_genes.txt)

A gene-level table of copy number values for all samples. Each row is the data for a gene. The first three columns name the gene, its NIH locus ID, and its cytoband - the remaining columns are the samples. The copy number values in the table are in units of (copy number -2), so that no amplification or deletion is 0, genes with amplifications have positive values, and genes with deletions are negative values. The data are converted from marker level to gene level using the extreme method: a gene is assigned the greatest amplification or the least deletion value among the markers it covers.

Broad Data By Genes (broad_data_by_genes.txt)

A gene-level table of copy number data similar to the all_data_by_genes.txt output, but using only broad events with lengths greater than the broad length cutoff. The structure of the file and the methods and units used for the data analysis are otherwise identical to all_data_by_genes.txt.

Focal Data By Genes (focal_data_by_genes.txt)

A gene-level table of copy number data similar to the all_data_by_genes.txt output, but using only focal events with lengths greater than the focal length cutoff. The structure of the file and the methods and units used for the data analysis are otherwise identical to all_data_by_genes.txt.

All Thresholded By Genes (all_thresholded.by_genes.txt)

A gene-level table of discrete amplification and deletion indicators at for all samples. There is a row for each gene. The first three columns name the gene, its NIH locus ID, and its cytoband - the remaining columns are the samples. A table value of 0 means no amplification or deletion above the threshold. Amplifications are positive numbers: 1 means amplification above the amplification threshold; 2 means amplifications larger to the arm level amplifications observed for the sample. Deletions are represented by negative table values: -1 represents deletion beyond the threshold; -2 means deletions greater than the minimum arm-level deletion observed for the sample.

Sample Cutoffs (sample_cutoffs.txt)

A table of the per-sample threshold cutoffs (in units of absolute copy number -2) used to distinguish the high level amplifications (+/-2) from ordinary amplifications (+/-1) in the all_thresholded.by_genes.txt output file. The table contains three columns: the sample identifier followed by the low (deletion) and high (amplification) cutoff values. The cutoffs are calculated as the minimum arm-level amplification level less the deletion threshold for deletions and the maximum arm-level amplification plus the amplification threshold for amplifications.

Focal Input To Gistic (focal_input.seg.txt)

A list of copy number segments describing just the focal events present in the data. The segment amplification/deletion levels are in units of (copy number -2), with amplifications positive and deletions negative numbers. This file may be viewed with IGV.

Gene Counts vs. Copy Number Alteration Frequency (freqarms_vs_ngenes.{fig|pdf})

An image showing the correlation between gene counts and frequency of copy number alterations.

Confidence Intervals (regions_track.conf_##.bed, where ## is the confidence level)

A file indicating the position of the confidence intervals around GISTIC peaks that can be loaded as a track in a compatible viewer browser such as IGV or the UCSC genome browser.

GISTIC

GISTIC identifies genomic regions that are significantly gained or lost across a set of tumors. It takes segmented copy number ratios as input, separates arm-level events from focal events, and then performs two tests: (i) identifies significantly amplified/deleted chromosome arms; and (ii) identifies regions that are significantly focally amplified or deleted. For the focal analysis, the significance levels (Q values) are calculated by comparing the observed gains/losses at each locus to those obtained by randomly permuting the events along the genome to reflect the null hypothesis that they are all 'passengers' and could have occurred anywhere. The locus-specific significance levels are then corrected for multiple hypothesis testing. The arm-level significance is calculated by comparing the frequency of gains/losses of each arm to the expected rate given its size. The method outputs genomic views of significantly amplified and deleted regions, as well as a table of genes with gain or loss scores. A more in depth discussion of the GISTIC algorithm and its utility is given in [1], [3], and [5].

CNV Description

Regions of the genome that are prone to germ line variations in copy number are excluded from the GISTIC analysis using a list of germ line copy number variations (CNVs). A CNV is a DNA sequence that may be found at different copy numbers in the germ line of two different individuals. Such germ line variations can confound a GISTIC analysis, which finds significant somatic copy number variations in cancer. A more in depth discussion is provided in [6]. GISTIC currently uses two CNV exclusion lists. One is based on the literature describing copy number variation, and a second one comes from an analysis of significant variations among the blood normals in the TCGA data set.

Download Results

In addition to the links below, the full results of the analysis summarized in this report can also be downloaded programmatically using firehose_get, or interactively from either the Broad GDAC website or TCGA Data Coordination Center Portal.

References
[1] Beroukhim et al, Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma, Proc Natl Acad Sci U S A. Vol. 104:50 (2007)
[3] Mermel et al, GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers, Genome Biology Vol. 12:4 (2011)
[5] Beroukhim et al., The landscape of somatic copy-number alteration across human cancers, Nature Vol. 463:7283 (2010)
[6] McCarroll, S. A. et al., Integrated detection and population-genetic analysis of SNPs and copy number variation, Nat Genet Vol. 40(10):1166-1174 (2008)