Thyroid Adenocarcinoma: Copy number analysis (GISTIC2)
Maintained by Dan DiCara (Broad Institute)
Overview
Introduction

GISTIC identifies genomic regions that are significantly gained or lost across a set of tumors. The pipeline first filters out normal samples from the segmented copy-number data by inspecting the TCGA barcodes and then executes GISTIC version 2.0.16 (cga svn revision 38839).

Summary

There were 228 tumor samples used in this analysis: 14 significant arm-level results, 0 significant focal amplifications, and 0 significant focal deletions were found.

Results
Focal results

Figure 1.  Genomic positions of amplified regions: the X-axis represents the normalized amplification signals (top) and significance by Q value (bottom). The green line represents the significance cutoff at Q value=0.25.

Figure 2.  Genomic positions of deleted regions: the X-axis represents the normalized deletion signals (top) and significance by Q value (bottom). The green line represents the significance cutoff at Q value=0.25.

Arm-level results

Table 1.  Get Full Table Arm-level significance table - 14 significant results found.

Arm # Genes Amp Frequency Amp Z score Amp Q value Del Frequency Del Z score Del Q value
1p 2121 0.00 -1.74 0.959 0.00 -1.16 0.96
1q 1955 0.04 2.92 0.00801 0.00 -1.12 0.96
2p 924 0.00 -1.15 0.928 0.03 1.72 0.42
2q 1556 0.00 -1.14 0.928 0.02 1.16 0.799
3p 1062 0.00 -1.17 0.928 0.00 -1.17 0.96
3q 1139 0.00 -1.17 0.928 0.01 -0.592 0.96
4p 489 0.01 -0.0303 0.928 0.00 -1.17 0.96
4q 1049 0.01 -0.0152 0.928 0.00 -1.16 0.96
5p 270 0.04 3.4 0.00188 0.00 -1.73 0.96
5q 1427 0.04 3.46 0.00188 0.00 -1.72 0.96
6p 1173 0.01 -0.598 0.928 0.00 -1.75 0.96
6q 839 0.00 -1.17 0.928 0.01 -0.599 0.96
7p 641 0.04 3.42 0.00188 0.00 -1.73 0.96
7q 1277 0.05 4.61 7.82e-05 0.00 -1.71 0.96
8p 580 0.00 -1.18 0.928 0.01 -0.605 0.96
8q 859 0.00 -1.17 0.928 0.01 -0.598 0.96
9p 422 0.00 -1.17 0.928 0.02 0.544 0.96
9q 1113 0.01 -0.56 0.928 0.03 1.74 0.42
10p 409 0.01 -0.602 0.928 0.01 -0.602 0.96
10q 1268 0.01 -0.576 0.928 0.01 -0.00159 0.96
11p 862 0.01 0.00291 0.928 0.02 0.576 0.96
11q 1515 0.01 -0.558 0.928 0.02 1.17 0.799
12p 575 0.04 3.42 0.00188 0.00 -1.73 0.96
12q 1447 0.04 3.46 0.00188 0.00 -1.71 0.96
13q 654 0.01 -0.551 0.928 0.04 3.45 0.00544
14q 1341 0.02 1.14 0.365 0.00 -1.73 0.96
15q 1355 0.00 -1.74 0.959 0.02 0.564 0.96
16p 872 0.04 2.87 0.00801 0.00 -1.14 0.96
16q 702 0.03 1.73 0.148 0.01 0.0142 0.96
17p 683 0.04 2.9 0.00801 0.02 0.616 0.96
17q 1592 0.04 3.49 0.00188 0.00 -1.12 0.96
18p 143 0.00 -1.18 0.928 0.01 -0.0396 0.96
18q 446 0.00 -1.17 0.928 0.01 -0.0315 0.96
19p 995 0.01 -0.582 0.928 0.01 -0.00899 0.96
19q 1709 0.01 -0.00492 0.928 0.00 -1.73 0.96
20p 355 0.03 1.68 0.151 0.00 -1.74 0.96
20q 753 0.02 1.12 0.365 0.00 -1.74 0.96
21q 509 0.00 -1.17 0.928 0.02 0.547 0.96
22q 921 0.01 -1 0.928 0.15 17.3 0
Methods & Data
Input

List of inputs used for this run of GISTIC2. All files listed should be included in the archived results.

  • Segmentation File = /xchip/cga/gdac-prod/tcga-gdac/jobResults/GDAC_MergeDataFilesPipeline/THCA/1270178/2.GDAC_MergeDataFiles.Finished/THCA.snp__genome_wide_snp_6__broad_mit_edu__Level_3__segmented_scna_minus_germline_cnv_hg19__seg.seg.txt

  • Markers File = /xchip/tcga/CancerGenomeAnalysisData/trunk/copynumber/SNP6_annotations/genome.info.6.0_hg19.na31_minus_frequent_nan_probes_sorted_2.1.txt

  • Reference Genome = /xchip/gistic/variables/hg19/hg19_with_miR_20120227.mat

  • CNV Files = /xchip/gistic/CNV/blood_normals/CNV.hg19_111204/CNV.hg19.bypos.111213.txt

  • Amplification Threshold = 0.30

  • Deletion Threshold = 0.30

  • Cap Values = 2.0

  • Broad Length Cutoff = 0.5

  • Remove X-Chromosome = 1

  • Confidence Level = 0.99

  • Join Segment Size = 10

  • Arm Level Peel Off = 1

  • Maximum Sample Segments = 10000

Table 2.  Get Full Table First 10 out of 228 Input Tumor Samples.

Tumor Sample Names
TCGA-BJ-A0YZ-01A-11D-A10T-01
TCGA-BJ-A0Z0-01A-11D-A10T-01
TCGA-BJ-A0Z2-01A-11D-A10T-01
TCGA-BJ-A0Z3-01A-11D-A13V-01
TCGA-BJ-A0Z5-01A-11D-A10T-01
TCGA-BJ-A0Z9-01A-11D-A10T-01
TCGA-BJ-A0ZA-01A-11D-A10T-01
TCGA-BJ-A0ZB-01A-11D-A10T-01
TCGA-BJ-A0ZC-01A-12D-A13V-01
TCGA-BJ-A0ZE-01A-11D-A10T-01

Figure 3.  Segmented copy number profiles in the input data

GISTIC

GISTIC identifies genomic regions that are significantly gained or lost across a set of tumors. It takes segmented copy number ratios as input, separates arm-level events from focal events, and then performs two tests: (i) identifies significantly amplified/deleted chromosome arms; and (ii) identifies regions that are significantly focally amplified or deleted. For the focal analysis, the significance levels (Q values) are calculated by comparing the observed gains/losses at each locus to those obtained by randomly permuting the events along the genome to reflect the null hypothesis that they are all 'passengers' and could have occurred anywhere. The locus-specific significance levels are then corrected for multiple hypothesis testing. The arm-level significance is calculated by comparing the frequency of gains/losses of each arm to the expected rate given its size. The method outputs genomic views of significantly amplified and deleted regions, as well as a table of genes with gain or loss scores. A more in depth discussion of the GISTIC algorithm and its utility is given in [1], [3], and [5].

CNV Description

Regions of the genome that are prone to germ line variations in copy number are excluded from the GISTIC analysis using a list of germ line copy number variations (CNVs). A CNV is a DNA sequence that may be found at different copy numbers in the germ line of two different individuals. Such germ line variations can confound a GISTIC analysis, which finds significant somatic copy number variations in cancer. A more in depth discussion is provided in [6]. GISTIC currently uses two CNV exclusion lists. One is based on the literature describing copy number variation, and a second one comes from an analysis of significant variations among the blood normals in the TCGA data set.

Download Results

This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.

References
[1] Beroukhim et al, Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma, Proc Natl Acad Sci U S A. Vol. 104:50 (2007)
[3] Mermel et al, GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers, Genome Biology Vol. 12:4 (2011)
[5] Beroukhim et al., The landscape of somatic copy-number alteration across human cancers, Nature Vol. 463:7283 (2010)
[6] McCarroll, S. A. et al., Integrated detection and population-genetic analysis of SNPs and copy number variation, Nat Genet Vol. 40(10):1166-1174 (2008)
Meta
  • Maintainer = Dan DiCara