Mutation Analysis (MutSig v1.5)
Head and Neck Squamous Cell Carcinoma (Primary solid tumor)
02 April 2015  |  analyses__2015_04_02
Maintainer Information
Citation Information
Maintained by David Heiman (Broad Institute)
Cite as Broad Institute TCGA Genome Data Analysis Center (2015): Mutation Analysis (MutSig v1.5). Broad Institute of MIT and Harvard. doi:10.7908/C16D5S0G
Overview
Introduction

This report serves to describe the mutational landscape and properties of a given individual set, as well as rank genes and genesets according to mutational significance. MutSig v1.5 was used to generate the results found in this report.

  • Working with individual set: HNSC-TP

  • Number of patients in set: 510

Input

The input for this pipeline is a set of individuals with the following files associated for each:

  1. An annotated .maf file describing the mutations called for the respective individual, and their properties.

  2. A .wig file that contains information about the coverage of the sample.

Summary
  • MAF used for this analysis:HNSC-TP.final_analysis_set.maf

  • Blacklist used for this analysis: pancan_mutation_blacklist.v14.hg19.txt

  • Significantly mutated genes (q ≤ 0.1): 86

  • Mutations seen in COSMIC: 861

  • Significantly mutated genes in COSMIC territory: 16

  • Significantly mutated genesets: 59

  • Significantly mutated genesets: (excluding sig. mutated genes):0

Mutation Preprocessing
  • Read 510 MAFs of type "Broad"

  • Total number of mutations in input MAFs: 120565

  • After removing 8 mutations outside chr1-24: 120557

  • After removing 3459 blacklisted mutations: 117098

  • After removing 4218 noncoding mutations: 112880

  • After collapsing adjacent/redundant mutations: 102746

Mutation Filtering
  • Number of mutations before filtering: 102746

  • After removing 5118 mutations outside gene set: 97628

  • After removing 158 mutations outside category set: 97470

Results
Breakdown of Mutations by Type

Table 1.  Get Full Table Table representing breakdown of mutations by type.

type count
Frame_Shift_Del 2080
Frame_Shift_Ins 850
In_Frame_Del 446
In_Frame_Ins 54
Missense_Mutation 61396
Nonsense_Mutation 4725
Nonstop_Mutation 106
Silent 24432
Splice_Site 3027
Translation_Start_Site 354
Total 97470
Breakdown of Mutation Rates by Category Type

Table 2.  Get Full Table A breakdown of mutation rates per category discovered for this individual set.

category n N rate rate_per_mb relative_rate exp_ns_s_ratio
*CpG->T 10862 837007747 0.000013 13 2.7 2.1
*Cp(A/C/T)->T 16072 6830387296 2.4e-06 2.4 0.48 1.7
C->(G/A) 22812 7667395043 3e-06 3 0.61 4.8
A->mut 11989 7359712076 1.6e-06 1.6 0.34 3.9
indel+null 11154 15027107119 7.4e-07 0.74 0.15 NaN
double_null 149 15027107119 9.9e-09 0.0099 0.002 NaN
Total 73038 15027107119 4.9e-06 4.9 1 3.5
Target Coverage for Each Individual

The x axis represents the samples. The y axis represents the exons, one row per exon, and they are sorted by average coverage across samples. For exons with exactly the same average coverage, they are sorted next by the %GC of the exon. (The secondary sort is especially useful for the zero-coverage exons at the bottom). If the figure is unpopulated, then full coverage is assumed (e.g. MutSig CV doesn't use WIGs and assumes full coverage).

Figure 1. 

Distribution of Mutation Counts, Coverage, and Mutation Rates Across Samples

Figure 2.  Patients counts and rates file used to generate this plot: HNSC-TP.patients.counts_and_rates.txt

Lego Plots

The mutation spectrum is depicted in the lego plots below in which the 96 possible mutation types are subdivided into six large blocks, color-coded to reflect the base substitution type. Each large block is further subdivided into the 16 possible pairs of 5' and 3' neighbors, as listed in the 4x4 trinucleotide context legend. The height of each block corresponds to the mutation frequency for that kind of mutation (counts of mutations normalized by the base coverage in a given bin). The shape of the spectrum is a signature for dominant mutational mechanisms in different tumor types.

Figure 3.  Get High-res Image SNV Mutation rate lego plot for entire set. Each bin is normalized by base coverage for that bin. Colors represent the six SNV types on the upper right. The three-base context for each mutation is labeled in the 4x4 legend on the lower right. The fractional breakdown of SNV counts is shown in the pie chart on the upper left. If this figure is blank, not enough information was provided in the MAF to generate it.

Figure 4.  Get High-res Image SNV Mutation rate lego plots for 4 slices of mutation allele fraction (0<=AF<0.1, 0.1<=AF<0.25, 0.25<=AF<0.5, & 0.5<=AF) . The color code and three-base context legends are the same as the previous figure. If this figure is blank, not enough information was provided in the MAF to generate it.

CoMut Plot

Figure 5.  Get High-res Image The matrix in the center of the figure represents individual mutations in patient samples, color-coded by type of mutation, for the significantly mutated genes. The rate of synonymous and non-synonymous mutations is displayed at the top of the matrix. The barplot on the left of the matrix shows the number of mutations in each gene. The percentages represent the fraction of tumors with at least one mutation in the specified gene. The barplot to the right of the matrix displays the q-values for the most significantly mutated genes. The purple boxplots below the matrix (only displayed if required columns are present in the provided MAF) represent the distributions of allelic fractions observed in each sample. The plot at the bottom represents the base substitution distribution of individual samples, using the same categories that were used to calculate significance.

Significantly Mutated Genes

Column Descriptions:

  • N = number of sequenced bases in this gene across the individual set

  • n = number of (nonsilent) mutations in this gene across the individual set

  • npat = number of patients (individuals) with at least one nonsilent mutation

  • nsite = number of unique sites having a non-silent mutation

  • nsil = number of silent mutations in this gene across the individual set

  • n1 = number of nonsilent mutations of type: *CpG->T

  • n2 = number of nonsilent mutations of type: *Cp(A/C/T)->T

  • n3 = number of nonsilent mutations of type: C->(G/A)

  • n4 = number of nonsilent mutations of type: A->mut

  • n5 = number of nonsilent mutations of type: indel+null

  • n6 = number of nonsilent mutations of type: double_null

  • p_ns_s = p-value for the observed nonsilent/silent ratio being elevated in this gene

  • p = p-value (overall)

  • q = q-value, False Discovery Rate (Benjamini-Hochberg procedure)

Table 3.  Get Full Table A Ranked List of Significantly Mutated Genes. Number of significant genes found: 86. Number of genes displayed: 35. Click on a gene name to display its stick figure depicting the distribution of mutations and mutation types across the chosen gene (this feature may not be available for all significant genes).

rank gene description N n npat nsite nsil n1 n2 n3 n4 n5 n6 p_ns_s p q
1 TP53 tumor protein p53 627695 426 359 209 5 72 45 61 67 171 10 <1.00e-15 <1.00e-15 <6.04e-12
2 CDKN2A cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) 432500 116 112 55 0 4 6 8 7 89 2 9.28e-13 <1.00e-15 <6.04e-12
3 PIK3CA phosphoinositide-3-kinase, catalytic, alpha polypeptide 1675015 96 94 35 1 2 55 6 30 3 0 1.79e-11 <1.00e-15 <6.04e-12
4 HRAS v-Ha-ras Harvey rat sarcoma viral oncogene homolog 331344 32 29 9 0 7 3 18 3 1 0 3.95e-05 2.11e-15 9.55e-12
5 CASP8 caspase 8, apoptosis-related cysteine peptidase 890224 61 55 49 1 4 8 8 9 30 2 1.55e-06 3.77e-15 1.37e-11
6 NSD1 nuclear receptor binding SET domain protein 1 4162026 73 63 72 2 2 9 13 5 38 6 0.000300 5.33e-15 1.61e-11
7 FAT1 FAT tumor suppressor homolog 1 (Drosophila) 6963586 125 114 120 11 3 7 11 11 77 16 5.83e-05 6.66e-15 1.72e-11
8 NOTCH1 Notch homolog 1, translocation-associated (Drosophila) 3182901 95 87 92 9 14 15 14 9 42 1 1.19e-06 8.99e-15 2.04e-11
9 HLA-B major histocompatibility complex, class I, B 502645 26 24 21 1 0 1 8 2 15 0 0.00659 1.05e-14 2.12e-11
10 NFE2L2 nuclear factor (erythroid-derived 2)-like 2 911867 27 26 18 2 0 6 13 7 1 0 0.0701 6.77e-13 1.23e-09
11 TGFBR2 transforming growth factor, beta receptor II (70/80kDa) 878843 25 23 16 2 4 7 0 5 9 0 0.00403 2.38e-11 3.75e-08
12 OR2M5 olfactory receptor, family 2, subfamily M, member 5 480420 21 18 20 0 4 3 10 2 2 0 0.00297 2.48e-11 3.75e-08
13 RAC1 ras-related C3 botulinum toxin substrate 1 (rho family, small GTP binding protein Rac1) 317246 16 15 9 0 6 4 2 3 1 0 0.00386 2.72e-11 3.79e-08
14 HLA-A major histocompatibility complex, class I, A 564314 24 22 21 4 1 0 2 6 15 0 0.224 6.75e-10 8.73e-07
15 FBXW7 F-box and WD repeat domain containing 7 1269883 34 33 25 4 4 2 11 5 11 1 0.272 8.26e-10 9.97e-07
16 POM121L12 POM121 transmembrane nucleoporin-like 12 443503 24 22 24 4 6 2 9 4 3 0 0.0642 1.68e-08 1.90e-05
17 PTEN phosphatase and tensin homolog (mutated in multiple advanced cancers 1) 618847 14 14 14 0 1 1 2 2 8 0 0.0580 3.47e-08 3.69e-05
18 EP300 E1A binding protein p300 3751833 39 39 32 1 5 10 7 7 10 0 0.000293 4.01e-08 4.04e-05
19 B2M beta-2-microglobulin 189710 10 9 9 0 0 2 2 0 6 0 0.242 4.57e-08 4.35e-05
20 AGTR1 angiotensin II receptor, type 1 552840 14 14 14 1 2 5 2 5 0 0 0.0568 5.42e-08 4.91e-05
21 ZNF750 zinc finger protein 750 1111690 24 21 22 1 1 2 4 3 14 0 0.0388 1.11e-07 9.54e-05
22 EPHA2 EPH receptor A2 1449501 27 24 24 2 6 1 2 0 16 2 0.0231 3.32e-07 0.000273
23 CD1E CD1e molecule 605964 14 14 14 1 4 5 4 0 1 0 0.0880 4.96e-07 0.000390
24 RHOA ras homolog gene family, member A 304980 10 10 7 1 0 1 6 2 1 0 0.296 7.15e-07 0.000540
25 PSG8 pregnancy specific beta-1-glycoprotein 8 674216 17 17 16 2 1 5 8 2 1 0 0.103 7.45e-07 0.000540
26 C6 complement component 6 1464933 25 23 25 2 1 5 6 4 9 0 0.0821 3.14e-06 0.00218
27 CFHR4 complement factor H-related 4 515529 14 14 13 2 0 6 7 1 0 0 0.450 3.33e-06 0.00223
28 NPFFR2 neuropeptide FF receptor 2 807067 17 16 17 0 1 3 4 6 3 0 0.0151 3.46e-06 0.00224
29 OR5D13 olfactory receptor, family 5, subfamily D, member 13 482774 18 16 17 3 1 3 7 6 1 0 0.231 4.01e-06 0.00250
30 GRXCR1 glutaredoxin, cysteine rich 1 453156 11 11 11 1 3 0 5 2 1 0 0.271 4.38e-06 0.00264
31 SPATA16 spermatogenesis associated 16 892039 16 15 16 1 1 2 7 4 1 1 0.242 5.05e-06 0.00294
32 KEAP1 kelch-like ECH-associated protein 1 905929 24 22 23 3 8 3 7 4 2 0 0.0237 5.19e-06 0.00294
33 SMAD4 SMAD family member 4 867452 15 14 15 1 5 1 4 0 5 0 0.112 5.56e-06 0.00305
34 OR5L1 olfactory receptor, family 5, subfamily L, member 1 479340 12 11 12 2 2 2 4 4 0 0 0.172 7.10e-06 0.00369
35 LCP1 lymphocyte cytosolic protein 1 (L-plastin) 991044 16 15 16 0 3 2 5 4 2 0 0.0137 7.13e-06 0.00369
TP53

Figure S1.  This figure depicts the distribution of mutations and mutation types across the TP53 significant gene.

CDKN2A

Figure S2.  This figure depicts the distribution of mutations and mutation types across the CDKN2A significant gene.

PIK3CA

Figure S3.  This figure depicts the distribution of mutations and mutation types across the PIK3CA significant gene.

HRAS

Figure S4.  This figure depicts the distribution of mutations and mutation types across the HRAS significant gene.

NSD1

Figure S5.  This figure depicts the distribution of mutations and mutation types across the NSD1 significant gene.

FAT1

Figure S6.  This figure depicts the distribution of mutations and mutation types across the FAT1 significant gene.

NOTCH1

Figure S7.  This figure depicts the distribution of mutations and mutation types across the NOTCH1 significant gene.

NFE2L2

Figure S8.  This figure depicts the distribution of mutations and mutation types across the NFE2L2 significant gene.

TGFBR2

Figure S9.  This figure depicts the distribution of mutations and mutation types across the TGFBR2 significant gene.

OR2M5

Figure S10.  This figure depicts the distribution of mutations and mutation types across the OR2M5 significant gene.

RAC1

Figure S11.  This figure depicts the distribution of mutations and mutation types across the RAC1 significant gene.

HLA-A

Figure S12.  This figure depicts the distribution of mutations and mutation types across the HLA-A significant gene.

FBXW7

Figure S13.  This figure depicts the distribution of mutations and mutation types across the FBXW7 significant gene.

POM121L12

Figure S14.  This figure depicts the distribution of mutations and mutation types across the POM121L12 significant gene.

PTEN

Figure S15.  This figure depicts the distribution of mutations and mutation types across the PTEN significant gene.

EP300

Figure S16.  This figure depicts the distribution of mutations and mutation types across the EP300 significant gene.

B2M

Figure S17.  This figure depicts the distribution of mutations and mutation types across the B2M significant gene.

AGTR1

Figure S18.  This figure depicts the distribution of mutations and mutation types across the AGTR1 significant gene.

ZNF750

Figure S19.  This figure depicts the distribution of mutations and mutation types across the ZNF750 significant gene.

EPHA2

Figure S20.  This figure depicts the distribution of mutations and mutation types across the EPHA2 significant gene.

CD1E

Figure S21.  This figure depicts the distribution of mutations and mutation types across the CD1E significant gene.

RHOA

Figure S22.  This figure depicts the distribution of mutations and mutation types across the RHOA significant gene.

PSG8

Figure S23.  This figure depicts the distribution of mutations and mutation types across the PSG8 significant gene.

C6

Figure S24.  This figure depicts the distribution of mutations and mutation types across the C6 significant gene.

NPFFR2

Figure S25.  This figure depicts the distribution of mutations and mutation types across the NPFFR2 significant gene.

OR5D13

Figure S26.  This figure depicts the distribution of mutations and mutation types across the OR5D13 significant gene.

GRXCR1

Figure S27.  This figure depicts the distribution of mutations and mutation types across the GRXCR1 significant gene.

SPATA16

Figure S28.  This figure depicts the distribution of mutations and mutation types across the SPATA16 significant gene.

KEAP1

Figure S29.  This figure depicts the distribution of mutations and mutation types across the KEAP1 significant gene.

SMAD4

Figure S30.  This figure depicts the distribution of mutations and mutation types across the SMAD4 significant gene.

OR5L1

Figure S31.  This figure depicts the distribution of mutations and mutation types across the OR5L1 significant gene.

COSMIC analyses

In this analysis, COSMIC is used as a filter to increase power by restricting the territory of each gene. Cosmic version: v48.

Table 4.  Get Full Table Significantly mutated genes (COSMIC territory only). To access the database please go to: COSMIC. Number of significant genes found: 16. Number of genes displayed: 10

rank gene description n cos n_cos N_cos cos_ev p q
1 HRAS v-Ha-ras Harvey rat sarcoma viral oncogene homolog 32 19 31 9690 7799 0 0
2 TP53 tumor protein p53 426 356 392 181560 76737 0 0
3 PIK3CA phosphoinositide-3-kinase, catalytic, alpha polypeptide 96 220 81 112200 38739 0 0
4 CDKN2A cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) 116 332 114 169320 4870 0 0
5 FBXW7 F-box and WD repeat domain containing 7 34 91 23 46410 520 0 0
6 FGFR3 fibroblast growth factor receptor 3 (achondroplasia, thanatophoric dwarfism) 15 62 9 31620 5442 0 0
7 PIK3R1 phosphoinositide-3-kinase, regulatory subunit 1 (alpha) 10 33 6 16830 14 3.9e-10 2.5e-07
8 PTEN phosphatase and tensin homolog (mutated in multiple advanced cancers 1) 14 767 14 391170 527 1.6e-08 8.9e-06
9 RB1 retinoblastoma 1 (including osteosarcoma) 18 267 9 136170 24 3.7e-08 0.000019
10 SMAD4 SMAD family member 4 15 159 7 81090 17 2.1e-07 0.000094

Note:

n - number of (nonsilent) mutations in this gene across the individual set.

cos = number of unique mutated sites in this gene in COSMIC

n_cos = overlap between n and cos.

N_cos = number of individuals times cos.

cos_ev = total evidence: number of reports in COSMIC for mutations seen in this gene.

p = p-value for seeing the observed amount of overlap in this gene)

q = q-value, False Discovery Rate (Benjamini-Hochberg procedure)

Geneset Analyses

Table 5.  Get Full Table A Ranked List of Significantly Mutated Genesets. (Source: MSigDB GSEA Cannonical Pathway Set).Number of significant genesets found: 59. Number of genesets displayed: 10

rank geneset description genes N_genes mut_tally N n npat nsite nsil n1 n2 n3 n4 n5 n6 p_ns_s p q
1 ARFPATHWAY Cyclin-dependent kinase inhibitor 2A is a tumor suppressor that induces G1 arrest and can activate the p53 pathway, leading to G2/M arrest. ABL1, CDKN2A, E2F1, MDM2, MYC, PIK3CA, PIK3R1, POLR1A, POLR1B, POLR1C, POLR1D, RAC1, RB1, TBX2, TP53, TWIST1 16 ABL1(8), CDKN2A(116), MDM2(2), MYC(6), PIK3CA(96), PIK3R1(10), POLR1A(10), POLR1B(4), POLR1C(3), RAC1(16), RB1(18), TBX2(4), TP53(426), TWIST1(1) 15420104 720 420 371 20 90 120 90 118 290 12 1.11e-15 <1.00e-15 <4.11e-14
2 G1_TO_S_CELL_CYCLE_REACTOME ATM, CCNA1, CCNB1, CCND1, CCND2, CCND3, CCNE1, CCNE2, CCNG2, CCNH, CDC25A, CDC45L, CDK2, CDK4, CDK7, CDKN1A, CDKN1B, CDKN1C, CDKN2A, CDKN2B, CDKN2C, CDKN2D, CREB3, CREB3L1, CREB3L3, CREB3L4, CREBL1, CREBL1, TNXB, E2F1, E2F2, E2F3, E2F4, E2F5, E2F6, FLJ14001, GADD45A, GBA2, MCM2, MCM3, MCM4, MCM5, MCM6, MCM7, MDM2, MNAT1, MYC, MYT1, NACA, NACA, FKSG17, ORC1L, ORC2L, ORC3L, ORC4L, ORC5L, ORC6L, PCNA, POLA2, POLE, POLE2, PRIM1, PRIM2A, RB1, RBL1, RPA1, RPA2, RPA3, TFDP1, TFDP2, TP53, WEE1 64 ATM(16), CCNA1(7), CCNB1(3), CCND1(4), CCND2(2), CCNE1(5), CCNE2(4), CCNG2(4), CDC25A(2), CDK4(5), CDKN1B(3), CDKN2A(116), CDKN2C(1), CREB3(1), CREB3L1(4), CREB3L3(10), CREB3L4(2), E2F2(3), E2F3(3), E2F4(2), E2F5(6), E2F6(1), GADD45A(1), GBA2(9), MCM2(4), MCM3(4), MCM4(7), MCM5(2), MCM6(4), MCM7(9), MDM2(2), MNAT1(1), MYC(6), MYT1(12), NACA(25), PCNA(1), POLA2(2), POLE(11), POLE2(1), PRIM1(3), RB1(18), RBL1(11), RPA1(5), RPA2(1), TFDP1(3), TFDP2(1), TNXB(23), TP53(426), WEE1(1) 58754687 797 416 516 76 105 118 141 112 308 13 <1.00e-15 <1.00e-15 <4.11e-14
3 ST_FAS_SIGNALING_PATHWAY The Fas receptor induces apoptosis and NF-kB activation when bound to Fas ligand. ADPRT, ALG2, BAK1, BAX, BFAR, BIRC4, BTK, CAD, CASP10, CASP3, CASP8, CASP8AP2, CD7, CDK2AP1, CSNK1A1, DAXX, DEDD, DEDD2, DFFA, DIABLO, EGFR, EPHB2, FADD, FAF1, FAIM2, FREQ, HRB, HSPB1, IL1A, IL8, MAP2K4, MAP2K7, MAP3K1, MAP3K5, MAPK1, MAPK10, MAPK8, MAPK8IP1, MAPK8IP2, MAPK8IP3, MAPK9, MCP, MET, NFAT5, NFKB1, NFKB2, NFKBIA, NFKBIB, NFKBIE, NFKBIL1, NFKBIL2, NR0B2, PFN1, PFN2, PTPN13, RALBP1, RIPK1, ROCK1, SMPD1, TNFRSF6, TNFRSF6B, TP53, TPX2, TRAF2, TUFM, VIL2 59 ALG2(2), BAK1(1), BAX(2), BTK(8), CAD(5), CASP10(3), CASP3(2), CASP8(61), CD7(2), CSNK1A1(1), DAXX(5), DEDD(1), DEDD2(5), DIABLO(2), EGFR(21), EPHB2(3), FADD(4), FAF1(2), IL1A(1), MAP2K4(2), MAP2K7(2), MAP3K1(6), MAP3K5(6), MAPK1(9), MAPK10(4), MAPK8(4), MAPK8IP1(3), MAPK8IP2(3), MAPK8IP3(8), MAPK9(5), MET(4), NFAT5(13), NFKB1(5), NFKB2(5), NFKBIA(1), NFKBIB(2), NFKBIE(1), NFKBIL1(2), NR0B2(1), PFN1(1), PTPN13(13), RALBP1(5), RIPK1(2), ROCK1(19), SMPD1(2), TNFRSF6B(4), TP53(426), TPX2(5), TRAF2(2), TUFM(2) 55474647 698 416 462 67 113 99 130 104 239 13 <1.00e-15 <1.00e-15 <4.11e-14
4 PMLPATHWAY Ring-shaped PML nuclear bodies regulate transcription and are required co-activators in p53- and DAXX-mediated apoptosis. CREBBP, DAXX, HRAS, PAX3, PML, PRAM-1, RARA, RB1, SIRT1, SP100, TNF, TNFRSF1A, TNFRSF1B, TNFRSF6, TNFSF6, TP53, UBL1 13 CREBBP(40), DAXX(5), HRAS(32), PAX3(6), PML(4), RARA(4), RB1(18), SIRT1(2), SP100(8), TNFRSF1A(1), TNFRSF1B(2), TP53(426) 14483208 548 402 307 32 88 65 104 83 197 11 <1.00e-15 <1.00e-15 <4.11e-14
5 G2PATHWAY Activated Cdc2-cyclin B kinase regulates the G2/M transition; DNA damage stimulates the DNA-PK/ATM/ATR kinases, which inactivate Cdc2. ATM, ATR, BRCA1, CCNB1, CDC2, CDC25A, CDC25B, CDC25C, CDC34, CDKN1A, CDKN2D, CHEK1, CHEK2, EP300, GADD45A, MDM2, MYT1, PLK, PRKDC, RPS6KA1, TP53, WEE1, YWHAH, YWHAQ 22 ATM(16), ATR(25), BRCA1(12), CCNB1(3), CDC25A(2), CDC25B(4), CDC25C(4), CDC34(4), CHEK1(1), CHEK2(5), EP300(39), GADD45A(1), MDM2(2), MYT1(12), PRKDC(28), RPS6KA1(10), TP53(426), WEE1(1), YWHAH(2), YWHAQ(3) 31890716 600 394 376 34 94 85 107 102 202 10 1.33e-15 <1.00e-15 <4.11e-14
6 ATRBRCAPATHWAY BRCA1 and 2 block cell cycle progression in response to DNA damage and promote double-stranded break repair; mutations induce breast cancer susceptibility. ATM, ATR, BRCA1, BRCA2, CHEK1, CHEK2, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, HUS1, MRE11A, NBS1, RAD1, RAD17, RAD50, RAD51, RAD9A, TP53, TREX1 21 ATM(16), ATR(25), BRCA1(12), BRCA2(22), CHEK1(1), CHEK2(5), FANCA(7), FANCC(3), FANCD2(9), FANCF(2), FANCG(2), HUS1(2), MRE11A(3), RAD17(7), RAD50(9), RAD51(1), RAD9A(2), TP53(426), TREX1(4) 33493143 558 390 341 32 81 86 93 96 192 10 <1.00e-15 <1.00e-15 <4.11e-14
7 ST_JNK_MAPK_PATHWAY JNKs are MAP kinases regulated by several levels of kinases (MAPKK, MAPKKK) and phosphorylate transcription factors and regulatory proteins. AKT1, ATF2, CDC42, DLD, DUSP10, DUSP4, DUSP8, GAB1, GADD45A, GCK, IL1R1, JUN, MAP2K4, MAP2K5, MAP2K7, MAP3K1, MAP3K10, MAP3K11, MAP3K12, MAP3K13, MAP3K2, MAP3K3, MAP3K4, MAP3K5, MAP3K7, MAP3K7IP1, MAP3K7IP2, MAP3K9, MAPK10, MAPK7, MAPK8, MAPK9, MYEF2, NFATC3, NR2C2, PAPPA, SHC1, TP53, TRAF6, ZAK 38 AKT1(2), ATF2(1), CDC42(4), DLD(1), DUSP10(3), DUSP4(1), DUSP8(1), GAB1(5), GADD45A(1), GCK(4), IL1R1(4), JUN(1), MAP2K4(2), MAP2K5(3), MAP2K7(2), MAP3K1(6), MAP3K10(6), MAP3K11(4), MAP3K12(9), MAP3K13(15), MAP3K2(1), MAP3K3(3), MAP3K4(14), MAP3K5(6), MAP3K7(8), MAP3K9(9), MAPK10(4), MAPK7(6), MAPK8(4), MAPK9(5), MYEF2(7), NFATC3(6), NR2C2(1), PAPPA(13), SHC1(3), TP53(426), TRAF6(3), ZAK(5) 38392565 599 388 379 50 100 85 119 92 193 10 <1.00e-15 <1.00e-15 <4.11e-14
8 TIDPATHWAY On ligand binding, interferon gamma receptors stimulate JAK2 kinase to phosphorylate STAT transcription factors, which promote expression of interferon responsive genes. DNAJA3, HSPA1A, IFNG, IFNGR1, IFNGR2, IKBKB, JAK2, LIN7A, NFKB1, NFKBIA, RB1, RELA, TIP-1, TNF, TNFRSF1A, TNFRSF1B, TP53, USH1C, WT1 18 DNAJA3(3), IFNG(2), IFNGR1(4), IFNGR2(5), IKBKB(12), JAK2(5), LIN7A(5), NFKB1(5), NFKBIA(1), RB1(18), RELA(2), TNFRSF1A(1), TNFRSF1B(2), TP53(426), USH1C(7), WT1(3) 13844325 501 384 283 28 78 57 85 77 194 10 <1.00e-15 <1.00e-15 <4.11e-14
9 RBPATHWAY The ATM protein kinase recognizes DNA damage and blocks cell cycle progression by phosphorylating chk1 and p53, which normally inhibits Rb to allow G1/S transitions. ATM, CDC2, CDC25A, CDC25B, CDC25C, CDK2, CDK4, CHEK1, MYT1, RB1, TP53, WEE1, YWHAH 12 ATM(16), CDC25A(2), CDC25B(4), CDC25C(4), CDK4(5), CHEK1(1), MYT1(12), RB1(18), TP53(426), WEE1(1), YWHAH(2) 13349006 491 381 274 14 79 59 75 75 193 10 <1.00e-15 <1.00e-15 <4.11e-14
10 P53HYPOXIAPATHWAY Hypoxia induces p53 accumulation and consequent apoptosis with p53-mediated cell cycle arrest, which is present under conditions of DNA damage. ABCB1, AKT1, ATM, BAX, CDKN1A, CPB2, CSNK1A1, CSNK1D, FHL2, GADD45A, HIC1, HIF1A, HSPA1A, HSPCA, IGFBP3, MAPK8, MDM2, NFKBIB, NQO1, TP53 19 ABCB1(20), AKT1(2), ATM(16), BAX(2), CPB2(4), CSNK1A1(1), CSNK1D(1), FHL2(4), GADD45A(1), HIC1(1), HIF1A(3), IGFBP3(1), MAPK8(4), MDM2(2), NFKBIB(2), NQO1(3), TP53(426) 15825286 493 372 276 26 83 59 79 77 185 10 <1.00e-15 <1.00e-15 <4.11e-14

Table 6.  Get Full Table A Ranked List of Significantly Mutated Genesets (Excluding Significantly Mutated Genes). Number of significant genesets found: 0. Number of genesets displayed: 10

rank geneset description genes N_genes mut_tally N n npat nsite nsil n1 n2 n3 n4 n5 n6 p_ns_s p q
1 FBW7PATHWAY Cyclin E interacts with cell cycle checkpoint kinase cdk2 to allow transcription of genes required for S phase, including transcription of additional cyclin E. CCNE1, CDC34, CDK2, CUL1, E2F1, FBXW7, RB1, SKP1A, TFDP1 7 CCNE1(5), CDC34(4), CUL1(11), RB1(18), TFDP1(3) 4986845 41 35 41 7 3 12 6 3 17 0 0.038 0.26 1
2 HSA00627_1,4_DICHLOROBENZENE_DEGRADATION Genes involved in 1,4-dichlorobenzene degradation CMBL 1 CMBL(2) 386575 2 2 2 0 0 0 0 0 2 0 0.66 0.3 1
3 SULFUR_METABOLISM BPNT1, PAPSS1, PAPSS2, SULT1A2, SULT1A3, SULT1A3, SULT1A4, SULT1E1, SULT2A1, SUOX 7 BPNT1(2), PAPSS1(6), PAPSS2(6), SULT1A2(1), SULT1E1(5), SULT2A1(2), SUOX(2) 4644911 24 22 24 2 5 7 5 5 2 0 0.031 0.5 1
4 HSA00472_D_ARGININE_AND_D_ORNITHINE_METABOLISM Genes involved in D-arginine and D-ornithine metabolism DAO 1 DAO(2) 549130 2 2 2 1 1 0 0 1 0 0 0.8 0.57 1
5 HSA00550_PEPTIDOGLYCAN_BIOSYNTHESIS Genes involved in peptidoglycan biosynthesis GLUL, PGLYRP2 2 GLUL(3), PGLYRP2(6) 1383171 9 9 9 2 1 3 2 2 1 0 0.35 0.62 1
6 HSA00902_MONOTERPENOID_BIOSYNTHESIS Genes involved in monoterpenoid biosynthesis CYP2C19, CYP2C9 2 CYP2C19(6), CYP2C9(7) 1537915 13 13 13 3 0 4 5 3 1 0 0.37 0.67 1
7 RABPATHWAY Rab family GTPases regulate vesicle transport, endocytosis and exocytosis, and vesicle docking via interactions with the rabphilins. ACTA1, MEL, RAB11A, RAB1A, RAB2, RAB27A, RAB3A, RAB4A, RAB5A, RAB6A, RAB7, RAB9A 9 ACTA1(4), RAB11A(1), RAB1A(2), RAB27A(1), RAB3A(2), RAB4A(1), RAB5A(3) 3208378 14 14 14 1 5 4 2 2 1 0 0.034 0.68 1
8 NUCLEOTIDE_SUGARS_METABOLISM GALE, GALT, TGDS, UGDH, UXS1 5 GALT(2), TGDS(3), UGDH(4), UXS1(2) 2921501 11 11 11 0 0 1 5 4 1 0 0.1 0.7 1
9 HSA00031_INOSITOL_METABOLISM Genes involved in inositol metabolism ALDH6A1, TPI1 2 ALDH6A1(2), TPI1(4) 1256585 6 6 6 2 1 3 2 0 0 0 0.65 0.72 1
10 HSA00830_RETINOL_METABOLISM Genes involved in retinol metabolism ALDH1A1, ALDH1A2, BCMO1, RDH5 4 ALDH1A1(1), ALDH1A2(10), BCMO1(3), RDH5(1) 2966657 15 15 15 3 2 1 6 2 4 0 0.36 0.78 1
Methods & Data
Methods

In brief, we tabulate the number of mutations and the number of covered bases for each gene. The counts are broken down by mutation context category: four context categories that are discovered by MutSig, and one for indel and 'null' mutations, which include indels, nonsense mutations, splice-site mutations, and non-stop (read-through) mutations. For each gene, we calculate the probability of seeing the observed constellation of mutations, i.e. the product P1 x P2 x ... x Pm, or a more extreme one, given the background mutation rates calculated across the dataset. [1]

Download Results

In addition to the links below, the full results of the analysis summarized in this report can also be downloaded programmatically using firehose_get, or interactively from either the Broad GDAC website or TCGA Data Coordination Center Portal.

References
[1] TCGA, Integrated genomic analyses of ovarian carcinoma, Nature 474:609 - 615 (2011)