This report serves to describe the mutational landscape and properties of a given individual set, as well as rank genes and genesets according to mutational significance. MutSig vS2N was used to generate the results found in this report.
-
Working with individual set: HNSC-TP
The input for this pipeline is a set of individuals with the following files associated for each:
-
An annotated .maf file describing the mutations called for the respective individual, and their properties.
-
A .wig file that contains information about the coverage of the sample.
-
MAF used for this analysis:HNSC-TP.final_analysis_set.maf
-
Significantly mutated genes (q ≤ 0.1): 61
Column Descriptions:
-
N = number of sequenced bases in this gene across the individual set
-
nnon = number of (nonsilent) mutations in this gene across the individual set
-
nnull = number of (nonsilent) null mutations in this gene across the individual set
-
nflank = number of noncoding mutations from this gene's flanking region, across the individual set
-
nsil = number of silent mutations in this gene across the individual set
-
p = p-value (overall)
-
q = q-value, False Discovery Rate (Benjamini-Hochberg procedure)
gene | N | nflank | nsil | nnon | nnull | p | q |
---|---|---|---|---|---|---|---|
ZNF737 | 848 | 0 | 2 | 11 | 2 | 0 | 0 |
POTEC | 764 | 0 | 1 | 15 | 1 | 0 | 0 |
KRTAP4-11 | 184 | 0 | 2 | 7 | 0 | 0 | 0 |
PRAMEF11 | 624 | 0 | 0 | 10 | 0 | 0 | 0 |
ANKRD30B | 964 | 0 | 1 | 10 | 2 | 0 | 0 |
ZNF844 | 644 | 0 | 0 | 6 | 0 | 0 | 0 |
CDKN2A | 18272 | 0 | 0 | 62 | 49 | 0 | 0 |
FAM75C1 | 1340 | 0 | 0 | 10 | 2 | 0 | 0 |
NBPF9 | 1208 | 0 | 1 | 9 | 2 | 0 | 0 |
ZNF716 | 724 | 0 | 2 | 7 | 0 | 0 | 0 |
DCAF8L2 | 424 | 0 | 1 | 5 | 0 | 0 | 0 |
RGPD4 | 1784 | 0 | 1 | 8 | 3 | 0 | 0 |
TP53 | 38556 | 0 | 5 | 244 | 95 | 2e-286 | 2.9e-283 |
ZNF860 | 1532 | 0 | 0 | 5 | 0 | 2.3e-103 | 3.1e-100 |
JUB | 32738 | 0 | 1 | 18 | 12 | 8.2e-86 | 1e-82 |
PIK3CA | 128834 | 0 | 0 | 64 | 0 | 8.1e-76 | 9.5e-73 |
CT47B1 | 284 | 0 | 3 | 4 | 1 | 2.3e-48 | 2.6e-45 |
UBTFL1 | 400 | 0 | 0 | 4 | 0 | 1.1e-39 | 1.2e-36 |
FAM75A6 | 1100 | 0 | 4 | 13 | 0 | 4.6e-38 | 4.6e-35 |
FAT1 | 498120 | 0 | 2 | 88 | 66 | 6.4e-38 | 6e-35 |
MLL2 | 382710 | 0 | 3 | 59 | 37 | 5.5e-29 | 4.9e-26 |
TGFBR2 | 65790 | 0 | 1 | 11 | 7 | 4.5e-28 | 3.9e-25 |
ZNF286B | 696 | 0 | 0 | 4 | 0 | 3.5e-26 | 2.9e-23 |
NSD1 | 276934 | 0 | 1 | 38 | 24 | 5.4e-25 | 4.2e-22 |
EYS | 68854 | 0 | 2 | 13 | 0 | 6.2e-23 | 4.7e-20 |
HLA-A | 33044 | 0 | 2 | 9 | 8 | 1.6e-22 | 1.1e-19 |
NOTCH1 | 159614 | 0 | 5 | 60 | 23 | 1e-20 | 7.2e-18 |
TRIP11 | 235306 | 0 | 3 | 20 | 1 | 3.8e-15 | 2.5e-12 |
CASP8 | 71910 | 0 | 0 | 25 | 13 | 1.2e-13 | 7.6e-11 |
RGPD3 | 1676 | 0 | 1 | 4 | 1 | 2.6e-12 | 1.6e-09 |
ZNF845 | 1472 | 0 | 4 | 7 | 0 | 4.8e-12 | 2.9e-09 |
HRAS | 20820 | 0 | 0 | 11 | 0 | 7e-12 | 4.1e-09 |
ANKRD36 | 15262 | 0 | 3 | 5 | 1 | 3.9e-11 | 2.2e-08 |
EPHA2 | 92738 | 0 | 0 | 17 | 10 | 4.6e-11 | 2.5e-08 |
CD248 | 48054 | 0 | 2 | 14 | 0 | 1.4e-08 | 7.7e-06 |
In brief, we tabulate the number of mutations and the number of covered bases for each gene. The counts are broken down by mutation context category: four context categories that are discovered by MutSig, and one for indel and 'null' mutations, which include indels, nonsense mutations, splice-site mutations, and non-stop (read-through) mutations. For each gene, we calculate the probability of seeing the observed constellation of mutations, i.e. the product P1 x P2 x ... x Pm, or a more extreme one, given the background mutation rates calculated across the dataset. [1]
This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.