This report serves to describe the mutational landscape and properties of a given individual set, as well as rank genes and genesets according to mutational significance. MutSig vS2N was used to generate the results found in this report.
-
Working with individual set: STAD
The input for this pipeline is a set of individuals with the following files associated for each:
-
An annotated .maf file describing the mutations called for the respective individual, and their properties.
-
A .wig file that contains information about the coverage of the sample.
-
MAF used for this analysis:STAD.final_analysis_set.maf
-
Significantly mutated genes (q ≤ 0.1): 52
Column Descriptions:
-
N = number of sequenced bases in this gene across the individual set
-
nnon = number of (nonsilent) mutations in this gene across the individual set
-
nnull = number of (nonsilent) null mutations in this gene across the individual set
-
nflank = number of noncoding mutations from this gene's flanking region, across the individual set
-
nsil = number of silent mutations in this gene across the individual set
-
p = p-value (overall)
-
q = q-value, False Discovery Rate (Benjamini-Hochberg procedure)
gene | N | nflank | nsil | nnon | nnull | p | q |
---|---|---|---|---|---|---|---|
TP53 | 16758 | 4 | 2 | 76 | 24 | 2.9e-66 | 5.4e-62 |
CBWD1 | 15188 | 4 | 0 | 21 | 19 | 9.9e-43 | 9.3e-39 |
KRAS | 13759 | 4 | 0 | 18 | 0 | 4.4e-40 | 2.8e-36 |
RPL22 | 6650 | 1 | 0 | 10 | 10 | 2e-31 | 9.6e-28 |
FGF22 | 2128 | 0 | 1 | 6 | 6 | 1.1e-27 | 4.2e-24 |
PTH2 | 2261 | 0 | 0 | 6 | 0 | 1.2e-24 | 3.7e-21 |
ARID1A | 77406 | 8 | 4 | 34 | 24 | 1.7e-17 | 4.7e-14 |
ZNF579 | 2806 | 0 | 0 | 5 | 0 | 1.1e-15 | 2.5e-12 |
NKPD1 | 5852 | 0 | 0 | 6 | 0 | 9.9e-15 | 2.1e-11 |
ZNF703 | 3810 | 0 | 0 | 5 | 0 | 9e-12 | 1.7e-08 |
RNF43 | 26673 | 4 | 4 | 16 | 11 | 1e-11 | 1.7e-08 |
ITPRIPL2 | 14351 | 0 | 0 | 13 | 0 | 3.6e-10 | 5.7e-07 |
PHF2 | 37892 | 10 | 4 | 15 | 11 | 2.4e-09 | 3.5e-06 |
RPLP0 | 14158 | 1 | 0 | 6 | 0 | 3.6e-09 | 4.8e-06 |
PIK3CA | 56113 | 14 | 5 | 45 | 2 | 4.4e-09 | 5.6e-06 |
RAB27A | 11172 | 1 | 0 | 7 | 0 | 8.3e-09 | 9.8e-06 |
IRF2 | 17230 | 3 | 1 | 17 | 7 | 1.3e-08 | 0.000014 |
PTEN | 21053 | 8 | 5 | 12 | 3 | 2.9e-08 | 3e-05 |
CRYGC | 7980 | 0 | 0 | 8 | 0 | 7.7e-08 | 0.000077 |
HLA-B | 12708 | 9 | 0 | 14 | 8 | 9.4e-08 | 0.000089 |
ACVR2A | 25257 | 7 | 0 | 18 | 14 | 1.2e-07 | 0.0001 |
VEZT | 30016 | 8 | 0 | 10 | 2 | 3.5e-07 | 0.0003 |
PGM5 | 20834 | 8 | 0 | 25 | 2 | 4.6e-07 | 0.00037 |
DEM1 | 17084 | 0 | 0 | 9 | 0 | 7.3e-07 | 0.00057 |
SMAD2 | 21932 | 8 | 0 | 10 | 7 | 9.3e-07 | 0.00068 |
DUSP22 | 9044 | 12 | 0 | 9 | 0 | 9.4e-07 | 0.00068 |
C4A | 29815 | 0 | 1 | 8 | 0 | 2.3e-06 | 0.0016 |
OCEL1 | 8778 | 3 | 0 | 6 | 5 | 3.8e-06 | 0.0026 |
MLL4 | 65274 | 4 | 3 | 34 | 11 | 4.7e-06 | 0.0031 |
UPF3A | 16020 | 6 | 2 | 7 | 6 | 7.3e-06 | 0.0046 |
DNAJC6 | 41363 | 7 | 1 | 16 | 1 | 9.4e-06 | 0.0057 |
BCL2L13 | 21014 | 9 | 0 | 9 | 0 | 1e-05 | 0.0061 |
SLC25A2 | 13832 | 1 | 1 | 10 | 0 | 0.000011 | 0.0065 |
MLL2 | 164943 | 6 | 8 | 39 | 13 | 0.000012 | 0.0066 |
SPERT | 14158 | 1 | 3 | 12 | 2 | 0.000014 | 0.0076 |
In brief, we tabulate the number of mutations and the number of covered bases for each gene. The counts are broken down by mutation context category: four context categories that are discovered by MutSig, and one for indel and 'null' mutations, which include indels, nonsense mutations, splice-site mutations, and non-stop (read-through) mutations. For each gene, we calculate the probability of seeing the observed constellation of mutations, i.e. the product P1 x P2 x ... x Pm, or a more extreme one, given the background mutation rates calculated across the dataset. [1]
This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.