This report serves to describe the mutational landscape and properties of a given individual set, as well as rank genes and genesets according to mutational significance. MutSig vS2N was used to generate the results found in this report.
-
Working with individual set: BLCA
The input for this pipeline is a set of individuals with the following files associated for each:
-
An annotated .maf file describing the mutations called for the respective individual, and their properties.
-
A .wig file that contains information about the coverage of the sample.
-
MAF used for this analysis:BLCA.final_analysis_set.maf
-
Significantly mutated genes (q ≤ 0.1): 22
Column Descriptions:
-
N = number of sequenced bases in this gene across the individual set
-
nnon = number of (nonsilent) mutations in this gene across the individual set
-
nnull = number of (nonsilent) null mutations in this gene across the individual set
-
nflank = number of noncoding mutations from this gene's flanking region, across the individual set
-
nsil = number of silent mutations in this gene across the individual set
-
p = p-value (overall)
-
q = q-value, False Discovery Rate (Benjamini-Hochberg procedure)
gene | N | nflank | nsil | nnon | nnull | p | q |
---|---|---|---|---|---|---|---|
TP53 | 3528 | 0 | 1 | 14 | 3 | 0 | 0 |
KDM6A | 12769 | 1 | 2 | 6 | 6 | 5.7e-132 | 5.4e-128 |
ARID1A | 16296 | 0 | 1 | 8 | 5 | 1.2e-53 | 7.3e-50 |
ASXL2 | 12589 | 0 | 0 | 6 | 2 | 2.6e-18 | 1.2e-14 |
HRNR | 15146 | 0 | 0 | 6 | 0 | 2.4e-15 | 9.2e-12 |
FGA | 8013 | 0 | 0 | 5 | 0 | 6.3e-15 | 2e-11 |
FLG | 30575 | 0 | 2 | 8 | 0 | 1.1e-14 | 3e-11 |
TTN | 357098 | 1 | 3 | 34 | 4 | 2.7e-14 | 6.3e-11 |
RPAP1 | 10651 | 0 | 0 | 5 | 0 | 2e-11 | 4.3e-08 |
FBXW7 | 8047 | 0 | 0 | 6 | 2 | 4.3e-11 | 8.1e-08 |
MLL | 36311 | 0 | 1 | 9 | 2 | 1.3e-10 | 2.2e-07 |
GOLGB1 | 34742 | 0 | 2 | 12 | 3 | 1.7e-10 | 2.7e-07 |
ADAMTS12 | 15260 | 0 | 1 | 4 | 0 | 9e-10 | 1.3e-06 |
DCC | 13602 | 0 | 1 | 5 | 0 | 4.2e-09 | 5.7e-06 |
MLL2 | 34764 | 0 | 0 | 6 | 6 | 2.6e-08 | 0.000032 |
MACF1 | 85714 | 2 | 0 | 10 | 2 | 1.2e-07 | 0.00014 |
PRX | 9460 | 0 | 0 | 9 | 1 | 1.8e-07 | 0.0002 |
ABCA10 | 17600 | 1 | 0 | 5 | 2 | 4.6e-07 | 0.00049 |
HCN1 | 8148 | 1 | 1 | 5 | 3 | 2e-06 | 0.002 |
SPTA1 | 24718 | 0 | 0 | 5 | 0 | 0.000024 | 0.023 |
APOB | 49173 | 0 | 0 | 6 | 0 | 0.000031 | 0.028 |
TPR | 26207 | 1 | 1 | 5 | 0 | 0.000045 | 0.038 |
BCLAF1 | 9083 | 0 | 0 | 5 | 1 | 0.00026 | 0.22 |
HMCN1 | 55209 | 3 | 0 | 9 | 2 | 0.00028 | 0.22 |
DNAH5 | 49325 | 0 | 1 | 7 | 1 | 0.00032 | 0.24 |
SACS | 47870 | 0 | 2 | 6 | 2 | 0.00092 | 0.67 |
OTUD7A | 4967 | 0 | 0 | 4 | 0 | 0.0012 | 0.85 |
XPR1 | 7476 | 0 | 1 | 5 | 2 | 0.0014 | 0.97 |
SPATS2 | 5365 | 0 | 0 | 4 | 1 | 0.002 | 1 |
KPNA3 | 5639 | 1 | 0 | 4 | 1 | 0.0027 | 1 |
NFE2L2 | 6020 | 0 | 0 | 4 | 0 | 0.004 | 1 |
MLL3 | 45342 | 0 | 0 | 7 | 2 | 0.0066 | 1 |
SYNE1 | 91835 | 0 | 3 | 8 | 1 | 0.012 | 1 |
ERCC2 | 7666 | 0 | 0 | 4 | 0 | 0.012 | 1 |
CUL1 | 8607 | 0 | 0 | 4 | 0 | 0.02 | 1 |
In brief, we tabulate the number of mutations and the number of covered bases for each gene. The counts are broken down by mutation context category: four context categories that are discovered by MutSig, and one for indel and 'null' mutations, which include indels, nonsense mutations, splice-site mutations, and non-stop (read-through) mutations. For each gene, we calculate the probability of seeing the observed constellation of mutations, i.e. the product P1 x P2 x ... x Pm, or a more extreme one, given the background mutation rates calculated across the dataset. [1]
This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.