Mutation Analysis (MutSigCV v0.6)

Prostate Adenocarcinoma (Primary solid tumor)

22 February 2013 | analyses__2013_02_22

Maintainer Information

Citation Information

Maintained by Dan DiCara (Broad Institute)

Cite as Broad Institute TCGA Genome Data Analysis Center (2013): Mutation Analysis (MutSigCV v0.6). Broad Institute of MIT and Harvard. doi:10.7908/C1P26WCX

Overview

Introduction

This report serves to describe the mutational landscape and properties of a given individual set, as well as rank genes and genesets according to mutational significance. MutSigCV v0.6 was used to generate the results found in this report.

Working with individual set: PRAD-TP
Number of patients in set: 83

Input

The input for this pipeline is a set of individuals with the following files associated for each:

An annotated .maf file describing the mutations called for the respective individual, and their properties.
A .wig file that contains information about the coverage of the sample.

Summary

MAF used for this analysis:PRAD-TP.final_analysis_set.maf
Significantly mutated genes (q ≤ 0.1): 1

Results

Target Coverage for Each Individual

The x axis represents the samples. The y axis represents the exons, one row per exon, and they are sorted by average coverage across samples. For exons with exactly the same average coverage, they are sorted next by the %GC of the exon. (The secondary sort is especially useful for the zero-coverage exons at the bottom).

Figure 1.

Distribution of Mutation Counts, Coverage, and Mutation Rates Across Samples

Figure 2. Patients counts and rates file used to generate this plot: PRAD-TP.patients.counts_and_rates.txt

CoMut Plot

Figure 3. Get High-res Image The matrix in the center of the figure represents individual mutations in patient samples, color-coded by type of mutation, for the significantly mutated genes. The rate of synonymous and non-synonymous mutations is displayed at the top of the matrix. The barplot on the left of the matrix shows the number of mutations in each gene. The percentages represent the fraction of tumors with at least one mutation in the specified gene. The barplot to the right of the matrix displays the q-values for the most significantly mutated genes. The purple boxplots below the matrix (only displayed if required columns are present in the provided MAF) represent the distributions of allelic fractions observed in each sample. The plot at the bottom represents the base substitution distribution of individual samples, using the same categories that were used to calculate significance.

Significantly Mutated Genes

Column Descriptions:

nnon = number of (nonsilent) mutations in this gene across the individual set
npat = number of patients (individuals) with at least one nonsilent mutation
nsite = number of unique sites having a non-silent mutation
nflank = number of noncoding mutations from this gene's flanking region, across the individual set
nsil = number of silent mutations in this gene across the individual set
p = p-value (overall)
q = q-value, False Discovery Rate (Benjamini-Hochberg procedure)

Table 1. Get Full Table A Ranked List of Significantly Mutated Genes. Number of significant genes found: 1. Number of genes displayed: 35. Click on a gene name to display its stick figure depicting the distribution of mutations and mutation types across the chosen gene (this feature may not be available for all significant genes).

gene	Nnon	Nsil	Nflank	nnon	npat	nsite	nsil	nnei	fMLE	p	score	time	q
NKX3-1	32370	9794	1440	5	5	5	0	11	0	3.1e-07	21	0.072	0.0056
MLL3	952425	270580	74400	10	7	9	0	13	0	0.000016	32	0.078	0.14
SPOP	75945	19920	11040	4	4	3	0	20	0	6e-05	14	0.069	0.37
CDKN1B	38595	10707	10440	2	2	2	0	20	0	0.0003	13	0.066	1
KRT25	88644	25730	9840	3	3	3	0	20	0	0.00034	13	0.07	1
ESCO1	167494	42828	18720	3	3	3	0	20	0	0.00053	15	0.072	1
CPEB4	143341	40836	12120	3	3	3	0	20	0	0.00057	15	0.073	1
YBX1	52622	14608	6960	4	3	2	0	20	0	0.00057	11	0.063	1
FBXO4	66898	17596	7440	2	2	2	0	20	0	0.00059	11	0.063	1
BANF2	18758	4648	2340	2	2	2	0	20	0.74	0.0012	8	0.054	1
GPATCH4	94454	24651	11460	2	2	1	0	15	0	0.0013	13	0.067	1
C20orf111	56274	17015	3720	2	2	2	0	20	0	0.002	10	0.063	1
SLITRK4	162265	45899	1380	3	3	2	0	11	2.1	0.0029	17	0.075	1
LCE2D	21414	6142	1440	2	2	2	0	20	0.67	0.003	7.3	0.051	1
PRR21	40172	14525	900	4	4	4	0	5	0	0.003	12	0.067	1
FGFR1OP2	52207	12865	6960	2	2	2	0	20	0	0.0031	7.5	0.055	1
TBL1XR1	88561	24153	10620	2	2	2	0	20	0.72	0.0031	12	0.07	1
PTEN	80842	19422	10440	3	3	3	0	20	0.79	0.0033	12	0.069	1
NEUROD6	66151	17845	1440	2	2	2	0	20	0	0.0035	7.4	0.054	1
ZMYM3	217543	62416	20340	5	4	5	0	20	1.9	0.0041	19	0.073	1
ACTR8	115785	31789	14100	2	2	2	0	20	0	0.0043	12	0.068	1
PLCH2	145333	43077	9840	3	3	3	0	20	0	0.0046	12	0.07	1
BBS9	176873	49634	27360	3	3	2	0	9	0.86	0.0049	14	0.071	1
TP53	78435	22908	12360	5	5	5	0	4	0	0.0053	16	0.071	1
MMP12	80759	20750	7560	2	2	2	0	20	0	0.0059	7.2	0.053	1
SFRS11	104912	31789	15900	2	2	1	0	8	0	0.0059	12	0.068	1
NANOS1	9213	3071	0	1	1	1	0	20	0	0.0062	7.2	0.05	1
SFRS2	58681	19090	6900	2	2	2	1	20	1.8	0.0063	12	0.07	1
LCN2	39923	10956	7200	2	2	2	0	20	0	0.0063	7.2	0.053	1
INADL	357730	102422	49920	4	4	4	0	13	0	0.0065	14	0.071	1
ZNF181	102754	24568	1920	3	2	3	0	20	0	0.0068	9.6	0.061	1
FOXA1	67728	19837	2580	3	2	3	0	20	1.8	0.007	12	0.069	1
GLTSCR1	58515	19422	4260	2	2	2	0	20	0	0.0079	7	0.055	1
SHFM1	15189	3071	3840	1	1	1	0	20	0.93	0.0082	6.9	0.05	1
NKX2-4	14442	4233	960	1	1	1	0	20	0.99	0.0085	6.9	0.052	1

NKX3-1

Figure S1. This figure depicts the distribution of mutations and mutation types across the NKX3-1 significant gene.

Methods & Data

Methods

In brief, we tabulate the number of mutations and the number of covered bases for each gene. The counts are broken down by mutation context category: four context categories that are discovered by MutSig, and one for indel and 'null' mutations, which include indels, nonsense mutations, splice-site mutations, and non-stop (read-through) mutations. For each gene, we calculate the probability of seeing the observed constellation of mutations, i.e. the product P1 x P2 x ... x Pm, or a more extreme one, given the background mutation rates calculated across the dataset. [1]

Download Results

This is an experimental feature. The full results of the analysis summarized in this report can be downloaded from the TCGA Data Coordination Center.

References

[1] TCGA, Integrated genomic analyses of ovarian carcinoma, Nature 474:609 - 615 (2011)

Made with Nozzle