Analysis Overview
Thymoma (Primary solid tumor)
17 October 2017  |  None
Maintainer Information
Maintained by Broad Institute GDAC (Broad Institute of MIT & Harvard)
Overview
Introduction

This is an overview of Thymoma analysis pipelines from FireCloud run "17 October 2017".

Summary

Note: These results are offered to the community as an additional reference point, enabling a wide range of cancer biologists, clinical investigators, and genome and computational scientists to easily incorporate TCGA into the backdrop of ongoing research. While every effort is made to ensure that FireCloud input data and algorithms are of the highest possible quality, these analyses have not been reviewed by domain experts.

Results
  • Sequence and Copy Number Analyses

    • Analysis of mutagenesis by APOBEC cytidine deaminases (P-MACD).
      View Report | There are 123 tumor samples in this analysis. The Benjamini-Hochberg-corrected p-value for enrichment of the APOBEC mutation signature in 2 samples is <=0.05. Out of these, 2 have enrichment values >2, which implies that in such samples at least 50% of APOBEC signature mutations have been in fact made by APOBEC enzyme(s).

    • Mutation Assessor
      View Report | 

    • Mutation Signature Analysis
      View Report | Our analysis idenfied 2 solution(s) of mutational signatures across 123 samples by BayesNMF method.

    • SNP6 Copy number analysis (GISTIC2)
      View Report | There were 124 tumor samples used in this analysis: 41 significant arm-level results, 4 significant focal amplifications, and 24 significant focal deletions were found.

  • Correlations to Clinical Parameters

    • Correlation between aggregated molecular cancer subtypes and selected clinical features
      View Report | Testing the association between subtypes identified by 10 different clustering approaches and 8 clinical features across 124 patients, 27 significant findings detected with P value < 0.05 and Q value < 0.25.

    • Correlation between APOBEC groups and selected clinical features
      View Report | Testing the association between 'APOBEC ENRICH' and 8 clinical features across 123 patients, no significant finding detected with P value < 0.05 and Q value < 0.25.

    • Correlation between APOBEC signature variables and clinical features
      View Report | Testing the association between 3 variables and 8 clinical features across 123 samples, statistically thresholded by P value < 0.05 and Q value < 0.3, 3 clinical features related to at least one variables.

    • Correlation between copy number variation genes (focal events) and selected clinical features
      View Report | Testing the association between copy number variation 28 focal events and 8 clinical features across 124 patients, 46 significant findings detected with Q value < 0.25.

    • Correlation between copy number variations of arm-level result and selected clinical features
      View Report | Testing the association between copy number variation 52 arm-level events and 8 clinical features across 124 patients, 59 significant findings detected with Q value < 0.25.

    • Correlation between gene mutation status and selected clinical features
      View Report | Testing the association between mutation status of 5 genes and 8 clinical features across 123 patients, one significant finding detected with Q value < 0.25.

    • Correlation between mRNAseq expression and clinical features
      View Report | Testing the association between 18747 genes and 8 clinical features across 119 samples, statistically thresholded by P value < 0.05 and Q value < 0.3, 5 clinical features related to at least one genes.

    • Correlation between mutation rate and clinical features
      View Report | Testing the association between 2 variables and 9 clinical features across 123 samples, statistically thresholded by P value < 0.05 and Q value < 0.3, 3 clinical features related to at least one variables.

  • Clustering Analyses

    • Clustering of copy number data by focal peak region with absolute value: consensus NMF
      View Report | The most robust consensus NMF clustering of 124 samples using the 28 most variable genes was identified for k = 4 clusters. We computed the clustering for k = 2 to k = 10 and used the cophenetic correlation coefficient and the average silhouette width calculation to determine the robust clusters.

    • Clustering of copy number data by peak region with threshold value: consensus NMF
      View Report | The most robust consensus NMF clustering of 77 samples using the 28 most variable genes was identified for k = 3 clusters. We computed the clustering for k = 2 to k = 10 and used the cophenetic correlation coefficient and the average silhouette width calculation to determine the robust clusters.

    • Clustering of lincRNA expression: consensus hierarchical
      View Report | Median absolute deviation (MAD) was used to select 2500 most variable lincRNAs. Consensus ward linkage hierarchical clustering of 118 samples and 2500 lincRNAs identified 3 subtypes with the stability of the clustering increasing for k = 2 to k = 10.

    • Clustering of lincRNA expression: consensus NMF
      View Report | The most robust consensus NMF clustering of 119 samples using the 2500 most variable lincRNAs was identified for k = 3 clusters. We computed the clustering for k = 2 to k = 10 and used the cophenetic correlation coefficient and the average silhouette width calculation to determine the robust clusters.

    • Clustering of miR mature expression: consensus hierarchical
      View Report | Median absolute deviation (MAD) was used to select 338 most variable miRs. Consensus ward linkage hierarchical clustering of 122 samples and 338 miRs identified 5 subtypes with the stability of the clustering increasing for k = 2 to k = 10.

    • Clustering of miR mature expression: consensus NMF
      View Report | The most robust consensus NMF clustering of 123 samples using the 338 most variable miRs was identified for k = 3 clusters. We computed the clustering for k = 2 to k = 10 and used the cophenetic correlation coefficient and the average silhouette width calculation to determine the robust clusters.

    • Clustering of protein coding gene expression: consensus NMF
      View Report | The most robust consensus NMF clustering of 119 samples using the 2500 most variable genes was identified for k = 3 clusters. We computed the clustering for k = 2 to k = 10 and used the cophenetic correlation coefficient and the average silhouette width calculation to determine the robust clusters.

    • Clustering of Protein-coding gene expression: consensus hierarchical
      View Report | Median absolute deviation (MAD) was used to select 2500 most variable genes. Consensus ward linkage hierarchical clustering of 118 samples and 2500 genes identified 8 subtypes with the stability of the clustering increasing for k = 2 to k = 10.

  • Other Analyses

    • Identification of putative miR direct targets by sequencing data
      View Report | The CLR algorithm was applied on 848 miRs and 18747 mRNAs across 118 samples. After 2 filtering steps, the number of 46 miR:gene pairs were detected.

    • Methylation__HM450_Clustering_CNMF
      View Report | The most robust consensus NMF clustering of 124 samples using the 8509 most variable genes was identified for k = 3 clusters. We computed the clustering for k = 2 to k = 10 and used the cophenetic correlation coefficient and the average silhouette width calculation to determine the robust clusters.

    • Methylation__HM450_Clustering_Consensus_Plus
      View Report | Median absolute deviation (MAD) was used to select 2500 most variable genes. Consensus ward linkage hierarchical clustering of 123 samples and 2500 genes identified 6 subtypes with the stability of the clustering increasing for k = 2 to k = 10.

    • Mutation_MutSig2CV
      View Report | 

  • Other Correlation Analyses

    • Correlation between copy number variation genes (focal events) and molecular subtypes
      View Report | Testing the association between copy number variation 28 focal events and 10 molecular subtypes across 124 patients, 145 significant findings detected with P value < 0.05 and Q value < 0.25.

    • Correlation between copy number variations of arm-level result and molecular subtypes
      View Report | Testing the association between copy number variation 52 arm-level events and 10 molecular subtypes across 124 patients, 268 significant findings detected with P value < 0.05 and Q value < 0.25.

    • Correlation between gene mutation status and molecular subtypes
      View Report | Testing the association between mutation status of 5 genes and 10 molecular subtypes across 123 patients, 12 significant findings detected with P value < 0.05 and Q value < 0.25.

Methods & Data
Input
  • Summary Report Date = Thu Dec 14 14:14:18 2017

  • Protection = FALSE