Cholangiocarcinoma (CHOL) Samples Report
2014_07_15 Data Snapshot
Overview
Introduction

The Broad GDAC mirrors data from the DCC on a daily basis. Although all data is mirrored, not every sample is ingested into Firehose. Three main mechanisms filter samples so that only the most scientifically relevant ones make it into our standard data and analysis runs: redactions, replicate filtering, and blacklisting. This report summarizes the data that is ingested into Firehose, describes the three filtering mechanisms, lists the samples that were removed, and gives all available annotations from the DCC's Annotation Manager.

Summary

There were 0 redactions, 0 replicate aliquots, 0 blacklisted aliquots, and 0 FFPE aliquots. The table below gives the counts of samples that were ingested into Firehose after filtering out redactions, replicates, and blacklisted data, and after segregating FFPE samples.

Table 1.  This table provides a breakdown of sample counts on a per sample type and, if applicable, per subtype basis. Each count is a link to a table containing a list of the samples that comprise that count and details pertaining to each individual sample (e.g. platform, sequencing center, etc.). Please note, there are usually multiple protocols per data type, so there are typically many more rows than the count implies.

Sample Type  BCR  Clinical  CN  LowP  Methylation  mRNA  mRNASeq  miR  miRSeq  RPPA  MAF
TP           36   0         0   0     0            0     0        0    0       0     0
Totals       36   0         0   0     0            0     0        0    0       0     0
CHOL Primary Solid Tumor BCR Data

Table S1. 

TCGA Barcode  Platform                             Center                          Data Level  Protocol
TCGA-3X-AAV9  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-3X-AAVA  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-3X-AAVB  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-3X-AAVC  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-3X-AAVE  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-4G-AAZO  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-4G-AAZT  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2G  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2H  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2I  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2O  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2Q  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2R  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2T  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2U  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2W  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2X  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA2Z  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA30  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA31  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA33  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA34  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA36  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA38  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W5-AA39  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-W6-AA0S  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-WD-A7RX  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-YR-A95A  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZD-A8I3  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZH-A8Y1  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZH-A8Y2  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZH-A8Y4  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZH-A8Y5  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZH-A8Y6  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZH-A8Y8  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen
TCGA-ZU-A8S4  Biospecimen Metadata - Complete Set  Nationwide Children's Hospital  1           biospecimen

The sample type short letter codes in the table above are defined in the following list.

  • TP: Primary Solid Tumor

  • TR: Recurrent Solid Tumor

  • TB: Primary Blood Derived Cancer - Peripheral Blood

  • TAP: Additional - New Primary

  • TM: Metastatic

  • TAM: Additional Metastatic

  • NB: Blood Derived Normal

  • NT: Solid Tissue Normal
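As a rough illustration of how per-type counts such as those in Table 1 relate to these codes, the sketch below tallies a list of (barcode, short code) pairs. The function name `count_by_sample_type` and the input layout are hypothetical and are not taken from Firehose itself.

```python
from collections import Counter

# Short letter codes and their definitions, copied from the list above.
SAMPLE_TYPE_CODES = {
    "TP": "Primary Solid Tumor",
    "TR": "Recurrent Solid Tumor",
    "TB": "Primary Blood Derived Cancer - Peripheral Blood",
    "TAP": "Additional - New Primary",
    "TM": "Metastatic",
    "TAM": "Additional Metastatic",
    "NB": "Blood Derived Normal",
    "NT": "Solid Tissue Normal",
}


def count_by_sample_type(samples):
    """Tally samples per short code.

    `samples` is an iterable of (barcode, code) pairs; this layout is an
    assumption made for illustration only.
    """
    counts = Counter()
    for barcode, code in samples:
        if code not in SAMPLE_TYPE_CODES:
            raise ValueError(f"unknown sample type code {code!r} for {barcode}")
        counts[code] += 1
    return counts
```

Codes that never occur simply report a count of zero, which mirrors the zero entries in the summary table.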

The following platforms are outdated and are not included in the counts depicted in the table above.

  • Agilent SurePrint G3 Human CGH Microarray Kit 1x1M

  • Agilent Human Genome CGH Microarray 244A

  • Agilent Human Genome CGH Custom Microarray 2x415K

  • Affymetrix Human Exon 1.0 ST Array

  • Illumina DNA Methylation OMA002 Cancer Panel I

  • Illumina DNA Methylation OMA003 Cancer Panel I

  • Illumina Human1M-Duo BeadChip

  • Illumina 550K Infinium HumanHap550 SNP Chip

Figure 1.  This figure depicts the distribution of available data on a per participant basis.

Results
FFPE Cases
Additional Annotations from the DCC's Annotations Manager
Methods & Data
Redactions and Other Annotations

Annotation data was taken from the TCGA Data Portal using the query string:

https://tcga-data.nci.nih.gov/annotations/resources/searchannotations/json?item=TCGA

Redaction information was generated by filtering for the annotationClassificationName "Redaction".

FFPE information was generated by filtering for "FFPE" in the annotation note text.

Additional FFPE cases were identified from clinical data.

The remaining annotations were sorted into sections by annotationClassificationName.
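The filtering steps above can be sketched roughly as follows. This is a minimal illustration, not the Firehose implementation: it assumes each annotation has already been flattened into a dict with the hypothetical keys "annotationClassificationName" and "noteText" (the real JSON layout returned by the query may differ), and it assumes a redaction takes precedence over an FFPE note when both apply.

```python
def classify_annotations(annotations):
    """Split annotation records into redactions, FFPE notes, and the rest.

    Each record is assumed to be a dict with the hypothetical keys
    "annotationClassificationName" and "noteText".
    """
    redactions = []
    ffpe = []
    other = {}  # remaining records, grouped by classification name
    for record in annotations:
        classification = record.get("annotationClassificationName", "")
        note = record.get("noteText", "")
        if classification == "Redaction":
            # Assumption: redactions are checked before FFPE notes.
            redactions.append(record)
        elif "FFPE" in note:
            ffpe.append(record)
        else:
            other.setdefault(classification, []).append(record)
    return redactions, ffpe, other
```

FFPE cases found only in clinical data are outside the scope of this sketch and would need to be merged in separately.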

Preprocessors
mRNA Preprocessor

The mRNA preprocess median module chooses the matrix for the platform (Affymetrix HG U133, Affymetrix Exon Array, or Agilent Gene Expression) with the largest number of samples.
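The selection rule can be illustrated with a short sketch. The function name `choose_largest_platform` and the dict-of-dicts matrix layout are assumptions made for this example; how the actual module breaks ties is not stated here, so the sketch simply keeps the first platform encountered.

```python
def choose_largest_platform(matrices):
    """Return the (platform, matrix) pair with the most samples.

    `matrices` maps a platform name to a {sample_id: values} dict; this
    layout is hypothetical. On ties, the first platform in iteration
    order wins (an assumption, not documented behavior).
    """
    if not matrices:
        raise ValueError("no platform matrices supplied")
    platform = max(matrices, key=lambda name: len(matrices[name]))
    return platform, matrices[platform]
```

For example, given one platform with two samples and another with one, the two-sample matrix is returned.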

mRNAseq Preprocessor

The mRNAseq preprocessor picks the "scaled_estimate" (RSEM) value from the Illumina HiSeq/GA2 mRNAseq level_3 (v2) data set and builds a log2-transformed mRNAseq matrix for downstream analysis. If samples overlap between the two platforms, the samples from Illumina HiSeq are selected. The pipeline also creates a log2-transformed RPKM matrix from the HiSeq/GA2 mRNAseq level 3 (v1) data set.
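A minimal sketch of the merge-and-transform step is shown below. The function name `build_log2_matrix` and the nested-dict layout are hypothetical, and the treatment of zero values (mapped to NaN here) is an assumption, since the handling of zeros is not described in this report.

```python
import math


def build_log2_matrix(hiseq, ga):
    """Merge two platforms and log2-transform the scaled_estimate values.

    `hiseq` and `ga` map sample_id -> {gene: scaled_estimate}; this layout
    is hypothetical. Samples present on both platforms take the HiSeq values.
    """
    merged = dict(ga)
    merged.update(hiseq)  # overlapping samples are overwritten by HiSeq
    return {
        sample: {
            # Assumption: non-positive values become NaN rather than -inf.
            gene: math.log2(value) if value > 0 else float("nan")
            for gene, value in genes.items()
        }
        for sample, genes in merged.items()
    }
```

Building the merged dict from the GA data first and then updating it with HiSeq keeps the platform preference in a single line.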

miRseq Preprocessor

The miRseq preprocessor picks the "RPM" (reads per million miRNA precursor reads) values from the Illumina HiSeq/GA miRseq Level_3 data set and builds a matrix of log2-transformed values.

Methylation Preprocessor

The methylation preprocessor filters methylation data for use in downstream pipelines. To learn more about this preprocessor, please visit the documentation.