Replicate Samples
2017_09_15 Data Snapshot

In many instances there is more than one aliquot for a given combination of individual, platform, and data type. However, only one aliquot may be ingested into Firehose. Therefore, a set of precedence rules is applied to select the most scientifically advantageous aliquot. Two filters are used for this purpose: an Analyte Replicate Filter and a Sort Replicate Filter.

Analyte Replicate Filter

The following precedence rules are applied when the aliquots have differing analytes.

For RNA aliquots, T analytes are dropped in favor of H and R analytes, since T is the inferior extraction protocol. If both H and R are encountered, H is chosen. This choice is somewhat arbitrary and subject to change, since it is not clear at present whether H or R is the better protocol. If there are multiple aliquots associated with the chosen RNA analyte, the aliquot with the later plate number is chosen.

For DNA aliquots, D analytes (native DNA) are preferred over G, W, or X (whole-genome amplified) analytes, unless the G, W, or X aliquot has a higher plate number.
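The analyte rules above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the Firehose implementation. The function names are hypothetical. The analyte code is assumed to be the last character of the fifth hyphen-separated barcode field and the plate identifier the sixth field. Plate identifiers are assumed to compare as plain strings. For DNA, the sketch reads the rule as "latest plate wins, and D wins a tie on plate", which is one possible interpretation.

```python
# Hypothetical sketch of the analyte precedence rules; not Firehose code.
RNA_PREFERENCE = ["H", "R", "T"]      # most preferred first
DNA_ANALYTES = {"D", "G", "W", "X"}   # D = native, G/W/X = whole-genome amplified


def analyte(barcode):
    """Analyte code: last character of the portion/analyte field (assumed layout)."""
    return barcode.split("-")[4][-1]


def plate(barcode):
    """Plate identifier: sixth hyphen-separated field (assumed layout)."""
    return barcode.split("-")[5]


def analyte_replicate_filter(barcodes):
    """Return the aliquots that survive the analyte rules (possibly several)."""
    barcodes = list(barcodes)
    if not barcodes:
        return []
    analytes = {analyte(b) for b in barcodes}

    if analytes <= set(RNA_PREFERENCE):
        # Keep only the most preferred RNA analyte present (H, then R, then T).
        best = next(a for a in RNA_PREFERENCE if a in analytes)
        candidates = [b for b in barcodes if analyte(b) == best]
        # Among those, keep the later plate (string comparison is an assumption).
        top = max(plate(b) for b in candidates)
        return [b for b in candidates if plate(b) == top]

    if analytes <= DNA_ANALYTES:
        # The latest plate wins; on a plate tie, native DNA (D) is preferred.
        top = max(plate(b) for b in barcodes)
        latest = [b for b in barcodes if plate(b) == top]
        native = [b for b in latest if analyte(b) == "D"]
        return native or latest

    # Unrecognized or mixed analytes: leave the set unchanged.
    return barcodes
```

Applied to the two TCGA-AG-3892 aliquots listed in Table 1, this sketch keeps TCGA-AG-3892-01A-01D-1989-10. Any case where more than one aliquot survives is left for the Sort Replicate Filter.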

Sort Replicate Filter

The following precedence rule is applied when the analyte filter still leaves more than one aliquot. The sort filter chooses the aliquot whose barcode has the highest lexicographical sort value, to ensure that the barcode with the highest portion and/or plate number is selected when all other barcode fields are identical.
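A minimal Python sketch of this rule, assuming barcodes are compared as plain strings (the function name and the example barcodes are hypothetical):

```python
# Hypothetical sketch of the sort rule; not Firehose code.
def sort_replicate_filter(barcodes):
    """Return the barcode with the highest lexicographical sort value."""
    # Python compares strings character by character, so when every earlier
    # field is identical, the higher portion or plate field decides the result.
    return max(barcodes)


# Two made-up aliquots that differ only in the portion number (01 vs 02).
replicates = [
    "TCGA-XX-0001-01A-01D-0100-01",
    "TCGA-XX-0001-01A-02D-0100-01",
]
chosen = sort_replicate_filter(replicates)
print(chosen)
```

Because the comparison runs left to right, the leftmost differing field decides the outcome, which is why the rule only has its intended effect when all fields before the portion and plate are identical.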

Table 1.

Participant.Id  Tumor.Type  Annotation             Filter.Reason             Removed.Samples               Chosen.Sample
TCGA-AF-2689    READ        CNV__snp6              Analyte Replicate Filter  TCGA-AF-2689-01A-01D-0819-01  TCGA-AF-2689-01A-01D-1549-01
TCGA-AF-2689    READ        CNV__unfiltered__snp6  Analyte Replicate Filter  TCGA-AF-2689-01A-01D-0819-01  TCGA-AF-2689-01A-01D-1549-01
TCGA-AF-2691    READ        CNV__snp6              Analyte Replicate Filter  TCGA-AF-2691-01A-01D-0819-01  TCGA-AF-2691-01A-01D-1549-01
TCGA-AF-2691    READ        CNV__unfiltered__snp6  Analyte Replicate Filter  TCGA-AF-2691-01A-01D-0819-01  TCGA-AF-2691-01A-01D-1549-01
TCGA-AF-3400    READ        CNV__snp6              Analyte Replicate Filter  TCGA-AF-3400-01A-01D-0819-01  TCGA-AF-3400-01A-01D-1549-01
TCGA-AF-3400    READ        CNV__unfiltered__snp6  Analyte Replicate Filter  TCGA-AF-3400-01A-01D-0819-01  TCGA-AF-3400-01A-01D-1549-01
TCGA-AG-3892    READ        SNV__mutect            Analyte Replicate Filter  TCGA-AG-3892-01A-01W-1073-09  TCGA-AG-3892-01A-01D-1989-10
TCGA-AG-A008    READ        SNV__mutect            Analyte Replicate Filter  TCGA-AG-A008-01A-01W-A00K-09  TCGA-AG-A008-01A-01D-A183-10
TCGA-AG-A00C    READ        SNV__mutect            Analyte Replicate Filter  TCGA-AG-A00C-01A-01W-A00K-09  TCGA-AG-A00C-01A-01D-A183-10
TCGA-AG-A015    READ        SNV__mutect            Analyte Replicate Filter  TCGA-AG-A015-01A-01W-A00K-09  TCGA-AG-A015-01A-01D-A183-10
TCGA-AG-A02N    READ        SNV__mutect            Analyte Replicate Filter  TCGA-AG-A02N-01A-11W-A096-10  TCGA-AG-A02N-01A-11D-A183-10