[1] "libdir: /xchip/tcga/gdac_prod/applications/process_mgmt/firehose_task_registry/tcga-gdac/ClinicalAnalysisAllGenes_new/broadinstitute.org/cancer.genome.analysis/00333/69/"
[1] "op: MUTATION_RATE"
[1] "dfn: /xchip/cga/gdac-prod/tcga-gdac/jobResults/MutSigRun/LAML-TB/13351591/LAML-TB.patients.counts_and_rates.txt"
[1] "cfn: /xchip/cga/gdac-prod/tcga-gdac/jobResults/Append_Data/LAML-TB/15082450/LAML-TB.merged_data.txt"
[1] "gv: ALL"
[1] "gfn: "
[1] "sfn: "
[1] "fv: ALL"
[1] "ofn: "
[1] "dx: "
[1] "cfn"
[1] "/xchip/cga/gdac-prod/tcga-gdac/jobResults/Append_Data/LAML-TB/15082450/LAML-TB.merged_data.txt"
[1] "ok3"

nSamples in clinical file=200, in expression file=197, common to both=197
Number of genes in original expression dataset=2
[1] "ALL"
[1] "data2feature, selection=ALL"
 [1] "YEARS_TO_BIRTH"                      
 [2] "VITAL_STATUS"                        
 [3] "DAYS_TO_DEATH"                       
 [4] "DAYS_TO_LAST_FOLLOWUP"               
 [5] "PRIMARY_SITE_OF_DISEASE"             
 [6] "DCC_UPLOAD_DATE"                     
 [7] "GENDER"                              
 [8] "DATE_OF_INITIAL_PATHOLOGIC_DIAGNOSIS"
 [9] "DAYS_TO_LAST_KNOWN_ALIVE"            
[10] "KARNOFSKY_PERFORMANCE_SCORE"         
[11] "RACE"                                
[12] "ETHNICITY"                           
[13] "BATCH_NUMBER"                        

Input Data has 13 rows and 197 columns.

[1] "Batch" "13"   
[1] "Last Follow UP"
Variable 1:'YEARS_TO_BIRTH':	nDistinctValues=62,	numeric=TRUE,	binary=FALSE,	exclude=FALSE.
Variable 2:'VITAL_STATUS':	nDistinctValues=2,	numeric=TRUE,	binary=TRUE,	exclude=FALSE.
Variable 3:'DAYS_TO_DEATH':	nDistinctValues=61,	numeric=TRUE,	binary=FALSE,	exclude=TRUE.
[1] "exclude grep('DAYS_?TO', vnms) to deal with survival parameters seperately"
Variable 4:'DAYS_TO_LAST_FOLLOWUP':	nDistinctValues=58,	numeric=TRUE,	binary=FALSE,	exclude=TRUE.
[1] "exclude grep('FOLLOWUP', vnms) to deal with survival parameters seperately"
Variable 5:'PRIMARY_SITE_OF_DISEASE':	nDistinctValues=1,	numeric=FALSE,	binary=FALSE,	exclude=TRUE.
[1] "PRIMARY_SITE_OF_DISEASE is excluded in the analysis because there is no more than two cases of (unique non-NA values)"
Variable 6:'DCC_UPLOAD_DATE':	nDistinctValues=1,	numeric=FALSE,	binary=FALSE,	exclude=TRUE.
[1] "DCC_UPLOAD_DATE is excluded in the analysis because there is no more than two cases of (unique non-NA values)"
Variable 7:'GENDER':	nDistinctValues=2,	numeric=FALSE,	binary=FALSE,	exclude=FALSE.
Variable 8:'DATE_OF_INITIAL_PATHOLOGIC_DIAGNOSIS':	nDistinctValues=10,	numeric=TRUE,	binary=FALSE,	exclude=TRUE.
[1] "exclude grep('DATE', vnms) to deal with survival parameters seperately"
Variable 9:'DAYS_TO_LAST_KNOWN_ALIVE':	nDistinctValues=0,	numeric=FALSE,	binary=FALSE,	exclude=TRUE.
[1] "DAYS_TO_LAST_KNOWN_ALIVE is excluded in the analysis because there is no more than two cases of (unique non-NA values)"
Variable 10:'KARNOFSKY_PERFORMANCE_SCORE':	nDistinctValues=0,	numeric=FALSE,	binary=FALSE,	exclude=TRUE.
[1] "KARNOFSKY_PERFORMANCE_SCORE is excluded in the analysis because there is no more than two cases of (unique non-NA values)"
Variable 11:'RACE':	nDistinctValues=3,	numeric=FALSE,	binary=FALSE,	exclude=FALSE.
Variable 12:'ETHNICITY':	nDistinctValues=2,	numeric=FALSE,	binary=FALSE,	exclude=FALSE.
Variable 13:'BATCH_NUMBER':	nDistinctValues=1,	numeric=FALSE,	binary=FALSE,	exclude=TRUE.
[1] "BATCH_NUMBER is excluded in the analysis because there is no more than two cases of (unique non-NA values)"
[1] "## **** detect survival parameters (defined in index such as ind_OS, ind_MFS, ind_RFS, ind_RFS, ind_BCR and ind_d2ssd) *** ##"
[1] "detected survival parameters using [ind_OS, overall_survival]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using [ind_OS, curated_overall_survival]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using [ind_TCGAOS]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survivial parameters using [ind_MFS]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using [ind_RFS]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using [ind_BCR]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using [ind_Progression]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using [index_additional_survival_time]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "detected survival parameters using condition: [is.null(surv.mat)&&(selection=='SURV')]"
[1] "survival parameters accumulated so far"
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "************ conversion from categorical data to rank data ********** "
[1] "****** SUMMARY ***** "
Output Data has 197 columns, 1 survival variables, and 4 non-survival variables.
[1] "* survival variables: "
[1] "DAYS_TO_DEATH_OR_LAST_FUP" "VITAL_STATUS"             
[1] "* non-survival variables: "
[1] "YEARS_TO_BIRTH" "GENDER"         "RACE"           "ETHNICITY"     
[1] "changed to 5 non-survival variables adding another age variable for linear regression analysis on mutaion rate."
[1] "DAYS_TO_DEATH_OR_LAST_FUP"
[1] "DAYS_TO_DEATH_OR_LAST_FUP"
[1] "D"                         "DAYS_TO_DEATH_OR_LAST_FUP"
[3] "Month"                    
[1] "check if there is any case_to_report in survival time data or not"
[1] "alarming case(s) exist!"
[1] "[  1  ] case_to_report(s) is(are) excluded in survival analysis"
AGE, nv=62, binary=FALSE, numeric=TRUE
AGE_mutation.rate, nv=62, binary=FALSE, numeric=TRUE
$MUTATIONRATE_NONSYNONYMOUS

Call:
lm(formula = as.numeric(i) ~ vv)

Residuals:
       Min         1Q     Median         3Q        Max 
-3.390e-07 -1.450e-07  1.570e-09  1.122e-07  9.369e-07 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept) 1.472e-07  4.659e-08   3.158 0.001839 ** 
vv          2.864e-09  8.122e-10   3.526 0.000526 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 

Residual standard error: 1.839e-07 on 195 degrees of freedom
Multiple R-squared: 0.05993,	Adjusted R-squared: 0.05511 
F-statistic: 12.43 on 1 and 195 DF,  p-value: 0.0005263 


$MUTATIONRATE_SILENT

Call:
lm(formula = as.numeric(i) ~ vv)

Residuals:
       Min         1Q     Median         3Q        Max 
-1.017e-07 -4.974e-08 -1.457e-08  3.586e-08  2.278e-07 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)  
(Intercept) 3.990e-08  1.755e-08   2.273   0.0241 *
vv          7.630e-10  3.060e-10   2.494   0.0135 *
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 

Residual standard error: 6.927e-08 on 195 degrees of freedom
Multiple R-squared: 0.03091,	Adjusted R-squared: 0.02594 
F-statistic: 6.219 on 1 and 195 DF,  p-value: 0.01347 


GENDER, nv=2, binary=FALSE, numeric=FALSE
RACE, nv=3, binary=FALSE, numeric=FALSE
ETHNICITY, nv=2, binary=FALSE, numeric=FALSE
[1] "saved param, results, example.expr in analysis.result.Rdata "
