FlowCAP2 Results: Challenges 1, 2, and 3 Nima Aghaeepour CIHR/MSFHR Strategic Training Program in Bioinformatics for Health Research, University of British Columbia Sep.22.2011 1 / 27
Problem Statement Binary Classification Goal: evaluate the ability of computational pipelines in finding cell populations that can discriminate between two classes: 1: HEU vs UE 2: AML vs normal 3a: ENV vs GAG 3b: Responders vs non-responders Participants identify the cell populations that are different across the two classes. 2 / 27
Problem Statement 0.8 0.6 How does it generalize Probabilities to previously unseen UE samples? HEU 0.4 0.2 0.0 0 10 20 30 40 Samples 3 / 27
Problem Statement Binary Classification Two classes. HEU vs UE 1 AML vs normal 2 ENV vs GAG 3 Responders vs non-responders 4 Participants identify the cell populations that are different across the two classes. Results will be tested on independent samples. 4 / 27
Evaluation Metrics (e.g., for AML) True(T), False(F), Positive(P) Negative(N) TP : An AML case marked as AML by a participants. FP : A normal case marked as AML by a participants. FN : An AML case marked as normal by a participants. TN : A normal case marked as normal by a participants. Accuracy Accuracy : ( TP + TN ) / ( TP + TN + FP + FN ) Sensitivity and Specificity Sensitivity : TP / ( TP + FN ) Specificity : TN / ( FN + FP ) F-measure F-measure: 2 ∗ Sensitivity ∗ Specificity / ( Sensitivity + Specificity ) Should not be mistaken with FlowCAP1’s F-measure. 5 / 27
Challenge 1: HIV Exposed Uninfected vs UnExposed Infants 6 / 27
Challenge 1: HEU vs UE Table 1: Challenge 1: HEUvsUE Sensitivity Specificity Accuracy F-measure 2DhistsSVM 0.50 0.50 0.50 0.50 flowBin 0.00 0.48 0.45 0.00 flowType 0.58 0.60 0.59 0.59 flowType-FeaLect 0.33 0.38 0.36 0.36 PBSC 0.55 0.55 0.55 0.55 PramSpheres 0.36 0.36 0.36 0.36 SWIFT 0.67 0.62 0.64 0.64 Note: FLOCK has been renamed to PBSC. 7 / 27
0.6 HEU vs UE 0.5 Random: 0.5 How can some of them be 0.4 worst than random? F−measures Have we been able to find something meaningful? 0.3 Cross-validation Holdout validation 0.2 (using other time points). 0.1 flowType−FeaLect PramSpheres 2DhistsSVM flowType SWIFT flowBin PBSC 0.0 Algorithms 8 / 27
Challenge 1: HEU vs UE 0.8 0.6 Probabilities UE HEU 0.4 0.2 0.0 0 10 20 30 40 Samples 9 / 27
Challenge 1: HEU vs UE 0.8 0.6 Probabilities UE HEU 0.4 0.2 0.0 0 10 20 30 40 Samples 10 / 27
Challenge 1: HEU vs UE 6 UE HEU 5 4 MisClassifications 3 2 1 0 SampleNumber 11 / 27
Challenge 2: AML vs normal subjects 12 / 27
AML patients. classification of 360 Three perfect F−measures 0.70 0.75 0.80 0.85 0.90 0.95 1.00 flowPeakssvm flowType−FeaLect SPADE 2DhistsSVM Algorithms EMMIXCYTOM flowType RandomSpheres flowBin PBSC 13 / 27
Challenge 2: AML Table 2: Challenge 2: AML Sensitivity Specificity Accuracy F-measure 2DhistsSVM 1.00 0.99 0.99 1.00 EMMIXCYTOM 0.95 0.99 0.99 0.97 PBSC 0.75 0.97 0.94 0.85 flowBin 1.00 0.92 0.92 0.96 flowPeakssvm 1.00 1.00 1.00 1.00 flowType 0.95 0.99 0.99 0.97 flowType-FeaLect 1.00 1.00 1.00 1.00 RandomSpheres 0.95 0.99 0.99 0.97 SPADE 1.00 1.00 1.00 1.00 14 / 27
Challenge 2: AML 1.0 0.8 0.6 Probabilities Normal AML 0.4 0.2 0.0 0 50 100 150 200 250 300 350 Samples 15 / 27
Challenge 2: AML 1.0 0.8 0.6 Probabilities Normal AML 0.4 0.2 0.0 0 50 100 150 200 250 300 350 Samples 16 / 27
Challenge 2: AML 12 normal aml 10 8 MisClassifications 6 4 2 0 SampleNumber 17 / 27
Challenge 2: AML Normal AML Outlier 400 400 400 300 0.9% 300 21% 300 17% SS Log SS Log SS Log 200 200 200 100 100 100 0 0 0 400 600 800 1000 400 600 800 1000 400 600 800 1000 FS Lin FS Lin FS Lin This dataset, perhaps, requires analysis of one marker at a time. Potential challenge for FlowCAP3: a dataset in which multiple markers should be used to find a rare cell populations. 18 / 27
Challenge 3a: Identification of Antigen Stimulation Group 19 / 27
HVTNa 40 patients. Six perfect classifiers for F−measures 0.75 0.80 0.85 0.90 0.95 1.00 flowCore−flowStats flowType−FeaLect Kmeanssvm PRAMS SPADE SWIFT PBSC PramSpheres flowType 20 / 27
Challenge 3: HVTNa Table 3: Challenge 3: HVTNa Sensitivity Specificity Accuracy F-measure PBSC 0.95 0.95 0.95 0.95 flowType 0.88 0.76 0.81 0.82 flowType-FeaLect 1.00 1.00 1.00 1.00 flowCore-flowStats 1.00 1.00 1.00 1.00 Kmeanssvm 1.00 1.00 1.00 1.00 PRAMS 1.00 1.00 1.00 1.00 PramSpheres 0.90 0.90 0.90 0.90 SPADE 1.00 1.00 1.00 1.00 SWIFT 1.00 1.00 1.00 1.00 21 / 27
Challenge 3: HVTNa 2.0 GAG ENV 1.5 MisClassifications 1.0 0.5 0.0 SampleNumber 22 / 27
Challenge 3b: Identification of Responders and Non-Responders in Intracellular Cytokine Staining of Post-HIV Vaccine Antigen Stimulated T-cells 23 / 27
0.8 HVTNb 0.6 Maximum of 0.8 F-measure against F−measures cytokine reposes measured 0.4 by a human across 80 samples. Has the human been wrong? 0.2 flowCore−flowStats SPADE SWIFT PBSC 0.0 Algorithms 24 / 27
Challenge 3: HVTNb Table 4: Challenge 3: HVTNb Sensitivity Specificity Accuracy F-measure PBSC 0.27 0.89 0.81 0.42 flowCore-flowStats 0.79 1.00 0.96 0.88 SPADE 0.67 0.99 0.93 0.80 SWIFT 0.43 0.98 0.83 0.60 25 / 27
Challenge 3: HVTNb 4 − + 3 MisClassifications 2 1 0 1 4 7 10 14 18 22 26 30 34 38 42 3 6 9 12 16 20 24 28 32 36 40 SampleNumber 26 / 27
Acknowledgements FlowCAP CC Ryan Brinkman, Raphael Gottardo, Tim Mosmann, Richard Scheuermann Upenn Wade Rogers CFRI Tobias Kollman FHCRC Steve De Rosa UBC Holger Hoos FlowCAP Participants Funding FlowCAP is supported by NIH/NIBIB grant (EB008400). The FlowCAP summits are supported by NIH/NIAID. 27 / 27
Recommend
More recommend