svmpauc tight a new algorithm for optimizing partial auc
play

SVMpAUC-tight: A new algorithm for optimizing partial AUC based on a - PowerPoint PPT Presentation

SVMpAUC-tight: A new algorithm for optimizing partial AUC based on a tight convex upper bound Harikrishna Narasimhan and Shivani Agarwal Department of Computer Science and Automation Indian Institute of Science, Bangalore Receiver Operating


  1. SVMpAUC-tight: A new algorithm for optimizing partial AUC based on a tight convex upper bound Harikrishna Narasimhan and Shivani Agarwal Department of Computer Science and Automation Indian Institute of Science, Bangalore

  2. Receiver Operating Characteristic Curve

  3. Receiver Operating Characteristic Curve Binary Classification Vs. Spam Non-Spam Area Under the ROC Curve (AUC)

  4. Receiver Operating Characteristic Curve Binary Classification Vs. Spam Non-Spam Area Under the ROC Curve (AUC) Bipartite Ranking Ranking of documents

  5. Partial AUC? Full AUC

  6. Partial AUC? Vs Full AUC Partial AUC

  7. Ranking http://www.google.com/

  8. Ranking http://www.google.com/

  9. Medical Diagnosis http://en.wikipedia.org/

  10. Medical Diagnosis KDD Cup 2008 http://en.wikipedia.org/

  11. Bioinformatics ― Drug Discovery ― Gene Prioritization ― Protein Interaction Prediction ― …… http://en.wikipedia.org/wiki http://commons.wikimedia.org/ http://www.google.com/imghp

  12. Bioinformatics ― Drug Discovery ― Gene Prioritization ― Protein Interaction Prediction ― …… http://en.wikipedia.org/wiki http://commons.wikimedia.org/ http://www.google.com/imghp

  13. Partial Area Under the ROC Curve is critical to many applications

  14. SVMpAUC (ICML 2013) Narasimhan, H. and Agarwal, S. “ A structural SVM based approach for optimizing partial AUC ”, ICML 2013. SVMpAUC

  15. SVMpAUC (ICML 2013) Narasimhan, H. and Agarwal, S. “ A structural SVM based approach for optimizing partial AUC ”, ICML 2013. SVMpAUC SVM-AUC Joachims, 2005

  16. Improved Version of SVMpAUC Tighter upper bound Improved accuracy Better runtime guarantee

  17. Outline • Overview of SVMpAUC • Upper Bound Optimized by SVMpAUC • Improved Formulation: SVMpAUC-tight • Optimization Methods • Experiments

  18. Receiver Operating Characteristic Curve …….. x 1 + x 2 + x 3 + x m + Positive Instances Training …….. Set x 1 - x 2 - x 3 - x n - Negative Instances GOAL? Learn a scoring function

  19. Receiver Operating Characteristic Curve …….. x 1 + x 2 + x 3 + x m + Positive Instances Training …….. Set x 1 - x 2 - x 3 - x n - Negative Instances GOAL? Learn a scoring function Rank objects Build a classifier x 5 + x 5 + x 3 + x 3 + Threshold or x 1 - x 1 - x 6 + x 6 + …. …. x n - x n -

  20. Receiver Operating Characteristic Curve …….. x 1 + x 2 + x 3 + x m + Positive Instances Training …….. Set x 1 - x 2 - x 3 - x n - Negative Instances GOAL? Learn a scoring function Rank objects Build a classifier Quality of scoring function? x 5 + x 5 + x 3 + x 3 + Threshold or x 1 - x 1 - Threshold Assignment x 6 + x 6 + …. …. x n - x n -

  21. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  22. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  23. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  24. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  25. Receiver Operating Characteristic Curve ROC Curve 20 15 14 Area Under the True Positives ROC Curve 13 (AUC) 11 9 8 6 5 3 False Positives 2 0

  26. Receiver Operating Characteristic Curve ROC Curve 20 15 14 Area Under the True Positives ROC Curve 13 (AUC) 11 9 8 6 5 3 False Positives 2 0 Partial AUC

  27. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  28. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  29. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  30. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  31. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 β = 0.5 11 9 Top 3 negatives! 8 6 5 3 False Positives 2 0

  32. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 β = 0.5 11 9 Top 3 negatives! 8 6 5 3 False Positives 2 0

  33. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 β = 0.5 11 9 Top 3 negatives! 8 6 5 3 False Positives 2 0

  34. Receiver Operating Characteristic Curve ROC Curve 20 15 14 True Positives 13 β = 0.5 11 9 Top 3 negatives! 8 6 5 3 False Positives 2 0

  35. (1 – pAUC) for f

  36. Convex Upper Bound (1 – pAUC) for f

  37. Convex Upper Bound (1 – pAUC) for f + Regularizer

  38. SVMpAUC (ICML 2013) SVMpAUC: Structural SVM Approach Narasimhan and Agarwal, 2013

  39. SVMpAUC (ICML 2013) Ordering of training examples: n 0 0 0 0 0 1 1 0 0 0 m 1 1 0 0 1 1 1 0 0 1 SVMpAUC: Structural SVM Approach Narasimhan and Agarwal, 2013

  40. SVMpAUC (ICML 2013) Ordering of training examples: n 0 0 0 0 0 Scoring 1 1 0 0 0 m function f 1 1 0 0 1 1 1 0 0 1 SVMpAUC: Structural SVM Approach Narasimhan and Agarwal, 2013

  41. SVMpAUC (ICML 2013) Ordering of training examples: n 0 0 0 0 0 Scoring 1 1 0 0 0 m function f 1 1 0 0 1 1 1 0 0 1 SVMpAUC: Structural SVM Approach Narasimhan and Agarwal, 2013

  42. SVMpAUC (ICML 2013) Ordering of training examples: n 0 0 0 0 0 Scoring 1 1 0 0 0 m function f 1 1 0 0 1 1 1 0 0 1 SVMpAUC: Structural SVM Approach Narasimhan and Agarwal, 2013

  43. Convex Upper Bound (1 – pAUC) for f + Regularizer ≤

  44. Convex Upper Bound (1 – pAUC) for f + Regularizer ≤ How does this upper bound look?

  45. Convex Upper Bound (1 – pAUC) for f + Regularizer ≤ Can we obtain a tighter upper bound?

  46. Outline • Overview of SVMpAUC • Upper Bound Optimized by SVMpAUC • Improved Formulation: SVMpAUC-tight • Optimization Methods • Experiments

  47. Upper bound we want? 1 - pAUC ∝

  48. Upper bound we want? 1 - pAUC ∝

  49. Upper bound we want? 1 - pAUC ∝

  50. Upper bound we want? 1 - pAUC ∝ ≤ pair-wise hinge loss!

  51. Upper optimized by SVMpAUC?

  52. Upper optimized by SVMpAUC? = pair-wise hinge loss + extra term

  53. Upper optimized by SVMpAUC? Subset of pairs of positive-negative examples = pair-wise hinge loss + extra term

  54. Upper optimized by SVMpAUC? Subset of pairs of positive-negative examples = ? pair-wise hinge loss + extra term

  55. Upper optimized by SVMpAUC?

  56. Upper optimized by SVMpAUC? ≤ pair-wise hinge loss + extra term

  57. Upper optimized by SVMpAUC? approx. pair-wise hinge loss + extra term ≤ ≤ pair-wise hinge loss + extra term

  58. Upper optimized by SVMpAUC? ? approx. pair-wise hinge loss + extra term ≤ ≤ ? pair-wise hinge loss + extra term

  59. Outline • Overview of SVMpAUC • Upper Bound Optimized by SVMpAUC • Improved Formulation: SVMpAUC-tight • Optimization Methods • Experiments

  60. Rewriting the Partial AUC Loss α = 0, β = 0.5 20 15 14 3 + 2 + 2 = 7 True Positives 13 11 9 8 6 5 3 False Positives 2 0

  61. Rewriting the Partial AUC Loss α = 0, β = 0.5 20 15 14 3 + 2 + 2 = 7 True Positives 13 2 + 2 + 1 = 5 11 9 8 6 5 3 False Positives 2 0

  62. Rewriting the Partial AUC Loss α = 0, β = 0.5 20 15 14 3 + 2 + 2 = 7 True Positives 13 2 + 2 + 1 = 5 11 . 9 . 8 . 6 1 + 1 + 1 = 3 5 3 False Positives 2 0

  63. Rewriting the Partial AUC Loss 1 - AUC restricted to α = 0, β = 0.5 top β fraction of negatives 20 15 Maximum! 14 3 + 2 + 2 = 7 True Positives 13 2 + 2 + 1 = 5 11 . 9 . 8 . 6 1 + 1 + 1 = 3 5 3 False Positives 2 0

  64. Top j β negatives

  65. Top j β negatives SVM-AUC

  66. Negatives j α to j β

  67. Negatives j α to j β Truncated SVMpAUC

  68. SVMpAUC-tight: Improved Formulation SVMpAUC objective restricted to S

  69. SVMpAUC-tight: Top j β Improved Formulation negatives

  70. SVMpAUC-tight: Top j β Improved Formulation negatives Same pairs of positive-negative examples = pair-wise hinge loss + extra term

  71. SVMpAUC-tight: Negatives Improved Formulation j α to j β

  72. SVMpAUC-tight: Negatives Improved Formulation j α to j β approx. pair-wise hinge loss + extra term ≤ ≤ pair-wise hinge loss + extra term

  73. Outline • Overview of SVMpAUC • Upper Bound Optimized by SVMpAUC • Improved Formulation: SVMpAUC-tight • Optimization Methods • Experiments

  74. SVMpAUC-tight: Optimization Problem + Regularizer exponential in size

  75. SVMpAUC-tight: Optimization Problem + Regularizer exponential in size Quadratic program with an exponential number of constraints

Recommend


More recommend