weighted sums of random kitchen
play

WEIGHTED SUMS OF RANDOM KITCHEN SINKS Replacing minimization with - PowerPoint PPT Presentation

WEIGHTED SUMS OF RANDOM KITCHEN SINKS Replacing minimization with randomization in learning The model Given a set of training data in a domain Fit a function to minimize risk Empirical Risk Risk Loss Function Hinge loss


  1. WEIGHTED SUMS OF RANDOM KITCHEN SINKS Replacing minimization with randomization in learning

  2. The model • Given a set of training data in a domain • Fit a function to minimize risk • Empirical Risk • Risk

  3. Loss Function • Hinge loss (SVM) • Exponential loss (AdaBoost) • Quadratic loss

  4. Form of solution function • Consider solutions in the form weights feature functions • Feature functions • Eigenfunctions (kernel SVM) • Decision trees/stumps (AdaBoost) • More feature functions gives better classification

  5. Solving f • Approximate • This is hard! • New approach: • Randomly choose and minimize over

  6. Randomized approach • Training data • Feature function • Number of features • Parameter distribution • Scaling factor • Algorithm • Draw feature parameters randomly from • Let • Minimize empirical risk

  7. Experimental Results vs AdaBoost • Three datasets • adult • activity • KDDCUP99 • Feature function • sampled uniformly at random • sampled from Gaussian

  8. Experimental Results vs AdaBoost

  9. Pros and Cons • Pros • Much faster • Allows simple and efficient experimentation of feature functions • Cons • Some loss in quality • Need to tune probability distribution (not needed in practice)

  10. Concentration of Risk • The randomized algorithm returns a function such that with probability • Number of training points • Number of feature vectors • Lipschitz constant of loss function • Bound approximation error • Lowest risk versus lowest risk from functions returned is not large • Bound estimation error • True risk of every function returned is close to its empirical risk

  11. Proof • minimizer of risk over all solution functions • minimizer of risk over functions returned • minimizer of empirical risk over functions returned • Then with probability

  12. Bounding approximation error • Lemma 1. Let be i.i.d. random variables in a ball of radius centered about the origin in a Hilbert space. Then with probability • Construct functions • Then there exists • So that

  13. Bounding approximation error • If the loss function has Lipschitz constant , for any two functions • Then

Recommend


More recommend