CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 - PowerPoint PPT Presentation

Two Open Problems • Kernel methods: how to solve linear systems of equation in less than cubic time • Markov decision processes: how to evaluate factored policies in less than exponential time CS475/CS675 (c) 2016 P. Poupart 2

Kernel Methods • Class of non‐parametric Machine Learning techniques that scale with the amount of data • Examples: – Gaussian processes – Support vector machines – Kernel logistic regression – Kernel principal component analysis – Kernel perceptron CS475/CS675 (c) 2016 P. Poupart 3

Kernel • Covariance function is a kernel function � �, � � � � � � �� • Where is the feature function that defines the kernel • Popular kernels with infinitely many features: � �� Gaussian kernel: � �� Exponential kernel: � CS475/CS675 (c) 2016 P. Poupart 5

Common problem • In all kernel methods, a linear system of equations must be solved: • is an instantiation of the kernel function called the � � Gram matrix, i.e. �,�� • is a constant positive scalar • is constant vector • is the vector of unknowns CS475/CS675 (c) 2016 P. Poupart 6

Challenge • is an matrix where is the number of data points in the dataset � time to solve • Linear system takes • This does not scale to large datasets, i.e., millions or billions of data points. � or less? • How can we reduce the time to CS475/CS675 (c) 2016 P. Poupart 7

Properties • Gram matrix is – Symmetric – Positive semi‐definite – We also know the feature function that is � � � used to create • Can you exploit those properties to reduce � or less? the solution complexity to CS475/CS675 (c) 2016 P. Poupart 8

Markov Decision Processes • Popular model in Operations Research and Artificial Intelligence for decision‐theoretic planning Agent State Action Reward Environment a0 a1 a2 … s0 s1 s2 r1 r2 r0 9 CS475/CS675 (c) 2016 P. Poupart

Markov Decision Processes Formally: Set of states � , set of actions � , discount � ∈ �0,1� Transition function � �, �, � � � Pr �� |�, �� Reward function � �, � ∈ � a 1 a 0 a 3 a 2 s 0 s 1 s 2 s 4 s 3 r 2 r 3 r 4 r 1 CS475/CS675 (c) 2016 P. Poupart 10

Value Function � • Value � of a policy at state � : � � � � � R � s � Pr � � � � , � � � �� ∑ � � �� ∑ � � �� Pr � � � � , � ∑ Pr � � � � , � � � � � �� ∑ � � �� Pr � � � � , � ∑ ∑ Pr � � � � , � Pr � � � � , � � � � � � � � ⋯ CS475/CS675 (c) 2016 P. Poupart 12

Factored MDP • Let be the number of binary features • Each state corresponds to all combinations of binary features � states • This yields �� which is exponential in the number of • Time features • Challenge: can we reduce the solution to be polynomial in ? CS475/CS675 (c) 2016 P. Poupart 15

Properties • Factored MDP � sum to 1 – Rows of � is 1 – Largest eigenvalue of � is factored and � is additive – • Can you exploit those properties to reduce the time complexity to be polynomial in ? CS475/CS675 (c) 2016 P. Poupart 17

CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 - PowerPoint PPT Presentation

CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 (c) 2016 P. Poupart 1 Two Open Problems Kernel methods: how to solve linear systems of equation in less than cubic time Markov decision processes: how to evaluate

CS475/CS675 Lecture 23: July 19, 2016 Principal Component Analysis, Eigenfaces CS475/CS675 (c)

CS475 / CS675 Lecture 19: July 5, 2016 Singular value decomposition Reading: [TB] Chapter 31

CS475 / CS675 Lecture 20: July 7, 2016 Bidiagonalization SVD Image Compression Reading: [TB]

CS475 / CS675 Lecture 18: June 30, 2016 QR Method with Shifts Google Page Rank Reading: [TB]

CS475/CS675 Lecture 4: May 12, 2016 Sparse Gaussian Elimination, Graph Representation Reading:

CS475/CS675 Lecture 2: May 3, 2016 Cholesky factorization, tridiagonal, band matrices Reading:

CS475 / CS675 Lecture 10: June 2, 2016 Least Squares Problems Reading: [TB] Chapt 11

CS475/CS675 Lecture 1: May 3, 2016 Basic Theory of Linear Algebra Reading: [TB] chapt 1 (p.

CS475/CM375 Lecture 8: Oct 6, 2011 Iterative Methods Reading: [Saad] Chapt 4 CS475/CM375 (c) 2011

CS475 / CM375 Lecture 23: Nov 29, 2011 Convergence of Iterative Methods CS475/CM375 (c) 2011 P.

CS475/CM375 Lecture 4: Sept 22 Sparse Gaussian Elimination, Graph Representation Reading: [Saad]

CS475 / CM375 Lecture 14: Oct 27, 2011 Eigenvalue problems Reading: [TB] Chapters 24, 25

CS475 / CM375 Lecture 17: Nov 8, 2011 QR Algorithm and Reduction to Hessenberg Reading: [TB] Chapt

CS475 / CM 375 Lecture 18: Nov 10, 2011 QR Method with Shifts Google Page Rank Reading: [TB]

CS475 / CM375 Lecture 11: Oct 18, 2011 QR Factorization and Gram Schmidt Orthogonalization

CS675: Convex and Combinatorial Optimization Fall 2016 Consequences of the Ellipsoid Algorithm

CS 309: Autonomous Intelligent Robotics Instructor: Jivko Sinapov

Diagonalization Marco Chiarandini Department of Mathematics & Computer Science University of

Setpoint Tracking in SS Systems 1. Setpoint Tracking in SS Systems 1. In addition to the

Micro-service Developer Experience Node Interac6ve 2015 Peter

Dynamic Bayesian Networks And Particle Filtering 1 Time and uncertainty The world changes; we

Outline Sequential Data - Part 2 Greg Mori - CMPT 419/726 Hidden Markov Models - Most Likely

Homework 2 CSE 573: Artificial Intelligence Autumn 2012 Particle Filters Particle Filters for

Recursive State Estimation Lecture 7 Perception as a Continuous Process Perception as a

CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 - PowerPoint PPT Presentation

CS475/CS675 Lecture 24: July 21, 2016 Open problems CS475/CS675 (c) 2016 P. Poupart 1 Two Open Problems Kernel methods: how to solve linear systems of equation in less than cubic time Markov decision processes: how to evaluate

CS475/CS675 Lecture 23: July 19, 2016 Principal Component Analysis, Eigenfaces CS475/CS675 (c)

CS475 / CS675 Lecture 19: July 5, 2016 Singular value decomposition Reading: [TB] Chapter 31

CS475 / CS675 Lecture 20: July 7, 2016 Bidiagonalization SVD Image Compression Reading: [TB]

CS475 / CS675 Lecture 18: June 30, 2016 QR Method with Shifts Google Page Rank Reading: [TB]

CS475/CS675 Lecture 4: May 12, 2016 Sparse Gaussian Elimination, Graph Representation Reading:

CS475/CS675 Lecture 2: May 3, 2016 Cholesky factorization, tridiagonal, band matrices Reading:

CS475 / CS675 Lecture 10: June 2, 2016 Least Squares Problems Reading: [TB] Chapt 11

CS475/CS675 Lecture 1: May 3, 2016 Basic Theory of Linear Algebra Reading: [TB] chapt 1 (p.

CS475/CM375 Lecture 8: Oct 6, 2011 Iterative Methods Reading: [Saad] Chapt 4 CS475/CM375 (c) 2011

CS475 / CM375 Lecture 23: Nov 29, 2011 Convergence of Iterative Methods CS475/CM375 (c) 2011 P.

CS475/CM375 Lecture 4: Sept 22 Sparse Gaussian Elimination, Graph Representation Reading: [Saad]

CS475 / CM375 Lecture 14: Oct 27, 2011 Eigenvalue problems Reading: [TB] Chapters 24, 25

CS475 / CM375 Lecture 17: Nov 8, 2011 QR Algorithm and Reduction to Hessenberg Reading: [TB] Chapt

CS475 / CM 375 Lecture 18: Nov 10, 2011 QR Method with Shifts Google Page Rank Reading: [TB]

CS475 / CM375 Lecture 11: Oct 18, 2011 QR Factorization and Gram Schmidt Orthogonalization

CS675: Convex and Combinatorial Optimization Fall 2016 Consequences of the Ellipsoid Algorithm

CS 309: Autonomous Intelligent Robotics Instructor: Jivko Sinapov

Diagonalization Marco Chiarandini Department of Mathematics &amp; Computer Science University of

Setpoint Tracking in SS Systems 1. Setpoint Tracking in SS Systems 1. In addition to the

Micro-service Developer Experience Node Interac6ve 2015 Peter

Dynamic Bayesian Networks And Particle Filtering 1 Time and uncertainty The world changes; we

Outline Sequential Data - Part 2 Greg Mori - CMPT 419/726 Hidden Markov Models - Most Likely

Homework 2 CSE 573: Artificial Intelligence Autumn 2012 Particle Filters Particle Filters for

Recursive State Estimation Lecture 7 Perception as a Continuous Process Perception as a

Diagonalization Marco Chiarandini Department of Mathematics & Computer Science University of