complexity of linear regions in deep nets
play

Complexity of Linear Regions in Deep Nets Boris Hanin Facebook AI - PowerPoint PPT Presentation

Complexity of Linear Regions in Deep Nets Boris Hanin Facebook AI Research and Texas A&M March 5, 2019 Joint with David Rolnick Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19 Theoretical vs. Practical Expressivity Boris


  1. Complexity of Linear Regions in Deep Nets Boris Hanin Facebook AI Research and Texas A&M March 5, 2019 Joint with David Rolnick Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  2. Theoretical vs. Practical Expressivity Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  3. Theoretical vs. Practical Expressivity Brain: Why deep nets, Pinky? Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  4. Theoretical vs. Practical Expressivity Brain: Why deep nets, Pinky? Pinky: Expressivity, Brain! Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  5. Theoretical vs. Practical Expressivity Brain: Why deep nets, Pinky? Pinky: Expressivity, Brain! Brain: What about learnability? Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  6. Numerical Instability for Large Numbers of Regions Figure: Random perturbation of example w/maximal number of regions. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  7. Theoretical Expressivity Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  8. Practical Expressivity at Init Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  9. Practical Expressivity Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  10. How To Do Theory? Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  11. How To Do Theory? Goal. Characterize typical complexity of functions drawn from µ A , init , µ A , train . Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  12. How To Do Theory? Goal. Characterize typical complexity of functions drawn from µ A , init , µ A , train . Intution. Probabilty measures in high dimensions are often concentrated around low dimensional sets. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  13. How To Do Theory? Goal. Characterize typical complexity of functions drawn from µ A , init , µ A , train . Intution. Probabilty measures in high dimensions are often concentrated around low dimensional sets. Idea. For networks with piecewise linear activations, complexity of µ A , init and µ A , train encoded in corresponding partition of input space. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  14. Overview Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  15. Overview N − depth d ReLU net with n out = 1 Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  16. Overview N − depth d ReLU net with n out = 1 x �→ N ( x ) is continuous and piecewise linear function Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  17. Overview N − depth d ReLU net with n out = 1 x �→ N ( x ) is continuous and piecewise linear function Fixed weights/biases partition R n in into convex pieces on which N is linear Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  18. Overview N − depth d ReLU net with n out = 1 x �→ N ( x ) is continuous and piecewise linear function Fixed weights/biases partition R n in into convex pieces on which N is linear Goal. Understand average complexity of this partition Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  19. ReLU Net with n in = n out = 1 at Initialization Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  20. Input Space Partition with n in = 2 at Initialization Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  21. Evolution of Input Partition Through Network Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  22. Complexity v1.0: Number of Regions Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  23. Complexity v1.0: Number of Regions Deterministic Bounds : 1 ≤ # regions ≤ 2 # neurons Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  24. Complexity v1.0: Number of Regions Deterministic Bounds : 1 ≤ # regions ≤ 2 # neurons Moral of Prior Work. There exist very special weight/bias settings for deep skinny nets that saturate upper bound. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  25. Complexity v1.0: Number of Regions Deterministic Bounds : 1 ≤ # regions ≤ 2 # neurons Moral of Prior Work. There exist very special weight/bias settings for deep skinny nets that saturate upper bound. Q1. What is the average number of regions at init? Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  26. Complexity v1.0: Number of Regions Deterministic Bounds : 1 ≤ # regions ≤ 2 # neurons Moral of Prior Work. There exist very special weight/bias settings for deep skinny nets that saturate upper bound. Q1. What is the average number of regions at init? Q2. What happens to regions during training (practical vs. theoretical expressivity)? Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  27. Number of Regions when n in = n out = 1 Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  28. Number of Regions when n in = n out = 1 Theorem (H-Rolnick) Suppose weights and biases are independent with Var[ bias ] = σ 2 Var[ weights ] = 2 / fan - in , b > 0 . Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  29. Number of Regions when n in = n out = 1 Theorem (H-Rolnick) Suppose weights and biases are independent with Var[ bias ] = σ 2 Var[ weights ] = 2 / fan - in , b > 0 . For any compact S ⊂ R there are c = c ( σ b ) , C = C ( σ b ) so that 1 � � c # { neurons } ≤ # { regions in S } ≤ C # { neurons } | S | E Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  30. Number of Regions when n in = n out = 1 Theorem (H-Rolnick) Suppose weights and biases are independent with Var[ bias ] = σ 2 Var[ weights ] = 2 / fan - in , b > 0 . For any compact S ⊂ R there are c = c ( σ b ) , C = C ( σ b ) so that 1 � � c # { neurons } ≤ # { regions in S } ≤ C # { neurons } | S | E Remark 1 Comes from formula that holds throughout training Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  31. Number of Regions when n in = n out = 1 Theorem (H-Rolnick) Suppose weights and biases are independent with Var[ bias ] = σ 2 Var[ weights ] = 2 / fan - in , b > 0 . For any compact S ⊂ R there are c = c ( σ b ) , C = C ( σ b ) so that 1 � � c # { neurons } ≤ # { regions in S } ≤ C # { neurons } | S | E Remark 1 Comes from formula that holds throughout training 2 Holds for any network connectivity Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  32. Number of Regions when n in = n out = 1 Theorem (H-Rolnick) Suppose weights and biases are independent with Var[ bias ] = σ 2 Var[ weights ] = 2 / fan - in , b > 0 . For any compact S ⊂ R there are c = c ( σ b ) , C = C ( σ b ) so that 1 � � c # { neurons } ≤ # { regions in S } ≤ C # { neurons } | S | E Remark 1 Comes from formula that holds throughout training 2 Holds for any network connectivity 3 Holds for any 1D curve inside high dimensional input space Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  33. Number of Regions on 1D Line Through Training Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  34. Number of Regions on 1D Line Through Training Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  35. Maximal # Regions on 2D Plane Figure: Heuristic: # { regions on k dim slice } ∼ (# neurons ) k . When k = 2 , should have ≈ (16 ∗ 3) 2 = 2304 regions. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  36. Maximal # Regions on 2D Plane Figure: Heuristic: # { regions on k dim slice } ∼ (# neurons ) k . When k = 2 , should have ≈ (32 ∗ 3) 2 = 9216 regions. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  37. Maximal # Regions on 2D Plane Figure: Heuristic: # { regions on k dim slice } ∼ (# neurons ) k . When k = 2 , should have ≈ (32 ∗ 3) 2 = 9216 regions. Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  38. Complexity v2.0: Volume of Linear Region Boundaries Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  39. Complexity v2.0: Volume of Linear Region Boundaries Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  40. Complexity v2.0: Volume of Linear Region Boundaries Basic Object of Study: B N := { Linear region boundaries of N} . Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  41. Complexity v2.0: Volume of Linear Region Boundaries Basic Object of Study: B N := { Linear region boundaries of N} . vol( B N ) + 1 n in = 1: = # regions Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  42. Complexity v2.0: Volume of Linear Region Boundaries Basic Object of Study: B N := { Linear region boundaries of N} . vol( B N ) + 1 n in = 1: = # regions n in > 1: # { regions inside S } � = vol( B N ∩ S ) Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

  43. Complexity v2.0: Volume of Linear Region Boundaries Basic Object of Study: B N := { Linear region boundaries of N} . vol( B N ) + 1 n in = 1: = # regions n in > 1: # { regions inside S } � = vol( B N ∩ S ) Motivation 1. vol( B N ) controls avg dist to boundary: P ( dist ( x , B N ) ≤ ǫ ) ≃ ǫ vol( B N ∩ S ) , x ∼ Unif ( S ) . Boris Hanin Complexity of Linear Regions in Deep Nets - 3/5/19

Recommend


More recommend