neural networks i
play

Neural Networks - I Henrik I Christensen Robotics & Intelligent - PowerPoint PPT Presentation

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Networks - I Henrik I Christensen Robotics & Intelligent Machines @ GT Georgia Institute of Technology, Atlanta, GA 30332-0280


  1. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Networks - I Henrik I Christensen Robotics & Intelligent Machines @ GT Georgia Institute of Technology, Atlanta, GA 30332-0280 hic@cc.gatech.edu Henrik I Christensen (RIM@GT) Neural Networks 1 / 27

  2. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Introduction 1 Neural Networks - Architecture 2 Network Training 3 Small Example - ZIP Codes 4 Summary 5 Henrik I Christensen (RIM@GT) Neural Networks 2 / 27

  3. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Introduction Initial motivation for design from modelling of neural systems Perceptrons emerged about same time as we started to have real neural data Studies of functional specialization in the brain Henrik I Christensen (RIM@GT) Neural Networks 3 / 27

  4. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neurons - the motivation Henrik I Christensen (RIM@GT) Neural Networks 4 / 27

  5. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Code Example Henrik I Christensen (RIM@GT) Neural Networks 5 / 27

  6. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Outline of ANN architecture Formulation of the criteria function Optimization of weights Example from image analysis Next time: Bayesian Neural Networks Henrik I Christensen (RIM@GT) Neural Networks 6 / 27

  7. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Introduction 1 Neural Networks - Architecture 2 Network Training 3 Small Example - ZIP Codes 4 Summary 5 Henrik I Christensen (RIM@GT) Neural Networks 7 / 27

  8. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Data Process w. Two-Layer Neural Network wTx wTz h(.) σ (x) Henrik I Christensen (RIM@GT) Neural Networks 8 / 27

  9. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Net Architecture as a Graph hidden units z M w (1) w (2) MD KM x D y K inputs outputs y 1 x 1 w (2) z 1 10 x 0 z 0 Henrik I Christensen (RIM@GT) Neural Networks 9 / 27

  10. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Network Equations Consider an input layer D w (1) � a j = ji x i i =0 where w j 0 and x 0 represent the bias weight / term The activation, a j , is mapped by an activation function z j = h ( a j ) which typically is a Sigmoid or tanh The output is considered the hidden activations Output unit activations are computed, similarly M w (2) � a k = kj z j j =0 Henrik I Christensen (RIM@GT) Neural Networks 10 / 27

  11. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Networks - A few more details The full system is then � D  � M w (2) w (1) � � y k ( x , w ) = σ kj h ji x i   j =0 i =0 The information is flowing “forward” through the system Naming is sometimes complicated! 3-layer network single-hidden-layer network two-layer network (input/output) Henrik I Christensen (RIM@GT) Neural Networks 11 / 27

  12. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Introduction 1 Neural Networks - Architecture 2 Network Training 3 Small Example - ZIP Codes 4 Summary 5 Henrik I Christensen (RIM@GT) Neural Networks 12 / 27

  13. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Training Neural Networks For optimization we consider the error function: N � || y ( x n , w ) − t n || 2 E ( w ) = n =1 The optimization is similar to earlier searches Objective ∇ E ( w ) = 0 Due to non-linearity closed form solution is a challenge Newton-Raphson type solutions are possible ∆ w = − H − 1 ∇ E w Often an iterated solution is realistic w ( τ +1) = w ( τ ) − η ∇ E ( w ( τ ) ) Henrik I Christensen (RIM@GT) Neural Networks 13 / 27

  14. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Error Backpropagation Consider the error composed of parts N � E ( w ) = E n ( w ) n =1 Considering errors by parts we get � y k = w ki x i i with the error E n = 1 � ( y nk − t nk ) 2 2 k the associated gradient is ∂ E n = ( y nj − t nj ) x ni ∂ w ji Henrik I Christensen (RIM@GT) Neural Networks 14 / 27

  15. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Computing gradients Given � a j = w ji z i i and z j = h ( a j ) The gradient is (using chain rule) ∂ E n = ∂ E n ∂ a j ∂ w ji ∂ a j ∂ w ji We already know ∂ E n = ( y k − t j ) = δ j ∂ a j and ∂ a j = z i ∂ w ji Henrik I Christensen (RIM@GT) Neural Networks 15 / 27

  16. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Updating of weights Updating backwards in the systems z i δ k δ j w ji w kj z j δ 1 Error Propagation � δ j = h ′ ( a j ) w kj δ k k Henrik I Christensen (RIM@GT) Neural Networks 16 / 27

  17. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Update Algorithm 1 Enter a training sample x n , propagate and compare to expected value t n , y ( x n ) 2 Evaluate δ k at all outputs 3 Backpropagate δ to correct hidden unit weights 4 Evaluate derivatives to correct input level weights Henrik I Christensen (RIM@GT) Neural Networks 17 / 27

  18. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Issues related to training of networks The Sigmoid is “linear” at 0 so random values around 0 is a good start. Be aware that training a network too much could result in over fitting There can be multiple hidden layers Henrik I Christensen (RIM@GT) Neural Networks 18 / 27

  19. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Introduction 1 Neural Networks - Architecture 2 Network Training 3 Small Example - ZIP Codes 4 Summary 5 Henrik I Christensen (RIM@GT) Neural Networks 19 / 27

  20. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Small Example From (Le Cun 1989) on state of the art of ANN’s for recognition Recognition of handwritten characters has been widely studied Still considered an important benchmark for new recognition methods Henrik I Christensen (RIM@GT) Neural Networks 20 / 27

  21. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary ZIP code data Data normalized to 16x16 pixels 320 digits in training set and 160 digits in test set Henrik I Christensen (RIM@GT) Neural Networks 21 / 27

  22. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Different types of networks No hidden layer - pure 1 level regression 1 hidden layer with 12 hidden units - fully connected 2 hidden layers and local connectivity 2 hidden layers, locally connected and weight sharing 2 hidden layers, locally connected and 2 level weight sharing Henrik I Christensen (RIM@GT) Neural Networks 22 / 27

  23. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Example - Net Architectures Henrik I Christensen (RIM@GT) Neural Networks 23 / 27

  24. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Example Results Henrik I Christensen (RIM@GT) Neural Networks 24 / 27

  25. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Example - Summary Careful design of network architectures is important Neural Networks offer a rich variety of solutions Later results have shown improved performance with SVN’s Henrik I Christensen (RIM@GT) Neural Networks 25 / 27

  26. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Introduction 1 Neural Networks - Architecture 2 Network Training 3 Small Example - ZIP Codes 4 Summary 5 Henrik I Christensen (RIM@GT) Neural Networks 26 / 27

  27. Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Summary Neural networks are general approximators Useful both for regression and discrimination Some would term them - “self-parameterized lookup tables” There is a rich community engaged in design of systems Rich variety of optimization techniques Henrik I Christensen (RIM@GT) Neural Networks 27 / 27

Recommend


More recommend