beginning neural networks assignment 6 paolo gabriel
play

Beginning Neural Networks [Assignment 6] Paolo Gabriel ECE 228, - PowerPoint PPT Presentation

Beginning Neural Networks [Assignment 6] Paolo Gabriel ECE 228, Spring 2018 Objectives Data description Introduce a simple neural network model Implement getGradient() for sum-of-squares error function Optimize weights of NN


  1. Beginning Neural Networks [Assignment 6] Paolo Gabriel ECE 228, Spring 2018

  2. Objectives • Data description • Introduce a simple neural network model • Implement getGradient() for sum-of-squares error function • Optimize weights of NN using gradient descent

  3. The data • Sunspot dataset (unit-scaled) • Time series of relative sunspot activity Input Target x_1 x_2 x_3 x_4 x_5 x_6 … … … … … … x_(n-5) x_(n-4) x_(n-3) x_(n-2) x_(n-1) x_n

  4. Given • Input, Output ( x , y) NN construction parameters (5x8x1) • The network z1 Error function parameters (sum-of-squares, α) • Gradient descent parameters (stop condition, η) • z2 Goal z3 find network parameters that predict value • x1 based on previous 5 ordered observations z4 x2 z5 y1 x3 z6 x4 z7 x5 z8 1 bias 1 bias w(2) Output w(1) Hidden Input

  5. Finding network weights • Specifically, gradient of network error function E( w ) • Find parameter changes that cause most change in error (make opposite changes) {Prediction error} Noise control

  6. Forward pass z1 Goal: calculate estimated output using current z2 network weights z3 x1 z4 x2 z5 y1 x3 z6 x4 z7 x5 z8 1 bias 1 bias w(2) Output w(1) Hidden Input

  7. “ “ Backward pass z1 Goal: calculate gradient of error function based on z2 estimates from current network weights z3 x6 True value x1 z4 x2 Error function z5 y1 x3 z6 x4 z7 x5 Hidden -> output z8 Input -> hidden 1 bias 1 bias w(2) Output w(1) Hidden Input

  8. [6.1] getGradient() Hidden -> output Input -> hidden

  9. [6.2] Gradient Descent Single step Step size, η Typical stopping criteria can be based on number of iterations or change in error Multiple steps

  10. Final comments • Bias added to end of arrays in this assignment… generally start/end ok • tanh used to keep hidden node values near [-1,1] • Things that affect neural network: • Parameters: step size, stop criteria, iteration range, initialization, network size and shape, error/optimization algorithms, hidden unit functions

Recommend


More recommend