Stochastic Gradient Descent (SGD)
Batch: $[1..N]$. Gradient over the full batch: $\nabla L(w) = \frac{1}{N}\sum_{i=1}^{N} \nabla \ell_i(w)$
Minibatch: $B$ elements, indices $b(1), b(2), \ldots, b(B)$ sampled from $[1, N]$. Noisy ('stochastic') gradient: $\nabla \tilde{L}(w) = \frac{1}{B}\sum_{j=1}^{B} \nabla \ell_{b(j)}(w)$
Epoch: $N$ samples, i.e. $N/B$ minibatches.
Code example: Gradient Descent vs Stochastic Gradient Descent
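The course's notebook is not reproduced here; below is a minimal numpy sketch of the comparison on an assumed 1-D least-squares problem (the toy data and all names are illustrative, not the course's code).

```python
# Minimal sketch: full-batch gradient descent vs. SGD on 1-D least squares.
import numpy as np

rng = np.random.default_rng(0)
N, B = 1000, 32                               # dataset size, minibatch size
x = rng.normal(size=N)
y = 3.0 * x + rng.normal(scale=0.5, size=N)   # true slope is 3

def grad(w, idx):
    """Gradient of the mean squared error over the samples in idx."""
    return np.mean(2 * (w * x[idx] - y[idx]) * x[idx])

eta = 0.1
w_gd = w_sgd = 0.0
for epoch in range(10):
    # Gradient descent: one update per epoch, using all N samples.
    w_gd -= eta * grad(w_gd, np.arange(N))
    # SGD: N/B updates per epoch, each on a sampled minibatch b(1..B).
    for _ in range(N // B):
        batch = rng.integers(0, N, size=B)
        w_sgd -= eta * grad(w_sgd, batch)
    print(f"epoch {epoch}: GD w={w_gd:.3f}  SGD w={w_sgd:.3f}")
```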
Regularization in SGD: Weight Decay
Regularized loss: $L(w) + \frac{\lambda}{2}\|w\|^2$. Back-prop on the minibatch gives the noisy gradient $\nabla \tilde{L}(w) = \frac{1}{B}\sum_{j=1}^{B} \nabla \ell_{b(j)}(w)$, with $b(1), \ldots, b(B)$ sampled from $[1, N]$; the update
$w \leftarrow (1 - \eta\lambda)\, w - \eta\, \nabla \tilde{L}(w)$
shrinks ('decays') the weights toward zero at every step. Epoch: $N$ samples, $N/B$ minibatches.
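A one-step sketch of the update above, assuming the minibatch gradient `g` has already been computed by back-prop (function name and defaults are illustrative):

```python
# Sketch of one SGD step with weight decay (l2 regularization).
import numpy as np

def sgd_weight_decay_step(w, g, eta=0.01, lam=1e-4):
    # Regularized loss L(w) + (lam/2)*||w||^2 adds +lam*w to the gradient.
    # Rearranged, the weights "decay" by a factor (1 - eta*lam) each step.
    return (1 - eta * lam) * w - eta * g
```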
Learning rate
Gradient Descent
(S)GD with an adaptable step size, e.g. a schedule that decays the learning rate over the course of training.
(S)GD with momentum
Main idea: retain the long-term trend of the updates, drop the oscillations: $v \leftarrow \mu v - \eta \nabla L(w)$, then $w \leftarrow w + v$.
[Figure: optimization paths of (S)GD vs. (S)GD + momentum.]
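A minimal sketch of the momentum update above; the hyper-parameter names $\eta$ (learning rate) and $\mu$ (momentum coefficient) and their values are assumptions:

```python
# Sketch of SGD with momentum: the velocity v accumulates the long-term
# trend of the gradients while oscillating components cancel out.
def momentum_step(w, v, g, eta=0.01, mu=0.9):
    v = mu * v - eta * g      # decaying accumulation of past updates
    return w + v, v
```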
Code example: Multi-layer perceptron classification
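Again, the actual notebook is not included here; the following is a self-contained numpy sketch of a one-hidden-layer classifier trained with minibatch SGD on an assumed toy two-class dataset:

```python
# Minimal sketch: MLP (2 -> 16 ReLU -> 1 sigmoid) with manual back-prop.
import numpy as np

rng = np.random.default_rng(0)
N = 400
X = rng.normal(size=(N, 2))
y = (X[:, 0] * X[:, 1] > 0).astype(float)[:, None]   # XOR-like labels

W1 = rng.normal(scale=0.5, size=(2, 16)); b1 = np.zeros(16)
W2 = rng.normal(scale=0.5, size=(16, 1)); b2 = np.zeros(1)

eta, B = 0.5, 32
for step in range(2000):
    idx = rng.integers(0, N, size=B)
    xb, yb = X[idx], y[idx]
    # Forward pass.
    h = np.maximum(0.0, xb @ W1 + b1)           # ReLU hidden layer
    p = 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))    # sigmoid output
    # Backward pass, cross-entropy loss (d loss / d logit = p - y).
    dlogit = (p - yb) / B
    dW2 = h.T @ dlogit; db2 = dlogit.sum(0)
    dh = dlogit @ W2.T * (h > 0)                # ReLU gate
    dW1 = xb.T @ dh;    db1 = dh.sum(0)
    for param, gradp in ((W1, dW1), (b1, db1), (W2, dW2), (b2, db2)):
        param -= eta * gradp                    # in-place SGD update

h = np.maximum(0.0, X @ W1 + b1)
pred = 1.0 / (1.0 + np.exp(-(h @ W2 + b2))) > 0.5
print(f"training accuracy: {(pred == (y > 0.5)).mean():.2f}")
```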
Step-size Selection & Optimizers: a research problem
• Nesterov's Accelerated Gradient (NAG)
• R-prop
• AdaGrad
• RMSProp
• AdaDelta
• Adam (a sketch of this one follows below)
• …
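As one concrete instance from the list, here is a sketch of the Adam update (Kingma & Ba, 2015); variable names follow the paper's notation and the defaults are the commonly cited ones:

```python
# Sketch of one Adam update step; t is the step counter, starting at 1.
import numpy as np

def adam_step(w, g, m, v, t, eta=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    m = b1 * m + (1 - b1) * g          # first-moment (mean) estimate
    v = b2 * v + (1 - b2) * g * g      # second-moment estimate
    m_hat = m / (1 - b1 ** t)          # bias correction
    v_hat = v / (1 - b2 ** t)
    w = w - eta * m_hat / (np.sqrt(v_hat) + eps)
    return w, m, v
```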
Neural Network Training: Old & New Tricks
Old (1980s): Stochastic Gradient Descent, Momentum, 'weight decay'
New (last 5-6 years): Dropout, ReLUs, Batch Normalization
Linearization: may need higher dimensions. http://colah.github.io/posts/2014-03-NN-Manifolds-Topology/
Reminder: Overfitting, in images
[Figure: overfitting illustrated for classification and regression, with the 'just right' fit for comparison.]
Previously: $\ell_2$ Regularization
$E(w) = \underbrace{\textstyle\sum_i \ell(x_i, y_i; w)}_{\text{per-sample loss}} + \lambda \underbrace{\textstyle\sum_l \|W_l\|^2}_{\text{per-layer regularization}}$
Dropout
Each sample is processed by a 'decimated' neural net: units are dropped at random. The decimated nets are distinct classifiers, but they should all do the same job.
Dropout block: 'feature noising'
Test time: deterministic approximation. Replace the random masks by their expectation, i.e. scale each activation by the keep probability $p$.
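A sketch of both regimes, assuming a keep probability `p` (this follows the original dropout formulation; many libraries instead use 'inverted' dropout, which rescales at training time):

```python
# Sketch of dropout as 'feature noising' on an activation array h.
import numpy as np

rng = np.random.default_rng(0)

def dropout(h, p=0.5, train=True):
    if train:
        mask = rng.random(h.shape) < p   # each unit kept with probability p
        return h * mask                  # a 'decimated' net per sample
    return h * p                         # test time: expected activation
```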
Dropout Performance
Neural Network Training: Old & New Tricks
Old (1980s): Stochastic Gradient Descent, Momentum, 'weight decay'
New (last 5-6 years): Dropout, ReLUs, Batch Normalization
'Neuron': Cascade of Linear and Nonlinear Function
Sigmoidal ('logistic'): $\sigma(x) = 1/(1 + e^{-x})$
Rectified Linear Unit (ReLU): $\max(0, x)$
Reminder: a network in backward mode
[Figure: gradients flowing from the outputs back through the layers.]
Gradient signal scaling from above: $< 1$ (in fact at most $0.25$, since $\sigma'(x) = \sigma(x)(1 - \sigma(x)) \leq 1/4$).
Vanishing Gradients Problem
Gradient signal scaling from above: $< 1$ (at most $0.25$ per sigmoid layer). Do this 10 times and the updates in the first layers become minimal: $0.25^{10} \approx 10^{-6}$. The top layer knows what to do, but the lower layers "don't get it". Sigmoidal unit: the signal is not getting through!
Vanishing Gradients Problem: ReLU Solves It
Gradient signal scaling from above: $\{0, 1\}$. Where a unit is active, the derivative is exactly 1, so the gradient passes through unattenuated.
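A small numerical illustration of the two scaling regimes (the 10-layer compounding is a simplification of real back-prop, which also involves the weight matrices):

```python
# Best-case per-layer gradient scaling, compounded over 10 layers.
import numpy as np

x = np.linspace(-5, 5, 1001)
sig = 1.0 / (1.0 + np.exp(-x))
sig_max = (sig * (1.0 - sig)).max()   # = 0.25, attained at x = 0

print(f"sigmoid, best case per layer: {sig_max:.2f}")
print(f"after 10 sigmoid layers: {sig_max**10:.1e}")          # ~1e-6: vanishing
print(f"after 10 ReLU layers (active path): {1.0**10:.1f}")   # derivative is 1
```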
Neural Network Training: Old & New Tricks
Old (1980s): Stochastic Gradient Descent, Momentum, 'weight decay'
New (last 5-6 years): Dropout, ReLUs, Batch Normalization
External Covariate Shift: your input changes
[Figure: the same scene photographed at 10 am, 2 pm, and 7 pm.]
"Whitening": Set Mean = 0, Variance = 1
Photometric transformation: $I \rightarrow aI + b$
• Make each patch have zero mean: $\hat{x} = x - \mu$
• Then make it have unit variance: $\hat{x} \leftarrow \hat{x} / \sigma$
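A sketch of the two bullets above applied to an image patch (the small `eps` guard is an assumption, added to avoid division by zero on constant patches):

```python
# Per-patch whitening: undo the photometric transformation I -> a*I + b.
import numpy as np

def whiten(patch, eps=1e-8):
    patch = patch - patch.mean()           # zero mean
    return patch / (patch.std() + eps)     # unit variance
```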
Internal Covariate Shift: neural network activations during training are a moving target.
Batch Normalization
Whiten as you go: normalize each activation using the mean and variance of the current minibatch, then apply a learned scale $\gamma$ and shift $\beta$:
$\hat{h} = \frac{h - \mu_B}{\sqrt{\sigma_B^2 + \epsilon}}, \qquad y = \gamma \hat{h} + \beta$
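A sketch of the training-time forward pass only; at test time, standard implementations substitute running averages of the minibatch statistics (an assumption not spelled out on the slide):

```python
# Batch-normalization forward pass over a minibatch h of shape (B, features).
import numpy as np

def batch_norm(h, gamma, beta, eps=1e-5):
    mu = h.mean(axis=0)                      # per-feature minibatch mean
    var = h.var(axis=0)                      # per-feature minibatch variance
    h_hat = (h - mu) / np.sqrt(var + eps)    # whitened activations
    return gamma * h_hat + beta              # learned scale and shift
```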
Batch Normalization: used in all current systems
Convolutional Neural Networks
Fully-connected Layer
Example: 200x200 image, 40K hidden units: $40{,}000 \times 40{,}000 \approx 2$ billion parameters!
Spatial correlation is local, so this is a waste of resources, and we do not have enough training samples anyway.
Locally-connected Layer
Example: 200x200 image, 40K hidden units, filter size 10x10: 4M parameters.
Note: this parameterization is good when the input image is registered (e.g., face recognition).
Convolutional Layer
Share the same parameters across different locations (assuming the input is stationary): convolutions with learned kernels.
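The parameter counts from the last three slides, checked by arithmetic, together with a naive sketch of a shared-kernel convolution (loop-based for clarity, not efficiency; as is conventional in deep learning, this is really a cross-correlation):

```python
# Parameter counts for the three layer types, plus a minimal 2-D convolution.
import numpy as np

n_in, n_hidden, k = 200 * 200, 40_000, 10
print(f"fully connected:   {n_in * n_hidden:,} weights")   # ~1.6 billion
print(f"locally connected: {n_hidden * k * k:,} weights")  # 4 million
print(f"convolutional:     {k * k:,} weights (one shared kernel)")

def conv2d(image, kernel):
    """Valid 2-D convolution: the same kernel is applied at every location."""
    H, W = image.shape
    kh, kw = kernel.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out
```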