Neural networks across space & time Dave Snowdon @davesnowdon https://www.linkedin.com/in/davesnowdon/
About me • Java & JavaScript by day • Python & Clojure by night • Amateur social roboticist • Been learning about deep learning for 18 months
Agenda • Why neural networks • How do neural networks work • Convolutional neural networks • Recurrent neural networks
Why neural networks?
Why care about deep learning? • Impressive results in a wide range of domains • image classification, text descriptions of images, language translation, speech generation, speech recognition… • Predictable execution (inference) time • Amenable to hardware acceleration • Automatic feature extraction
What are features? For a program such as: 10 PRINT “Hello QCon London” 20 GOTO 10 possible features include: average statement length, number of variables, number of statements, cyclomatic complexity
Feature extraction • Traditional machine learning process: Data -> Pre-process -> Extract features -> Model -> Results • Deep learning process: Data -> Pre-process -> Model -> Results
Neural network downsides • Need to define the model and its training parameters • Large models can take days or weeks to train • May need a lot of data (often > 10K examples)
How neural networks work
Deep learning != your brain NOT YOUR NEURAL NETWORK
Neuron model: each input x_0 … x_N is multiplied by a weight w_0 … w_N; the bias b is just a weight on a fixed input of 1. The neuron sums the weighted inputs and passes the sum u through an activation function F: output = F(b + w_0·x_0 + w_1·x_1 + … + w_N·x_N)
Neuron model (worked example): bias b = 0.8, inputs x = (0.5, 1, 4), weights w = (0.1, -0.5, -0.5). Weighted sum u = 0.8 + (0.5)(0.1) + (1)(-0.5) + (4)(-0.5) = -1.65
Neuron model: with F = identity, output = F(-1.65) = -1.65
Neuron model: with F = sigmoid, output = F(-1.65) = 0.1611
Neuron model: with F = tanh, output = F(-1.65) = -0.9289
Neuron model: with F = ReLU, output = F(-1.65) = 0
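To make the arithmetic concrete, here is a minimal Python sketch of this neuron (function and variable names are mine, not from the talk); it reproduces the worked numbers above for each activation function.

```python
import math

def neuron(inputs, weights, bias, activation):
    """Weighted sum of inputs plus bias, passed through an activation function."""
    u = bias + sum(w * x for w, x in zip(weights, inputs))
    return activation(u)

identity = lambda u: u
sigmoid = lambda u: 1.0 / (1.0 + math.exp(-u))
relu = lambda u: max(0.0, u)

inputs, weights, bias = [0.5, 1, 4], [0.1, -0.5, -0.5], 0.8
for f in (identity, sigmoid, math.tanh, relu):
    print(neuron(inputs, weights, bias, f))
# u = 0.8 + 0.05 - 0.5 - 2.0 = -1.65
# identity -> -1.65, sigmoid -> 0.1611, tanh -> -0.9289, ReLU -> 0.0
```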
Neural networks are not graphs
Neural networks are like onions (they have layers and can make you cry) Input layer Hidden layer Output layer
Why layers? [Diagram: input values x feeding through Layer 1 into Layer 2]
Neural networks are like onions (they have layers and can make you cry) W 2 W 1 � � W W W W � � 11 12 13 14 ⎧ ⎫ � � W 11 W 12 W 13 W W W W ⎪ ⎪ � � 21 22 23 24 � � ⎪ ⎪ W 21 W 22 W 23 ⎪ ⎪ ⎨ ⎬ W 31 W 32 W 33 ⎪ ⎪ ⎪ ⎪ Input layer output = f(W 2 . f(W 1 . Input + B 1 ) + B 2 ) Hidden layer Output layer W 41 W 42 W 43 ⎪ ⎪ ⎩ ⎭
Going deeper Input layer Hidden layer Hidden layer Output layer
What do the layers do? Successive layers model higher level features
What input can a network accept? • Anything you like as long as it’s a tensor • Tensor = general multi-dimensional numeric quantity • scalar = tensor of 0 dimensions (AKA rank 0) • vector = 1 dimensional tensor (rank 1) • matrix = 2 dimensional tensor (rank 2) • tensor = N dimensional tensor (rank > 2)
Images: an image can be represented as a tensor of rank 3 (height x width x colour channels) Source: https://www.slideshare.net/BertonEarnshaw/a-brief-survey-of-tensors
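A quick NumPy illustration of the ranks listed above (the shapes are illustrative):

```python
import numpy as np

scalar = np.array(3.0)             # rank 0, shape ()
vector = np.array([1.0, 2.0])      # rank 1, shape (2,)
matrix = np.eye(2)                 # rank 2, shape (2, 2)
image  = np.zeros((480, 640, 3))   # rank 3: height x width x RGB channels
for t in (scalar, vector, matrix, image):
    print(t.ndim, t.shape)
```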
One-hot encoding: input “enums”. Favourite programming language:
         JAVA   CLOJURE   PYTHON   JAVASCRIPT
BARRY     1        0         0          0
BRUCE     0        1         0          0
RUSSEL    0        0         1          0
One-hot encoding: output. Also useful for output, as a probability distribution:
         JAVA   CLOJURE   PYTHON   JAVASCRIPT
BARRY    0.6      0.1       0.1       0.2
BRUCE    0.15     0.75      0.05      0.05
RUSSEL   0.34     0.05      0.6       0.01
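A hedged sketch of both directions in NumPy: one_hot is a hypothetical helper name, and softmax is the standard way to turn a network's raw output scores into the kind of probability distribution shown above (the talk doesn't name it explicitly).

```python
import numpy as np

languages = ["JAVA", "CLOJURE", "PYTHON", "JAVASCRIPT"]

def one_hot(value, categories):
    """Encode a categorical value as a vector with a single 1."""
    v = np.zeros(len(categories))
    v[categories.index(value)] = 1.0
    return v

print(one_hot("CLOJURE", languages))   # [0. 1. 0. 0.]  (Bruce's input row)

def softmax(scores):
    """Turn raw scores into a probability distribution summing to 1."""
    e = np.exp(scores - scores.max())  # subtract max for numerical stability
    return e / e.sum()

print(softmax(np.array([2.0, 0.2, 0.2, 0.9])))
# ~ [0.60 0.10 0.10 0.20], roughly Barry's output row above
```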
Back propagation: feed an input example forward through the network, compare the network's output against the training example's expected output using an error function (also known as cost or loss), then propagate the error backwards through the weight matrices to work out how much each weight contributed to it
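As a sketch of what the frameworks do for you: one forward and one backward pass for a tiny two-layer network with sigmoid activations and squared-error loss. The sizes and names are mine, and biases are omitted for brevity; this is a minimal illustration, not the talk's actual training code.

```python
import numpy as np

def sigmoid(u):
    return 1.0 / (1.0 + np.exp(-u))

rng = np.random.default_rng(0)
W1, W2 = rng.normal(size=(3, 4)), rng.normal(size=(2, 3))
x = np.array([0.5, 1.0, 4.0, -1.0])   # input example
y = np.array([1.0, 0.0])              # expected output
lr = 0.1                              # learning rate

# Forward pass: compute the network's output
h = sigmoid(W1 @ x)                   # hidden layer activations
out = sigmoid(W2 @ h)                 # network output
loss = 0.5 * np.sum((out - y) ** 2)   # error (cost/loss) function

# Backward pass: chain rule gives the gradient of the loss w.r.t. each weight
delta_out = (out - y) * out * (1 - out)      # error at the output layer
delta_h = (W2.T @ delta_out) * h * (1 - h)   # error propagated back to hidden layer
W2 -= lr * np.outer(delta_out, h)            # adjust weights down the gradient
W1 -= lr * np.outer(delta_h, x)
```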
More on back propagation
Frameworks
Summary so far • Neural networks are NOT like your brain • Networks are arranged as layers • The forward pass computes the output of the network • The backward pass computes gradients & adjusts the weights • Frameworks take care of the math for you • but it's still good to understand what's going on
A request from marketing
Images that mention VMware
First we need a dataset
Highlight the parts for training
Creating the dataset • Grab images from Google Image Search • PyImageSearch: “How to create a deep learning dataset using Google Images” • Use dlib's imglab tool to draw bounding boxes around logos / non-logos • https://github.com/davisking/dlib/tree/master/tools/imglab • Wrote a Python script to read the imglab XML and produce cropped images using OpenCV (see the sketch below)
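A minimal sketch of what that cropping script might look like. The imglab XML attribute names (file, top, left, width, height) are from memory, so treat them as assumptions, and the file paths are hypothetical.

```python
import xml.etree.ElementTree as ET
import cv2

# imglab annotation files look roughly like:
# <dataset><images><image file='img.jpg'>
#   <box top='..' left='..' width='..' height='..'/> ...
tree = ET.parse("training.xml")  # hypothetical annotation file
for i, image_el in enumerate(tree.getroot().iter("image")):
    img = cv2.imread(image_el.get("file"))
    for j, box in enumerate(image_el.iter("box")):
        top, left = int(box.get("top")), int(box.get("left"))
        w, h = int(box.get("width")), int(box.get("height"))
        crop = img[top:top + h, left:left + w]    # cut out the bounding box
        cv2.imwrite(f"crop_{i}_{j}.png", crop)
```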
Sliding windows
Multiple scales
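Combining the two ideas on these slides: a hedged sketch that scans a fixed-size window over the image at several scales. The window size matches the 75x22 crops mentioned on the next slide; the step and scale factor are illustrative choices of mine.

```python
import cv2

def sliding_windows(img, win_w=75, win_h=22, step=8):
    """Yield (x, y, crop) for every window position over the image."""
    H, W = img.shape[:2]
    for y in range(0, H - win_h + 1, step):
        for x in range(0, W - win_w + 1, step):
            yield x, y, img[y:y + win_h, x:x + win_w]

def pyramid(img, scale=0.75, min_w=75, min_h=22):
    """Yield the image at successively smaller scales."""
    while img.shape[1] >= min_w and img.shape[0] >= min_h:
        yield img
        img = cv2.resize(img, None, fx=scale, fy=scale)

img = cv2.imread("page.png")  # hypothetical input image
for scaled in pyramid(img):
    for x, y, window in sliding_windows(scaled):
        pass  # classify each window: logo / not logo
```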
How it all adds up • 5501 images total • 883 VMware • 4318 not VMware • Scaled to 75x22x3 -> 4,950 inputs • A fully connected first layer could easily need 4,950,000 weights (4,950 inputs x 1,000 hidden units) • Maybe we need another neural network architecture
Convolutional Neural Networks
Convolution
Convolution example(s): three 3x3 kernels
vertical edges:     horizontal edges:     all edges:
[ 1  0 -1 ]         [  1  1  1 ]          [ -1 -1 -1 ]
[ 1  0 -1 ]         [  0  0  0 ]          [ -1  8 -1 ]
[ 1  0 -1 ]         [ -1 -1 -1 ]          [ -1 -1 -1 ]
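A direct NumPy implementation makes the operation concrete (slow but clear; note that what CNNs call "convolution" is usually cross-correlation, i.e. the kernel is not flipped). The test image is an invented example applying the vertical-edge kernel above.

```python
import numpy as np

def convolve2d(image, kernel):
    """Slide the kernel over the image; each output pixel is the
    elementwise product of the kernel and the patch under it, summed."""
    kh, kw = kernel.shape
    H, W = image.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for y in range(out.shape[0]):
        for x in range(out.shape[1]):
            out[y, x] = np.sum(image[y:y + kh, x:x + kw] * kernel)
    return out

vertical_edges = np.array([[1, 0, -1],
                           [1, 0, -1],
                           [1, 0, -1]])
image = np.zeros((5, 5))
image[:, 2:] = 1.0   # a vertical edge down the middle of the image
print(convolve2d(image, vertical_edges))   # non-zero responses at the edge
```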
Convolutional layer
Max Pooling layer
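A minimal NumPy sketch of 2x2 max pooling (names and the example array are mine): each block is replaced by its maximum value, shrinking the image while keeping the strongest responses.

```python
import numpy as np

def max_pool(image, size=2, stride=2):
    """Downsample by taking the max of each size x size block."""
    H, W = image.shape
    out_h, out_w = (H - size) // stride + 1, (W - size) // stride + 1
    out = np.zeros((out_h, out_w))
    for y in range(out_h):
        for x in range(out_w):
            out[y, x] = image[y * stride:y * stride + size,
                              x * stride:x * stride + size].max()
    return out

a = np.array([[1, 3, 2, 1],
              [4, 2, 0, 1],
              [1, 0, 5, 6],
              [0, 2, 7, 2]])
print(max_pool(a))   # [[4. 2.] [2. 7.]]
```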
Convolutional network
DL4J model structure: Input -> Convolution -> Pooling -> Convolution -> Pooling -> Fully connected -> Softmax
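The talk built this model in DL4J (Java). As an illustration only, here is roughly the same stack sketched with Keras in Python; the filter counts and layer sizes are guesses of mine, not the talk's actual configuration.

```python
from tensorflow import keras
from tensorflow.keras import layers

# Input -> Conv -> Pool -> Conv -> Pool -> Fully connected -> Softmax
model = keras.Sequential([
    keras.Input(shape=(22, 75, 3)),          # the 75x22 RGB crops from earlier
    layers.Conv2D(20, (5, 5), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(50, (5, 5), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(500, activation="relu"),    # fully connected layer
    layers.Dense(2, activation="softmax"),   # logo / not logo
])
model.summary()
```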