neural turing machines
play

Neural Turing Machines Tristan Deleu June 23, 2016 @tristandeleu - PowerPoint PPT Presentation

Neural Turing Machines Tristan Deleu June 23, 2016 @tristandeleu Deep Learning The building blocks + Convolutional Fully connected Recurrent Others Layer Layer Layer Object Recognition Predictions Speech Recognition


  1. Neural Turing Machines Tristan Deleu � June 23, 2016 @tristandeleu

  2. Deep Learning

  3. The building blocks + Convolutional 
 Fully connected 
 Recurrent 
 Others 
 Layer Layer Layer Object Recognition Predictions Speech Recognition � � � Object Detection Language Processing � � Image Segmentation �

  4. Examples + = Object Predictions Face detection Detection + = Speech Predictions Automatic speech recognition Recognition = + Image Predictions Image segmentation Segmentation

  5. Examples + = Language Language Processing Processing Machine translation + = Language Predictions Sentiment analysis Processing = + + Object Language Predictions Image captioning Recognition Processing

  6. Frameworks Lasagne Lasagne Theano Torch Tensorflow Caffe Keras Neon MXNet Chainer CNTK

  7. Theano + Lasagne https://github.com/Lasagne/Lasagne/blob/master/examples/mnist.py

  8. Neural Turing Machines

  9. Recurrent Neural Network y t +1 y t − 1 y t h t +1 h t − 1 h t LSTM t +1 LSTM t − 1 LSTM t x t +1 x t − 1 x t

  10. Memory-augmented Networks BOAT Neural Network ? Boats float on water You can’t sail against the wind Boats do not fly … • Inspired by neuroscience • Memory-augmented networks : add an external memory to neural networks to act as a knowledge base • Keep track of intermediate computations — The story to answer the question in QA problems 
 Memory Networks & Dynamic Memory Networks

  11. Memory-augmented Networks Memory Networks Dynamic Memory Networks Neural GPU Neural Stack/Queue/DeQue Stack-augmented RNN

  12. Turing Machine 0 1 0 1 0 0 1 1 1 0 q 0 Current state Read Operation New state Write q 0 q 1 0 0 q 0 q 0 1 0 q 1 q 0 0 1 q 1 q 1 1 0 · · ·

  13. Neural Turing Machine 0 1 0 1 0 0 1 1 1 0 q 0 Current state Read Operation New state Write Input Output q 0 q 1 0 0 q 0 q 0 1 0 ? q 1 q 0 0 1 q 1 q 1 1 0 · · ·

  14. Heads M t 0 1 0 1 0 0 1 1 � � � � w t Turing Machine Neural Turing Machine

  15. Neural Turing Machine y t +1 y t − 1 y t h t +1 h t − 1 h t � � Controller FF t − 1 FF t FF t +1 � � M t − 1 M t Read heads r t +1 r t − 1 r t � � x t +1 x t − 1 x t Write heads x t +1 x t − 1 x t

  16. Neural Turing Machine y t +1 y t − 1 y t h t +1 h t − 1 h t � � Controller LSTM t +1 LSTM t − 1 LSTM t � � M t − 1 M t Read heads r t +1 r t − 1 r t � � x t +1 x t − 1 x t Write heads x t +1 x t − 1 x t

  17. Neural Turing Machine Input Output � NTM � � � Controller � � � � Write heads Read heads � � Memory

  18. Open-source Library medium.com/snips-ai � github.com/snipsco/ntm-lasagne �

  19. NTM-Lasagne

  20. Algorithmic Tasks • Goal : Learn full algorithms only from input/output examples 
 Generate as much data as we need Input Output � ? • Strong Generalization : Generalize beyond the data the NTM has seen during training 
 Longer sequences for example ? P ( X, Y )

  21. Copy task Inputs EOS Outputs

  22. Training

  23. Copy task

  24. Copy task

  25. Copy task Length 120

  26. Copy task Length 150

  27. Repeat Copy task Inputs x5 EOS Outputs

  28. Repeat Copy task

  29. Repeat Copy task

  30. Associative Recall task Inputs Outputs

  31. Associative Recall task

  32. Associative Recall task

  33. Priority Sort task

  34. bAbI tasks

  35. bAbI tasks John John Mary Mary garden garden Mary went to the garden John went to the garden Mary went back to the hallway Sandra Sandra Sandra journeyed to the bathroom John went to the hallway Mary went to the bathroom hallway hallway bathroom bathroom

  36. bAbI tasks

  37. Conclusion • The NTM is able to learn algorithms only from examples • It shows better generalization performances compared to other recurrent architectures 
 For example LSTMs • Fully differentiable structure 
 Drawback: generalization is still not quite perfect • New take on Artificial Intelligence 
 Trying to teach machines things they can do, the same way we would learn them • Resources Theano: http://deeplearning.net/software/theano/ • Lasagne: http://lasagne.readthedocs.io/en/latest/ • NTM-Lasagne: https://github.com/snipsco/ntm-lasagne • � June 23, 2016 @tristandeleu

  38. Thank you � June 23, 2016 @tristandeleu

Recommend


More recommend