SLIDE 1

CS 6956: Deep Learning for NLP

Recurrent Neural Networks

SLIDE 2

Overview

1. Modeling sequences
2. Recurrent neural networks: An abstraction
3. Usage patterns for RNNs
4. Bidirectional RNNs
5. A concrete example: The Elman RNN
6. The vanishing gradient problem
7. Long short-term memory units

SLIDE 4

What can we do with such an abstraction?

1. The encoder: Convert a sequence into a feature vector for subsequent classification
2. A generator: Produce a sequence using an initial state
3. A transducer: Convert a sequence into another sequence
4. A conditioned generator (or an encoder-decoder): Combine 1 and 2
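
A minimal sketch of the abstraction these four patterns build on, assuming the usual view of an RNN as a state-update function R (new state from previous state and current input) and an output function O read off each state; the function names and the plain-Python style are illustrative, not part of the slides.

```python
# Sketch only: R maps (previous state, current input) -> new state,
# O maps a state -> an output; both are assumed, not defined in the slides.
def run_rnn(R, O, h0, inputs):
    """Apply the recurrence over an input sequence; return per-step outputs and the final state."""
    h, outputs = h0, []
    for x in inputs:
        h = R(h, x)           # state update at this position
        outputs.append(O(h))  # output read off the current state
    return outputs, h
```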

SLIDES 5-8

1. An Encoder

Convert a sequence into a feature vector for subsequent classification

[Figure, built up across slides 5-8: the RNN reads the input "I like cake" starting from an initial state; its final state is fed to a neural network whose prediction is used to compute a loss]

Example: Encode a sentence or a phrase into a feature vector for a classification task such as sentiment classification
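
As a rough illustration of the encoder pattern, the sketch below uses PyTorch's nn.RNN; the vocabulary size, the token ids standing in for "I like cake", and the two-class sentiment setup are all hypothetical.

```python
import torch
import torch.nn as nn

vocab_size, emb_dim, hidden_dim, num_classes = 10_000, 50, 100, 2

embed = nn.Embedding(vocab_size, emb_dim)
rnn = nn.RNN(emb_dim, hidden_dim, batch_first=True)
classifier = nn.Linear(hidden_dim, num_classes)   # the "neural network" on top of the encoder

tokens = torch.tensor([[2, 5, 7]])                # batch of one sentence, standing in for "I like cake"
_, h_n = rnn(embed(tokens))                       # final state = feature vector for the whole sequence
logits = classifier(h_n.squeeze(0))               # e.g. sentiment classes
loss = nn.functional.cross_entropy(logits, torch.tensor([1]))
```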

SLIDES 9-12

2. A Generator

Produce a sequence using an initial state

[Figure, built up across slides 9-12: starting from an initial state, the RNN emits the output sequence "I like cake" one step at a time, and a loss is computed on those outputs]

Maybe the previous output becomes the current input.

Examples: Text generation tasks
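
A rough sketch of the generator pattern with greedy decoding, where the previous output token is fed back as the current input; the start-of-sequence id, vocabulary size, and fixed output length of 3 are hypothetical.

```python
import torch
import torch.nn as nn

vocab_size, emb_dim, hidden_dim = 10_000, 50, 100
embed = nn.Embedding(vocab_size, emb_dim)
cell = nn.RNNCell(emb_dim, hidden_dim)
out = nn.Linear(hidden_dim, vocab_size)

h = torch.zeros(1, hidden_dim)        # the initial state
token = torch.tensor([0])             # hypothetical start-of-sequence id
generated = []
for _ in range(3):                    # produce, e.g., "I like cake"
    h = cell(embed(token), h)         # advance the state
    token = out(h).argmax(dim=-1)     # previous output becomes the current input
    generated.append(token.item())
```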

SLIDES 13-14

3. A Transducer

Convert a sequence into another sequence

[Figure, built up across slides 13-14: the RNN reads "I like cake" from an initial state and emits an output at every position, here the part-of-speech tags "Pronoun Verb Noun"; a loss is computed on the per-position outputs]
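
A rough sketch of the transducer pattern as a part-of-speech tagger: one prediction per input position, with the loss computed over all positions. The token ids and the three-tag tagset are hypothetical.

```python
import torch
import torch.nn as nn

vocab_size, emb_dim, hidden_dim, num_tags = 10_000, 50, 100, 3
embed = nn.Embedding(vocab_size, emb_dim)
rnn = nn.RNN(emb_dim, hidden_dim, batch_first=True)
tagger = nn.Linear(hidden_dim, num_tags)

tokens = torch.tensor([[2, 5, 7]])               # standing in for "I like cake"
outputs, _ = rnn(embed(tokens))                  # one hidden state per input position
logits = tagger(outputs)                         # one tag prediction per position
gold = torch.tensor([[0, 1, 2]])                 # Pronoun, Verb, Noun
loss = nn.functional.cross_entropy(logits.view(-1, num_tags), gold.view(-1))
```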

SLIDES 15-17

4. Conditioned generator

Or an encoder-decoder: First encode a sequence, then generate another one

[Figure, built up across slides 15-17: the encoder first reads "I like cake" from an initial state; the decoder then produces a different sequence, here the Marathi translation "मला केक आवडतो" ("I like cake")]

Example: A building block for neural machine translation
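
A rough sketch of the conditioned-generator pattern: the encoder's final state initializes the decoder, which then generates the target sequence. Vocabulary sizes, token ids, greedy decoding, and the fixed output length are hypothetical simplifications of a real translation system.

```python
import torch
import torch.nn as nn

emb_dim, hidden_dim, src_vocab, tgt_vocab = 50, 100, 10_000, 12_000
src_embed = nn.Embedding(src_vocab, emb_dim)
tgt_embed = nn.Embedding(tgt_vocab, emb_dim)
encoder = nn.RNN(emb_dim, hidden_dim, batch_first=True)
decoder = nn.RNNCell(emb_dim, hidden_dim)
out = nn.Linear(hidden_dim, tgt_vocab)

src = torch.tensor([[2, 5, 7]])              # standing in for "I like cake"
_, h_n = encoder(src_embed(src))             # 1. encode the source sequence
h = h_n.squeeze(0)                           # encoder's final state conditions the decoder
token = torch.tensor([0])                    # hypothetical start-of-sequence id
translation = []
for _ in range(3):                           # 2. generate the target sequence
    h = decoder(tgt_embed(token), h)
    token = out(h).argmax(dim=-1)            # previous output becomes the next input
    translation.append(token.item())
```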

SLIDE 18

Stacking RNNs

• A commonly seen usage pattern
• An RNN takes an input sequence and produces an output sequence
• The input to an RNN can itself be the output of an RNN: stacked RNNs, also called deep RNNs
• Two or more layers often seem to improve prediction performance
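
A rough sketch of stacking, assuming PyTorch: the per-step output sequence of one RNN is the input sequence of the next; nn.RNN's num_layers argument builds the same structure directly.

```python
import torch
import torch.nn as nn

emb_dim, hidden_dim = 50, 100
layer1 = nn.RNN(emb_dim, hidden_dim, batch_first=True)
layer2 = nn.RNN(hidden_dim, hidden_dim, batch_first=True)

x = torch.randn(1, 3, emb_dim)          # an embedded 3-token input sequence
out1, _ = layer1(x)                     # output sequence of the first RNN...
out2, _ = layer2(out1)                  # ...is the input sequence of the second

# Equivalent stacked (deep) RNN in one module, with two layers:
stacked = nn.RNN(emb_dim, hidden_dim, num_layers=2, batch_first=True)
```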