recent advances and key challenges
play

Recent Advances and Key Challenges Russ Salakhutdinov Machine - PowerPoint PPT Presentation

Recent Advances and Key Challenges Russ Salakhutdinov Machine Learning Department Carnegie Mellon University Canadian Institute for Advanced Research Key Challenges Multimodal Learning Reasoning, Attention and Memory Natural Language


  1. Recent Advances and Key Challenges Russ Salakhutdinov Machine Learning Department Carnegie Mellon University Canadian Institute for Advanced Research

  2. Key Challenges • Multimodal Learning • Reasoning, Attention and Memory • Natural Language Understanding • Deep Reinforcement Learning • Unsupervised Learning / One-Shot & Transfer Learning

  3. Deep Learning: Image Understanding TAGS : strangers, coworkers, conventioneers, attendants Nearest Neighbor Sentence: people taking pictures of a crazy person Model Samples • a group of people in a crowded area • a group of people are walking and talking • a group of people, standing around and talking

  4. Caption Generation There is a cat sitting A little boy with a bunch on a shelf of friends on the street A car is parked in the middle of nowhere Kiros, Salakhutdinov, Zemel, ICML 2014

  5. Caption Generation A man holding a red The handlebars are The two birds are trying apple in his mouth trying to ride a bike rack to be seen in the water Kiros, Salakhutdinov, Zemel, ICML 2014

  6. Caption Generation with Visual Attention A man riding a horse in a field. Xu et al, ICML 2015

  7. Caption Generation with Visual Attention Xu et al, ICML 2015

  8. Key Challenges • Multimodal Learning • Reasoning, Attention and Memory • Natural Language Understanding • Deep Reinforcement Learning • Unsupervised Learning / One-Shot & Transfer Learning

  9. Who-Did-What Dataset Context : “…arrested Illinois governor Rod Blagojevich and his chief of staff • John Harris on corruption charges … included Blogojevich allegedly conspiring to sell or trade the senate seat left vacant by President-elect Barack Obama…” Query : President-elect Barack Obama said Tuesday he was not aware of • alleged corruption by X who was arrested on charges of trying to sell Obama’s senate seat. Answer : Rod Blagojevich • Onishi, Wang, Bansal, Gimpel, McAllester, EMNLP, 2016

  10. Gated Attention Mechanism • Use Recurrent Neural Networks (RNNs) to encode a document and a query. • Use element-wise multiplication to model the interactions between document and query: Dhingra, Liu, Yang, Cohen, Salakhutdinov, 2016

  11. Multi-hot Architecture Reasoning over multiple sentences requires several passes over the context • Dhingra, Liu, Yang, Cohen, Salakhutdinov, 2016

  12. Reasoning and Attention Context : “…arrested Illinois governor Rod Blagojevich and his chief of staff John Harris on • corruption charges … included Blogojevich allegedly conspiring to sell or trade the senate seat left vacant by President-elect Barack Obama…” Query : “President-elect Barack Obama said Tuesday he was not aware of alleged corruption • by X who was arrested on charges of trying to sell Obama’s senate seat .” Answer : Rod Blagojevich • Layer 2 Layer 1

  13. Memory Networks Memory Multiple passes over context help with sequential reasoning Weston, Chopra, Bordes, ICLR 2015; Sukhbaatar et al., NIPS 2015

  14. Broad-Context Language Modeling Her plain face broke into a huge smile when she saw Terry. “Terry!” she called out. She rushed to meet him and they embraced. “Hon, I want you to meet an old friend, Owen McKenna. Owen, please meet Emily.'' She gave me a quick nod and turned back to X LAMBADA dataset, Paperno et al., 2016

  15. Broad-Context Language Modeling Her plain face broke into a huge smile when she saw Terry . “Terry!” she called out. She rushed to meet him and they embraced. “Hon, I want you to meet an old friend, Owen McKenna. Owen, please meet Emily .'' She gave me a quick nod and turned back to X LAMBADA dataset, Paperno et al., 2016

  16. Broad-Context Language Modeling Her plain face broke into a huge smile when she saw Terry . “Terry!” she called out. She rushed to meet him and they embraced. “Hon, I want you to meet an old friend, Owen McKenna. Owen, please meet Emily .'' She gave me a quick nod and turned back to X X = Terry LAMBADA dataset, Paperno et al., 2016

  17. Incorporating Prior Knowledge Coreference Her plain face broke into a Core NLP huge smile when she saw Dependency Parses Terry . “Terry!” she called out. She rushed to meet him and they embraced. Entity relations Freebase “Hon, I want you to meet an old friend, Owen McKenna. Owen, please meet Emily .'’ She gave Word relations me a quick nod and WordNet turned back to X Recurrent Neural Network Text Representation Dhingra, Jin, Yang, Cohen, Salakhutdinov NAACL 2018

  18. Explicit Memory Mary got the football She went to the kitchen She left the ball there RNN Coreference Hyper/Hyponymy Dhingra, Jin, Yang, Cohen, Salakhutdinov NAACL 2018

  19. Explicit Memory Mary got the football She went to the kitchen She left the ball there RNN M t Coreference e 1 e | E | . . . h 0 Hyper/Hyponymy M t +1 h 1 g t RNN h t . M emory as A cyclic G raph . . E ncoding (MAGE) - RNN h t − 1 x t Dhingra, Jin, Yang, Cohen, Salakhutdinov NAACL 2018

  20. Open Domain Question Answering • Finding answers to factual questions posed in Natural Language: Who voiced Meg in Family Guy? A. Lacey Chabert, Mila Kunis Who first voiced Meg in Family Guy? A. Lacey Chabert Bhuwan Dhingra et. al. 2018

  21. Text Augmented Knowledge Graphs Knowledge Source Answers Questions Lacey Chabert, Mila Kunis Who voiced Meg in Family Guy? Lacey Chabert, Mila Kunis Who voiced Meg in Family Guy? Lacey Chabert, Mila Kunis Who voiced Meg in Family Guy? 1982 Which year was Blade Runner released? 1982 Which year was Blade Runner released? Which year was Blade Runner released? 1982 Real Madrid Which club did Cristiano Ronaldo play for in 2011? Real Madrid Which club did Cristiano Ronaldo play for in 2011? Which club did Cristiano Ronaldo play for in 2011? Real Madrid Bhuwan Dhingra et. al. 2018

  22. Knowledge Base as a Knowledge Source Lacey Chabert KB Query Graph Who first voiced Meg in Family Guy? Semantic Parsing Bhuwan Dhingra et. al. 2018

  23. Text as a Knowledge Source Step 1 (Information Retrieval): Retrieve passages relevant to the Question using shallow methods Step 2 (Reading Comprehension): Perform deep reading of passages to extract answers Bhuwan Dhingra et. al. 2018

  24. Text Augmented Knowledge Graph Who first voiced Meg in Family Guy? Entity Linking TF-IDF based Personalized Pagerank sentence retrieval character-in voiced-by Meg Griffin d 1 Meg Griffin is a character from the animated television series Family Guy d 2 Originally voiced by Lacey Chabert Lacey Chabert Family Guy during the first season, she has been voiced by Mila Kunis since season 2 Mila Kunis Bhuwan Dhingra et. al. 2018

  25. Key Challenges • Multimodal Learning • Reasoning, Attention and Memory • Natural Language Understanding • Deep Reinforcement Learning • Unsupervised Learning / One-Shot & Transfer Learning

  26. Learning Behaviors Observation Action Learning to map sequences of observations to actions, for a particular goal

  27. Reinforcement Learning Action Reward Observation / State

  28. Deep Reinforcement Learning Action h 3 W 3 Deep Reward h 2 Neural W 2 Net h 1 W 1 v Observation / State

  29. Deep RL with Memory Action Learned External Memory Reward Observation / State Differentiable Neural Computer, Graves et al., Nature, 2016; Neural Turing Machine, Graves et al., 2014

  30. Deep RL with Memory Action Learned Structured Memory Reward Observation / State Parisotto, Salakhutdinov, ICLR 2018

  31. Random Maze with Indicator • Indicator: Either blue or pink Ø If blue, find the green block Ø If pink, find the red block • Negative reward if agent does not find correct block in N steps or goes to wrong block.

  32. Deep RL with Structured Memory M t M t +1 Write Write Read with Attention Parisotto, Salakhutdinov, 2017

  33. Building Intelligent Agents Action Learned External Memory Reward Knowledge Base Observation / State

  34. Task-oriented Language Grounding Chaplot et al., AAAi 2019

  35. Active Neural Localization and SLAM Chaplot, Parisotto, Salakhutdinov, ICLR2018

  36. Building Intelligent Agents Action Learned External Memory Learning from Fewer Reward Examples, Fewer Knowledge Experiences Base Observation / State

  37. Thank you

Recommend


More recommend