si425 nlp
play

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP - PowerPoint PPT Presentation

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly Most top-tier universities now have NLP faculty (Stanford, Cornell, Berkeley, MIT, UPenn, CMU, Hopkins, etc) Commercial NLP hiring: Google,


  1. SI425 : NLP Missing Topics and the Future

  2. Who cares about NLP? • NLP has expanded quickly • Most top-tier universities now have NLP faculty (Stanford, Cornell, Berkeley, MIT, UPenn, CMU, Hopkins, etc) • Commercial NLP hiring: Google, Amazon, Microsoft, IBM, LinkedIn • Web startups in Silicon Valley are eating up NLP students • Navy, DoD, NSA, NIH : all funding NLP research 2

  3. What NLP topics did we miss? • Speech Recognition 3

  4. What NLP topics did we miss? • Speech Recognition 4

  5. What NLP topics did we miss? • Machine Translation 5

  6. What NLP topics did we miss? • Machine Translation • IBM Models (1 through 5) • Neural Network Translation 6

  7. Machine Translation 7

  8. Learning Translations • Huge corpus of “aligned sentences”. • Europarl • Corpus of European Parliamant proceedings • The EU is mandated to translate into all 21 official languages • 21 languages, (semi-) aligned to each other

  9. Machine Translation Technology • Hand-held devices for military • Speak english -> recognition -> translation -> generate Urdu • Translate web documents • Education technology? • Doesn’t yet receive much of a focus

  10. Information Extraction

  11. What NLP topics did we miss? • Dialogue Systems Do you think I don’t care. Anakin likes me? 11

  12. Dialogue Systems • Dialogue Systems • Why? Heavy interest in human-robot communication. • UAVs require teams of 5+ people for each operating machine • Goal: reduce the number of people • Give computer high-level dialogue commands, rather than low-level system commands 12

  13. Dialogue Systems • Dialogue Systems • Dialogue is a fascinating topic. Not only do we need to understand language, but now discourse cues: • Questions require replies • Imperatives/Commands • Acknowledgments: “ok” • Back-channels: “uh huh”, “mm hmm” • 13

  14. Dialogue Systems • BERT-like models • Input: • [CLS] how are you ? [SEP] great thanks [END] • [CLS] hello [SEP] hi what’s up [END] • … 14

  15. El Fin • Secret 1: 15

  16. El Fin • Secret 1: I intentionally made some of our labs ambiguous 16

  17. El Fin • Secret 1: I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results 17

  18. El Fin • Secret 1: I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results • Secret 2: 18

  19. El Fin • Secret 1: I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results • Secret 2: I tried to teach you skills that have nothing to do with NLP 19

  20. El Fin • Secret 1: I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results • Secret 2: I tried to teach you skills that have nothing to do with NLP Experimentation Error Analysis 20

  21. El Fin • Secret 1: I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results • Secret 2: I tried to teach you skills that have nothing to do with NLP Experimentation Error Analysis • Secret 3: 21

  22. El Fin • Secret 1: I intentionally made some of our labs ambiguous Under-defined tasks with unclear expected results • Secret 2: I tried to teach you skills that have nothing to do with NLP Experimentation Error Analysis • Secret 3: I appreciate the hard work you put into the class 22

  23. 23

  24. What NLP topics did we miss? Unsupervised Learning 24

  25. What NLP topics did we miss? Unsupervised Learning • Most of this semester used data that had human labels. • Bootstrapping was our main counter- example: it is mostly unsupervised. • Many many algorithms being researched to learn language and knowledge without humans, only using text. 25

Recommend


More recommend