with emotion and personality
play

with Emotion and Personality: Mind (Brain Internal States) August - PowerPoint PPT Presentation

Conversational Agents with Emotion and Personality: Mind (Brain Internal States) August 12th, 2019 Soo-Y oung Lee Director , Institute for Artificial Intelligence School of EE / Brain Science Research Center Korea Advanced Institute of


  1. Conversational Agents with Emotion and Personality: Mind (Brain Internal States) August 12th, 2019 Soo-Y oung Lee Director , Institute for Artificial Intelligence School of EE / Brain Science Research Center Korea Advanced Institute of Science & T echnology sylee@kaist.ac.kr , http://ki.kaist.ac.kr

  2. Contents ➢ Background ➢ Emotional Conversational Agents: A Korean AI Flagship Project • Engineering Approach ➢ Understanding Human Mind (Brain Internal States): • Cognitive Neuroscience Approach • Maybe use to make near-ground-truth labels for Engineering Approach ➢ Summary 2 KAIST Institute for Artificial Intelligence

  3. Background

  4. Smart Speaker and Beyond  From Voice Control and Q&A Devices  Via Personal Assistant  T o Digital Companion (Office Mate) 4 KAIST Institute for Artificial Intelligence

  5. Personal Assistant: Artificial Secretary (Braintec h’ 21: 1998-2008)  Dual Goals  Understand brain information processing mechanism  Develop Personal Assistant (or Artificial Secretary) 5 KAIST Institute for Artificial Intelligence

  6. Emotional Conversational Agent (June 2016-April 2019) 6 KAIST Institute for Artificial Intelligence

  7. Companions We Need at Office and Home  We want intelligent companions who understand me and situations well and respond accordingly at any time at any place.  Personal Companion or Office Mate • from pets to companions 7 KAIST Institute for Artificial Intelligence

  8. Beyond Personal Assistant: Digital Companion • Everywhere (Home, Automobile, Office, etc.) • Personality (not one-for-all) • Interaction with context/emotion/intention/situation 8 KAIST Institute for Artificial Intelligence

  9. Mind: Brain Internal Space Agreement/ Disagreement Agreement/ D to others isagreement Per erso sonali lity with explicit in tention of one self Trust/Distrust Emotion to others Ethics Known/Unknown (Memory) Attended/Unattended Time Emotion Intention Trust Memory Ethics Personality Dynamics Fast Slow 9 KAIST Institute for Artificial Intelligence

  10. Situation Awareness Needs both explicit and implicit information  (IEEE Spectrum, June 2008) 10 KAIST Institute for Artificial Intelligence

  11. Teach AI to understand and respond to human mind  11 KAIST Institute for Artificial Intelligence

  12. Decision/Action and Mind/Environments  Human decision making is different from person to person, and from time to time.  affected by internal states (mind) which may have temporal dynamics and unknown environments. Action[n]=f(Audio[n],Video[n],Mind[n],Environment[n]) Mind[n+1]=Mind[n]+g 1 (Mind[n],Audio[n],Video[n],Action[n]) Environments[n+1]=Environments[n] +g 2 (Environments[n],Audio[n],Video[n],Action[n]) 14  Develop Human-Agent Interaction based on internal state models. (Game Theory / Theory-of-Mind) KAIST Institute for Artificial Intelligence

  13. Environments: Unknown Space  Road condition  Weather  Economy  Politics  etc. 13 KAIST Institute for Artificial Intelligence

  14. Internal States : Mind O[n+1]=f{A[n],V[],M[n]} O[n+1] M[n+1]=g{A[n],V[],M[n],K[n]} Motor/Vocal Layer Environments ( Internal States (Mind) Unknown States) I[n+1] • Road condition • Weather • Economics • Politics Visual Output Layer Audio Output Layer Hierarchical • etc. Knowledge K Visual Hidden Layer Audio Hidden Layer Visual Input Layer Audio Input Layer V[n] A[n] 14 KAIST Institute for Artificial Intelligence

  15. 3 approaches to solve real-world problems • If you or others KNOW how to solve the problem, Just solve the problem with best existing methods. • If NOT, If there exists ENOUGH DATA, Use existing Deep Learning models. (You may need refine system parameters adaptively.) If SOME data is available, Develop new model(s), collect data, and improve the model for the problem. (You may need combine the human approaches / domain knowledge and neural network theory. If NO data is available, Conduct cognitive science experiments to find the knowledge. 15 KAIST Institute for Artificial Intelligence

  16. Emotional Conversational Agents 16 KAIST Institute for Artificial Intelligence

  17. Companion with Emotional Intelligence  AI Agents with whom people may fall in love and like to work at office. 17 KAIST Institute for Artificial Intelligence

  18. Research Modules M1 : Emotion & Person M0 : M4 : Ethical Intelligence Recognition Data Collection Unethical Words/Sentences Emotion T ext Multi- Dillema & Fairness/Bias modal Age/Gender Speech Emotio n Rec. Human Personality Learning User Image/Video Identification Stress M2 : Emotion Expression M3 : Emotional Intelligence Platform Natural Lang Life Logging Multi- Proc (Personal Database) modal Text-To- Multi-User Conversational Emotion Speech Companion with Mind Expressio Facial (Emotional Conversation, n Expression Psychological Therapy) 18 KAIST Institute for Artificial Intelligence

  19. ECA Testbed Android APP KAIST Institute for Artificial Intelligence 19

  20. Data Collection KAIST Institute for Artificial Intelligence 20

  21. Emotion Recognition from Text ➢ Dual attention mechanism: local and global ➢ From essay to conversation ➢ Accuracy (6 classes + neutral): 78 – 88 % (with ensemble) KAIST Institute for Artificial Intelligence 21

  22. Recognition from Images ➢ Emotion ➢ Gender ➢ Age ➢ Stress ➢ Speaker KAIST Institute for Artificial Intelligence 22

  23. Facial Expression Recognition in the Wild (1 st Ranked, EmotiW2015)  Advanced Committee with diverse CNNs and hierarchical structure <Kim et al., ICMI ’ 15> <Kim et al., J. Multimodal User In., 2016> 23 KAIST Institute for Artificial Intelligence

  24. Facial Expression Recognition in the Wild ( Image-based session @ Emoti W’ 15 challenge)  7-class FER of movie scenes, # (training, validation, test) images = (958 , 436, 372) + external training data (~35,000) Accuracy (%) {LPQ-pHOG} + rbfSVM : baseline 39.1 The Best Single Deep CNN 57.3 Single-Level Committee w/ Simple Ave. Rule : conventional 58.3 Single-Level Committee w/ Exp Weight Rule 60.5 61.6 Hierarchical Committee w/ {Exp Weight, Simple Ave., Majority Vote} 24 KAIST Institute for Artificial Intelligence

  25. Recognition from Speech ➢ Emotion ➢ Speaker ➢ Stress ➢ Disentangling different speech features • Phoneme • Emotion • Personality • Etc. KAIST Institute for Artificial Intelligence 25

  26. KAIST Institute for Artificial Intelligence 26

  27. Multimodal Integration with Top-Down Attention O[n+1] Motor/Vocal Layer Visual Output Layer Audio Output Layer Visual Hidden Layer Audio Hidden Layer Visual Input Layer Audio Input Layer A[n] V[n] 27 KAIST Institute for Artificial Intelligence

  28. Multimodal Integrated Recognition ➢ Early Integration, Late Integration, and Attention • Bottom-Up Attention (Self Attention) • Top-Down Attention Internal External Cue Cue Classifier Attended Output Output Top-Do Down wn Bottom-Up Recognition Attentio ion Attended Features Bottom-Up Attention Input Features Brain Environment Input Stimulus KAIST Institute for Artificial Intelligence 28

  29. Speech Synthesis: Emotional TTS (Y. Lee, et al., NIPS Workshop 2017) MLP Emotion Embedding 29 KAIST Institute for Artificial Intelligence

  30. Emotional TTS (Y . Lee, et al., NIPS Workshop 2017) http://143.248.97.172:9000/ • Continuous emotional strength Suprise Happy Disgust ? Sad Angry Fear 감정 세기 30 KAIST Institute for Artificial Intelligence

  31. More Controls on Emotional Speech Emotional Strength Mixed Emotion KAIST Institute for Artificial Intelligence 31

  32. Personalized Voices ➢ Embedding learning from multiple speakers KAIST Institute for Artificial Intelligence 32

  33. Emotional Facial Expression (Prof. JY Noh) Scale  4 3 Joy 2 Joy Sadness Anger 1 0 Surprise Disgust Fear 2 0 1 3 4 Scale Anger 33 KAIST Institute for Artificial Intelligence

  34. Facial Expression Synthesis KAIST Institute for Artificial Intelligence 34

  35. Dialogue Generator ➢ Chit-Chat ➢ HappyTalk KAIST Institute for Artificial Intelligence 35

  36. Chaotbot with Chit Chat (3 rd rank at NIPS2017 ConvAI Competition) 36 KAIST Institute for Artificial Intelligence

  37. Current Approach ➢ Combine rule-based and learning-based chatbots ➢ Personalize with previous conversations • Big 5 personal traits KAIST Institute for Artificial Intelligence 37

  38. Ethics for Conversational Agents ➢ Unethical words ➢ Fairness/Bias ➢ Dilemma ➢ Learning human goals from interactions! KAIST Institute for Artificial Intelligence 38

  39. Ethics for Conversational Agents ➢ Unethical words Mar 24, 2016 39 KAIST Institute for Artificial Intelligence

  40. Ethics for Conversational Agents ➢ Unethical words ➢ Fairness/Bias ➢ Dilemma ➢ Learning human goals from interactions! KAIST Institute for Artificial Intelligence 40

  41. Generic Approach: Learning Human Life Goals ➢ It is impossible to handle each ethical issue separately. ➢ Failure of Rule-based Expert Systems ➢ Each AI companion be different. Learning Life Goals from Mentor(s), i.e., Human Companion ➢ ➢ Human has option to use or not-use AI companion. ➢ If choose to use, he/she will be responsible to the concequences. 41 KAIST Institute for Artificial Intelligence

Recommend


More recommend