modeling the brain s operating system
play

Modeling the Brains Operating System Dana H. Ballard Computer - PowerPoint PPT Presentation

Modeling the Brains Operating System Dana H. Ballard Computer Science Dept. University of Austin Texas, NY, USA International Symposium Vision by Brains and Machines November 13th-17th Montevideo, Uruguay Embodied Cognition Brain


  1. Modeling the Brain’s Operating System Dana H. Ballard Computer Science Dept. University of Austin Texas, NY, USA International Symposium “Vision by Brains and Machines” November 13th-17th Montevideo, Uruguay

  2. Embodied Cognition Brain Body World Maurice Merleau-Ponty 1906- 1961

  3. Timescales 10 3 Memory encoding 10 2 Continuous Activity time sec 10 1 Speed Chess minimum search Sentence generation 10 0 Attention Switching Time Discrete 10 -1 Modal fixation time Shortest Recognition time Round-trip through Cortical Memory 10 -2

  4. QuickTime and a MPEG-4 Video decompressor are needed to see this picture.

  5. Marr = Brooks

  6. Three Levels of a Human “Operating System” Behaviors are scheduled from a pool 3 Behaviors compete Behavior 2 for body’s motor resources 1 Behaviors obtain sensory information

  7. Computational Abstraction Hierarchy Task: Make a PBJ sandwich Component: Remove Jelly Jar Lid Routine: Locate Lid

  8. Multi-tasking As revealed by gaze sharing in human data Mid-Block 40m Stop sign Car Mid-block sign 60 Intersection sign 50 40 Eye 30 20 10 0 0 5 10 15 3 3 3 3 172 10 177 10 182 10 187 10 time(sec) Shinoda and Hayhoe, Vision Research 2001

  9. Visual Routines Roelfsema et al PNAS 2003

  10. Introducing “Walter” QuickTime and aYUV420 codec decompressorare needed to see this picture. Pickup cans Stay on sidewalk Avoid obstacles

  11. Control of visuo-motor routines + only ~4 can run “active” simultaneously + 100-200ms update per behavior “inactive”

  12. Walter’s Visual Routines image Can locations Sidewalk 1-d obstacle locs location

  13. Reinforcement Learning Primer : Before Learning You are here state action

  14. Reinforcement Learning Primer : After Learning γ γ value γ γ γ policy γ γ γ γ

  15. Microbehavior for Litter θ ,d Cleanup 1. Visual 2a. Policy Routine 2b. Value of Policy Q d θ Heading from Walter’s perspective

  16. Learned Microbehaviors Litter Sidewalk Obstacles

  17. Microbehaviors and the body’s resources “active” “inactive” • Gaze direction must use a single best Q value. • Walking direction uses weighted average of Q values.

  18. Which Microbehavior should get the gaze vector? - The best Q The expected Q given given a sample state the state uncertainty

  19. obs side can obs can side

  20. Performance Comparison

  21. Walter crosses the street QuickTime and aYUV420 codec decompressorare needed to see this picture. Pickup cans Stay on sidewalk Avoid obstacles

  22. Running Behaviors: Eye Movement Trace QuickTime and a QuickTime and a decompressor decompressor are needed to see this picture. are needed to see this picture.

  23. Three trials

  24. Eyetracker in V8 helmet QuickTime and aTIFF (Uncompressed) decompressorare needed to see this picture.

  25. Methods Human Ss walk Walter’s route in Virtual Reality. Their 6 dof head Produces position and 2 dof gaze the A curved positions are perception path in real continuously tracked. of a space straight Three subjects were path in used. visual space The resultant video and eye track signal are scored frame-by-frame.

  26. A human walks Walter’s route QuickTime and aYUV420 codec decompressorare needed to see this picture.

  27. Human data: individual fixations Obstacle Litter Sidewalk Corner Crosswalk Otherside

  28. Walter and the humans have similar task prioritie Human subjects(3) Walter(3 trials) 70 120 60 100 50 80 40 60 30 40 20 20 10 0 0 obs side litter Other obs side litter Other

  29. Human data: Two samples with different contexts Obstacle avoidance Near obstacles Litter Crosswalk Approaching Otherside crosswalk Sidewalk Walter

  30. Scheduling Context 14 4.5 25 12 4 3.5 20 10 3 8 15 2.5 2 6 10 1.5 Walter 4 1 5 0.5 2 0 0 0 corner light side crosswalk oherside other corner side obs otherside 25 50 80 45 70 20 40 60 35 15 50 30 Human Ss 40 25 10 20 30 15 20 5 10 10 5 0 0 0 corner light side corner side obs otherside crosswalk otherside other Approaching Waiting for On crosswalk light Crosswalk Walter and the human Ss all exhibit context sensitivities. Human gaze locations are interpreted based on gaze location. The actual internal state is unknown.

  31. Rewards can be changed quickly Litter Sidewalk Obstacles

  32. Changing the reward schedule “ignore the litter” “ignore the obstacles” 60 60 50 50 40 40 Walter 30 30 20 20 10 10 0 0 obs side litter obs side litter 60 60 50 50 40 40 Humans 30 30 20 20 10 10 0 0 obs side litter obs side litter

  33. Match No match 12 Saliency Map 10 vs Gaze 8 courtesy of program 6 provided by 4 Dr. Laurent Itti 2 at the iLab, USC 0 Match No Match

  34. Credit Assignment - MIT model

  35. Credit Assignment - Our Model

  36. The laboratory at Rochester Computer Science Cognitive Science Dana Ballard Mary Hayhoe Brian Sullivan Jelena Jovancevic Constantin Rothkopf Alumni Chen Yu Pili Aivar Nathan Sprague Jochen Triesch Al Robinson Neil Mennie Weilie Yi Jason Droll Xue Gu Jonathan Shaw QuickTime and aTIFF (Uncompressed) decompressorare needed to see this picture. QuickTime and aTIFF (Uncompressed) decompressorare needed to see this picture.

Recommend


More recommend