deborah a dahl
play

Deborah A. Dahl Conversational Technologies Chair, W3C Multimodal - PowerPoint PPT Presentation

The W3C Multimodal Architecture and Interfaces Specification Get Smart: Smart Homes, Cars, Devices and the Web W3C Workshop on Rich Multimodal Application Development 22-23 July 2013, New York Metropolitan Area, USA Deborah A. Dahl


  1. The W3C Multimodal Architecture and Interfaces Specification Get Smart: Smart Homes, Cars, Devices and the Web W3C Workshop on Rich Multimodal Application Development 22-23 July 2013, New York Metropolitan Area, USA Deborah A. Dahl Conversational Technologies Chair, W3C Multimodal Interaction Working Group

  2. Opportunities: New ways of interacting with mobile devices • Talking • Moving the device • Writing on the screen 7/18/2013

  3. Opportunities: New ways of using mobile devices to interact with the world Public Spaces Smart Homes Medical devices Smart Cars 7/18/2013

  4. But today we have… • Proprietary interfaces to system components • Unstructured, ad hoc, communication techniques • Complex interfaces • Lack of interoperability across vendors

  5. Developing applications requires learning and using many proprietary and idiosyncratic interfaces

  6. A standard for architecture and communication can • Improve vendor independence • Reduce the learning curve for developers • Promote extensibility to new devices, modalities and sensors

  7. The W3C Multimodal Architecture • An integration standard from the World Wide Web Consortium • Three major components 7/18/2013

  8. Parts of The MMI Architecture (1): Interaction Manager Coordinates interaction among components 7/18/2013

  9. Parts of the MMI Architecture (2): Modality Components Interaction Manager 7/18/2013

  10. Parts of the MMI Architecture (3): Life Cycle Events • High level messages to modality components to prepare, start, stop, pause resume, send data, check status… • Transport of events is not standardized, but an HTTP example is provided 7/18/2013

  11. MMI Architecture Principles • Modality components only communicate directly with the Interaction Manager, not with each other • Modality component functionality is exposed only through Life-Cycle events • All communication is through Life-Cycle Events • Application-specific information is wrapped by standard events 7/18/2013

  12. Standard and Application-Specific Information MMI Life Cycle event Modality-specific API information Semantics of user input (EMMA) Transport (HTTP, Web Sockets, LAN) Interaction Manager Interaction Manager Interaction Manager Communication Layer Interaction Manager Communication Layer Interaction Manager implementation details (SCXML, JavaScript) Interaction Manager implementation details (SCXML, JavaScript) 7/18/2013

  13. Example: A personal assistant application Identify Speech Start user recognition Speaker Typing identification Wait for request Handwriting Receive Touch events request Analyze SMS request Sports TTS scores Service request Email News Display Respond to user Weather Phone call Interaction Manager State Machine (using SCXML internally)

  14. Communication for One R equest: “Check Weather” StartRequest event StartResponse event Start Identify Speech user StartRequest event recognition Speaker “What will the weather be like today?” identification Wait for StartResponse event ExtensionNotification request Event (with EMMA) Handwriting DoneNotification event Touch Analyze request SMS Sports TTS scores Service “It is currently 32º. request Today’s high Email will be 34º”. News StartRequest event Weather Respond Phone to user call Interaction Manager State Machine StartRequest event (using SCXML internally) Display

  15. More Information • W3C Multimodal Architecture and Interfaces – http://www.w3.org/TR/mmi-arch

  16. Resources • Modality Component Description Best Practices • Interoperability Test Report

  17. Presenter • Deborah Dahl, Chair of the W3C Multimodal Interaction Working Group and Principal at Conversational Technologies • Conversational Technologies – Conversational Technologies provides expertise in speech, natural language and multimodal technologies and standards that empowers its clients to apply these technologies in creating compelling mobile, desktop, and cloud solutions.

Recommend


More recommend