discovering natural language commands in multimodal
play

Discovering Natural Language Commands in Multimodal Interfaces - PowerPoint PPT Presentation

Discovering Natural Language Commands in Multimodal Interfaces Arjun Srinivasan Mira Dontcheva Eytan Adar Seth Walker Speech-enabled multimodal interfaces are becoming popular (1) What operations can I perform? (2) How do I ask the


  1. Discovering Natural Language Commands in Multimodal Interfaces Arjun Srinivasan Mira Dontcheva Eytan Adar Seth Walker

  2. Speech-enabled multimodal interfaces are becoming popular…

  3. (1) What operations can I perform? (2) How do I ask the system to perform them?

  4. Discoverability (1) What operations can I perform? (2) How do I ask the system to perform them?

  5. Discoverability (1) What operations can I perform? (2) How do I ask the system to perform them? 2 nd most common challenge with Voice User Interfaces Patterns for How Users Overcome Obstacles in Voice User Interfaces , Myers et al. CHI 2018

  6. Can we leverage multimodal input to enhance discoverability by suggesting contextually-relevant natural language commands?

  7. Tooltips

  8. When? • Onboarding • During a session • On failure

  9. When? What? • Onboarding • Number of commands • During a session • Coverage vs. Relevance • On failure • Complexity • Phrasing • Parameters

  10. When? What? Where? • Onboarding • Number of commands • Pop-up window • During a session • Coverage vs. Relevance • Tooltips • On failure • Complexity • Embedded in GUI • Phrasing • Panels • Parameters

  11. Three interface variants to present command suggestions Adaptive Embedded Exhaustive

  12. Adaptive Embedded Exhaustive

  13. Adaptive Embedded Exhaustive

  14. Adaptive Embedded Exhaustive

  15. Adaptive Embedded Exhaustive Command Suggestions

  16. Command Filter, Rank, Examples Templates Parameterize

  17. Command Templates Add a name filter on target Make the name filter strength Set fill color to color Change border to color Set stroke size to size Make count copies Remove target Highlight entities in the image ...

  18. Command Filter, Rank, Templates Parameterize Add a name filter on target Change border to color Make the name filter strength Set fill color to color Set fill color to color Set stroke size to size Change border to color Make count copies Set stroke size to size ... Make count copies Remove target color = [red, blue, …] Highlight entities in the image size = [1-10] ... ...

  19. Command Filter, Rank, Examples Templates Parameterize Add a name filter on target Change border to color Change border to blue Make the name filter strength Set fill color to color Set fill color to red Set fill color to color Set stroke size to size Set stroke size to 10 Change border to color Make count copies Make 5 copies Set stroke size to size ... ... Make count copies Remove target color = [red, blue, …] Highlight entities in the image size = [1-10] ... ...

  20. Command Filter, Rank, Examples Templates Parameterize Add a name filter on target Change border to color Change border to blue Make the name filter strength Set fill color to color Set fill color to red Set fill color to color Set stroke size to size Set stroke size to 10 Change border to color Make count copies Make 5 copies Set stroke size to size ... ... Make count copies Remove target color = [red, blue, …] Highlight entities in the image size = [1-10] ... ...

  21. Color this green Add a red stroke Make 2 copies Delete

  22. Available Operations Add Effect Fill Delete Copy Border

  23. Operation Available Selection Operations Target Type Usage Freq. Display Freq. Add Effect Fill Delete Copy Border

  24. Operation Available Phrasing Selection Operations Templates Target Type Usage Freq. Display Freq. Fill Border Copy Delete

  25. Operation Available Phrasing Selection Operations Templates Target Type Usage Freq. Display Freq. Fill Change color to ___ Color this ___ Border Set the fill to ___ Copy Change fill of ___ to ___ Delete …

  26. Operation Template Selection & Available Phrasing Selection Parameterization Operations Templates Target Type Input Type Target State Usage Freq. Usage Freq. Display Freq. Display Freq. Fill Change color to ___ Color this ___ Border Set the fill to ___ Copy Change fill of ___ to ___ Delete …

  27. Operation Template Selection & Available Phrasing Selection Parameterization Operations Templates Target Type Input Type Target State Usage Freq. Usage Freq. Display Freq. Display Freq. Fill Change color to ___ ___ = [blue, green, Color this ___ Border red, Set the fill to ___ …] Copy Change fill of ___ to ___ Delete …

  28. Operation Template Selection & Available Phrasing Examples Selection Parameterization Operations Templates Target Type Input Type Target State Usage Freq. Usage Freq. Display Freq. Display Freq. Fill Change color to ___ ___ = [blue, Color this green green, Color this ___ Border red, Set the fill to ___ …] Copy Change fill of ___ to ___ Delete …

  29. Operation Template Selection & Available Phrasing Examples Selection Parameterization Operations Templates Target Type Input Type Target State Usage Freq. Usage Freq. Display Freq. Display Freq. Fill Color this green Add a red stroke Border Make 2 copies Delete Copy Delete

  30. Evaluation

  31. Evaluation • Between-subjects online study with 24 participants on UserTesting.com • Platform: Chrome running on a touch-enabled Microsoft Surface Pro • Minimal Training: Short videos about the basic interface and how to invoke suggestions (no details about available operations and speech commands)

  32. Evaluation • Task: Three before-after image editing tasks • Duration: 32 min (avg.) • Compensation: $10

  33. Edit the image on the left to make it look like the image on the right. Note that it is okay if your output does not look exactly the same as the target image below but try to make it look as similar as possible. (source) (target)

  34. Speech Usage Summary • Total of 834 spoken commands issued (avg. 49) during 17/24 sessions (6 exhaustive, 5 adaptive, 6 embedded)

  35. Speech Command Failures • 369/834 (44%) spoken commands failed: Error % Error Type 65% Speech recognition & recording errors 18% Phrasing errors 7% Operation-object mapping errors 5% Unsupported operations 5% Parameter errors

  36. Speech Command Failures • 369/834 (44%) spoken commands failed: Error % Error Type 65% Speech recognition & recording errors 18% Phrasing errors 7% Operation-object mapping errors 5% Unsupported operations 5% Parameter errors

  37. Suggestions encourage and aid natural language interaction Exhaustive Embedded Adaptive Overall (avg.) (avg.) (avg.) (avg.) Suggestions helped me learn 4 3.67 4.4 4.02 how to talk to the system Suggestions encouraged me 3.83 3.67 4.2 3.88 to talk to the system *scores between 1-5 5 is “ strongly agree ”

  38. Explanations for domain specific commands • Suggestions do not overcome lack of domain knowledge

  39. Explanations for domain specific commands • Command suggestions as interactive widgets

  40. Future work • Supporting additional command • Validating framework in other application domains (e.g. data visualization) types (e.g. gesture + speech) PixelTone: A Multimodal Interface for Image Editing Laput et al., CHI 2013

  41. Conclusions • Contextual command suggestions aid discoverability and encourage natural language interaction • Direct manipulation can be used to teach natural language interaction

  42. bit.ly/ Thank you voice-hints • Contextual command suggestions aid discoverability Arjun Srinivasan ( @10_arjun ) Mira Dontcheva and encourage natural language interaction Eytan Adar • Direct manipulation can be used to teach natural Seth Walker language interaction

Recommend


More recommend