Don't Know P ROJECT P ITCH CS294S/W F ALL 2020 Semantic Parsing - PowerPoint PPT Presentation

Error Detection: Know What you Don't Know P ROJECT P ITCH CS294S/W F ALL 2020

Semantic Parsing • Is the task of converting what the user says to executable code. Natural language ThingTalk What is a Chinese restaurant in Restaurant, servesCuisine =~ “ Chinese ” Palo Alto? && geo =~ “ Palo Alto ” • Depending on the test questions, commercial VAs are ~70-85% accurate. • (And we see lower numbers in research papers)

Semantic Parsing • Virtual assistants are far from perfect. • The result is user frustration • Users have to repeat their command several times • Sometimes the wrong command is executed • But the conversation does not have to end with a mistake • Very Big Question: How can we build parsers that seek user’s feedback and fix their own mistakes? • Project-size Question: How can we build parsers that know they made a mistake?

High-Level Project Plan • Step 1: Choose a semantic parsing dataset (Schema2QA, MultiWOZ, etc.) • Step 2: Ideate (we have some ideas!) • Step 3: Implement your ideas, train models • Step 4: Iterate • Step 5: (Bonus) Integrate your model into Almond • Step 6: Profit! • i.e. go down as one of the people who helped disrupt the emerging virtual assistant oligopoly and lower the power of a few companies over consumers!

Natural Response Generation for Virtual Assistants P ROJECT P ITCH CS294S/W F ALL 2020

Almond The Virtual Assistant You can try Almond version 1.99 at almond-dev.stanford.edu For now, you can ask about the weather or restaurants or connect it to your spotify account. The following is a conversation I had with it, without any edits.

restaurants stars a an restaurant

Natural Response Generation for VAs • We have: • A large set of synthetic multi-turn dialogues for several domain • In each turn, what VA needs to say back to the user in ThingTalk code • A baseline model that converts ThingTalk code to natural language • A baseline neural network that tries to “fix” the response • Question: How do we make responses more natural?

I'm sorry, but I don't have a restaurant that matches your request. I found Evita Estiatorio, Ramen Nagi and Zareen’s , all of which have a rating of 4.5 stars . It’s a restaurant with a 4.5 -star rating, located at 420 Emerson Street, Palo Alto, CA 94301 . Evita Estiatorio is an expensive restaurant.

The Problem • The “fixes” are not always correct. • Pieces of information might get dropped • Additional information might be hallucinated by the neural network • There seems to be a trade-off between naturalness and correctness in the current system. • Correctness is important for VAs, especially in sensitive domains like banking

High-Level Project Plan • Step 1: Define/find a suitable evaluation metric for correctness • Step 2: Ideate (we have some ideas!) • Step 3: Implement your ideas, train models • Step 4: Iterate • Step 5: Conduct human evaluation • Step 6: (Bonus) Integrate your changes with Almond • Step 7: Profit! • i.e. go down as one of the people who helped disrupt the emerging virtual assistant oligopoly and lower the power of a few companies over consumers!

Tools to Find a Solution • Natural Language Processing • Heavy use of pretrained language models like BERT, BART and GPT-2 • Human evaluation on Amazon Mechanical Turk

Don't Know P ROJECT P ITCH CS294S/W F ALL 2020 Semantic Parsing - PowerPoint PPT Presentation

Error Detection: Know What you Don't Know P ROJECT P ITCH CS294S/W F ALL 2020 Semantic Parsing Is the task of converting what the user says to executable code. Natural language ThingTalk What is a Chinese restaurant in Restaurant,

They Don t Want Them Or You t Want Them Or You They Don Don t Have Them: t Have

Don Juans Troubles Don Juans Troubles Hey, Anna, how are you? Don Juans Troubles Hey,

Know how. Know now. Know how. Know now. Please Thank our sponsor! The Nebraska Soybean Board

What You Dont Know What You Dont Know What You Dont Know What You Dont Know That

The Art The Art when you don't know! Define what you want when you do know! of of Know

I Know it Was the Blood Verse 1 I know it was the blood I know it was the blood I know it was

HOW TO BECOME AN EFFECTIVE GROUP FACILITATOR How do I prepare? Know your Know your Know your

Lower Don Trail Master Plan Refresh Public Open House_September 17 2019 1 Lower Don Trail

DON Cybersecurity/Information Assurance Workforce Management Chris Kelsall DON CIO, Director,

Typical English mistakes The system consist of three main component. Giorgio Buttazzo don't forget

BACKGROUND JOB PROCESSING DO'S AND DON'TS BACKGROUND JOB PROCESSING - DO'S AND DON'TS IMAGE

The Power of Unknowns Harnessing what you don't know to estimate project duration John Keklak

BALTI BALTI MORE MORE WE KNOW WE KNOW BALTIMORE BALTIMORE WE KNOW WE KNOW DELOITTE

We Know It ! We Know It ! WeKnowIt WeKnowIt Emerging, Collective Intelligence for personal,

WELCOME! You need to know what you know, and know what you dont know. Then work on your areas

(11-14) How much do you know about the internet? Make sure you stay SAFE AND SECURE ONLINE YOU

Stanford Rebuild: An Entrepreneurial Path Forward Stefanos Zenios Entrepreneurs adapt Sams

Predicting and suggesting in the job market EIT Summer School 2016, Bosn, June 27, 2016

Nanowire- -Based Based Nanowire Programmable Programmable Architectures Architectures

Wire Crossing Patterns Maxim Potekhin (BNL) Wirecell Summit@LBNL 12/09/2015 Nominal pattern

fly : Untethered Multi-user VR for Commodity Mobile Devices Xing Liu, Christina Vlachou, Feng

Project Overview, and Working with Clients UNC COMP 523 Wed Aug 12, 2020 Prof. Jeff Terrell 1

Writing Research Grant Applications Andrew Derrington Parker Derrington Ltd Programme Things

DP ProtoDUNE Technical Design Review 24 th April 17 C.Cantini on behalf of ETHZ Group 24/04/2017