Speaker State: Elizabeth Shriberg, Speech Technology and Research Lab



Speaker State
Elizabeth Shriberg
Speech Technology and Research Lab
SRI International, Menlo Park, CA
May 7-8, 2015 NSF Speech Science Workshop

Overview
• Umbrella term covering variations within an individual
  – Emotional
  – Cognitive (uncertainty, engagement)
  – Health (stress, fatigue, Parkinson’s…)
  – Mental health (depression, PTSD, MCI, mTBI)
  – Social, pragmatic (engagement, entrainment)
• Synergy with some of the other talks here: Anton, Tom, Julia, Florian, …
• Standard approach
  – Annotate data → “gold standard”
  – Extract features from speech (words, acoustic, prosodic, discourse)
  – Machine learning to predict annotations
  – Range of metrics for evaluation
• Funding: some government, some commercial, but limited
• Interest from industry, e.g. call centers, but largely ASR-based, and data is often proprietary
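
The "standard approach" steps above can be sketched in a few lines. This is a minimal illustration, not the method of any particular system: the two features, the labels, and the data values are made up, the nearest-centroid classifier stands in for whatever machine learning a real study would use, and unweighted average recall (UAR) is just one metric from the range the slide mentions.

```python
from collections import defaultdict

def train_centroids(features, labels):
    """Average the feature vectors of each annotated class."""
    sums, counts = {}, defaultdict(int)
    for vec, lab in zip(features, labels):
        if lab not in sums:
            sums[lab] = [0.0] * len(vec)
        sums[lab] = [s + v for s, v in zip(sums[lab], vec)]
        counts[lab] += 1
    return {lab: [s / counts[lab] for s in sums[lab]] for lab in sums}

def predict(centroids, vec):
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(c, vec))
    return min(centroids, key=lambda lab: dist(centroids[lab]))

def unweighted_average_recall(gold, pred):
    """Mean of per-class recalls, so rare classes count as much as frequent ones."""
    hits, totals = defaultdict(int), defaultdict(int)
    for g, p in zip(gold, pred):
        totals[g] += 1
        hits[g] += int(g == p)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)

# Hypothetical data: [mean pitch (Hz), speaking rate (syllables/s)] per utterance.
# Features are left unnormalized here for brevity; a real system would scale them.
train_x = [[220.0, 5.5], [230.0, 5.8], [180.0, 3.9], [175.0, 4.1]]
train_y = ["stressed", "stressed", "neutral", "neutral"]  # the "gold standard"
test_x = [[225.0, 5.6], [178.0, 4.0], [200.0, 4.2]]
test_y = ["stressed", "neutral", "stressed"]

centroids = train_centroids(train_x, train_y)
predictions = [predict(centroids, x) for x in test_x]
uar = unweighted_average_recall(test_y, predictions)
print(predictions, uar)
```

On this toy data the third test utterance is misclassified, so plain accuracy (2 of 3) and UAR (0.75) differ, which is one reason the choice of metric matters.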

Impact for Speech Technology
1. Detection of state from speech
  – For adaptation / action of system / filtering
  – For monitoring / filtering
  – Massively applicable, including for passive speech, especially with increases in mobile phone use and apps
  – Growing interest in industry in emotion, but speech content analysis is generally behind that of text and video
2. Improvement of speech recognition (via modeling of context for better train/test data matching)

Challenges
• Major effects of speaker, context, and semantics, but almost no understanding of those effects
• Hundreds of papers per year, but we start over with each data set
• Small data sets
• Annotation issues: validity, reliability, unit of analysis
• Common evaluations have been a great service to the community, but the focus has been on large feature sets + deep learning → we’re adding layers, not understanding
• Feature sets biased toward those available from ASR
• Metrics and evaluation
• Sensitive data sets can’t be shared

Future Directions
• Core pursuits
  – Understand how to decouple effects of speaker from state and context
  – Go smaller, not bigger. What’s the minimum feature set and what can we learn from it?
  – Value generalization across data sets: learn what features and approaches transfer to new data
  – Explore robustness in real-world data: studies often assume better audio than we can get in real applications
  – Understand the role of lexical, visual, and physiological information: it is increasingly available, and we need to understand where speech offers added value
• Needs
  – Invest in longitudinal data with real-world spontaneous speech
  – Add spontaneous collection to studies in the medical community
  – Community focus on annotations and meaningful metrics: working group support if no government evaluations
  – User studies that involve real-world end applications
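
One way to make "value generalization across data sets" concrete is a leave-one-corpus-out evaluation: train on every corpus but one, then score on the held-out corpus. The sketch below assumes a single hypothetical feature (mean pitch in Hz), made-up corpus names and values, and a toy threshold model. It is only meant to show the evaluation loop, not a recommended classifier.

```python
def train_threshold(examples):
    """Fit a one-feature threshold: the midpoint between the two class means."""
    stressed = [x for x, lab in examples if lab == "stressed"]
    neutral = [x for x, lab in examples if lab == "neutral"]
    return (sum(stressed) / len(stressed) + sum(neutral) / len(neutral)) / 2

def accuracy(threshold, examples):
    """Fraction of examples where 'above threshold' matches the 'stressed' label."""
    correct = sum((x > threshold) == (lab == "stressed") for x, lab in examples)
    return correct / len(examples)

def leave_one_corpus_out(corpora, train_fn, score_fn):
    """Train on all corpora but one, score on the held-out one; repeat for each."""
    results = {}
    for held_out in corpora:
        train = [ex for name, data in corpora.items()
                 if name != held_out for ex in data]
        model = train_fn(train)
        results[held_out] = score_fn(model, corpora[held_out])
    return results

# Hypothetical corpora: (mean pitch in Hz, annotated state) per utterance.
corpora = {
    "lab_read": [(210.0, "stressed"), (170.0, "neutral")],
    "call_center": [(190.0, "stressed"), (150.0, "neutral")],
    "mobile_app": [(230.0, "stressed"), (200.0, "neutral")],
}

results = leave_one_corpus_out(corpora, train_threshold, accuracy)
print(results)
```

In this made-up example the threshold learned from two corpora transfers to only one of the three held-out sets, because the pitch ranges differ by corpus. That is the kind of speaker and context effect the slides argue needs to be understood rather than averaged away.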
