Enhanced Robot Audition Based on Microphone Array Source Separation

Apr 19, 2023

Enhanced Robot Audition Based on Microphone Array Source Separation with Post-Filter. Jean-Marc Valin, Jean Rouat, François Michaud. Department of Electrical Engineering and Computer Engineering, Université de Sherbrooke, Québec, Canada. Jean-Marc.Valin@USherbrooke.ca
Motivations. The context: a mobile robot and the cocktail party effect. The problem: separating sound sources. The solution: a microphone array with both linear and non-linear processing. [Block diagram: sources S_m(k,l) -> microphones X_n(k,l) -> geometric source separation -> separated sources Y_m(k,l) -> post-filter -> estimated sources Ŝ_m(k,l)]
Approach. Frequency-domain processing. Geometric Source Separation (GSS): minimizes leakage under constraints; adapted for real-time processing. Post-filter: cancels remaining interference; based on the Ephraim and Malah estimator; handles both stationary and non-stationary noise/interference.
Geometric Source Separation. Frequency domain. Constrained optimization: minimize the correlation of the outputs, subject to a geometric constraint. Modifications to the original GSS algorithm: instantaneous computation of correlations; stochastic-gradient descent.
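The slide's equations did not survive extraction, so the following is only a sketch of what one stochastic-gradient GSS update at a single frequency bin could look like. It assumes the usual formulation from the GSS literature: a decorrelation cost ||R_yy - diag(R_yy)||^2, a geometric cost ||W A - I||^2, and correlations taken instantaneously as x x^H and y y^H. The names `gss_step`, `matmul`, `herm`, the step size `mu`, and the 1/||x x^H||^2 normalization are illustrative assumptions, not the authors' code.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows of complex numbers."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def herm(a):
    """Conjugate transpose of a matrix stored as a list of rows."""
    return [[a[i][j].conjugate() for i in range(len(a))]
            for j in range(len(a[0]))]


def gss_step(W, A, x, mu=0.01):
    """One stochastic-gradient step for one frequency bin and one frame.

    W: M x N separation matrix, A: N x M steering matrix from the source
    geometry, x: length-N list of microphone spectra (all complex).
    Returns the updated W and the separated outputs y = W x.
    """
    M, N = len(W), len(x)
    y = [sum(W[m][n] * x[n] for n in range(N)) for m in range(M)]

    # Instantaneous output correlation with its diagonal removed:
    # E = y y^H - diag(y y^H)
    E = [[0j if i == j else y[i] * y[j].conjugate() for j in range(M)]
         for i in range(M)]

    # Gradient of the decorrelation cost: 4 E W (x x^H) = 4 (E y) x^H
    Ey = [sum(E[i][j] * y[j] for j in range(M)) for i in range(M)]
    g1 = [[4 * Ey[m] * x[n].conjugate() for n in range(N)] for m in range(M)]

    # Gradient of the geometric constraint: 2 (W A - I) A^H
    WA = matmul(W, A)
    D = [[WA[i][j] - (1 if i == j else 0) for j in range(M)] for i in range(M)]
    g2 = [[2 * v for v in row] for row in matmul(D, herm(A))]

    # Energy normalization (assumed): 1 / ||x x^H||^2 = 1 / (sum |x_n|^2)^2
    power = sum(abs(v) ** 2 for v in x)
    alpha = 1.0 / (power ** 2) if power > 0 else 0.0

    W_new = [[W[m][n] - mu * (alpha * g1[m][n] + g2[m][n]) for n in range(N)]
             for m in range(M)]
    return W_new, y
```

In use, `gss_step` would be called once per frame for every frequency bin, with `A` rebuilt whenever the localization estimates change.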
Post-Filter Overview. The noise estimate is the sum of two components (stationary + transient).
Background Noise Estimation. Minima-Controlled Recursive Averaging (MCRA; Cohen). The noise estimate is adapted during quiet periods. Applied to each source of interest. The initial estimate is provided directly from the microphones.
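As a rough illustration of how a minima-controlled recursive average adapts only during quiet periods, here is a simplified single-bin tracker. It omits the frequency smoothing of the full method, and the class name `McraBin` and all parameter values (`alpha_s`, `alpha_d`, `alpha_p`, `delta`, `window`) are assumptions chosen for the example rather than values from the slides.

```python
class McraBin:
    """Simplified MCRA-style noise tracker for one frequency bin."""

    def __init__(self, alpha_s=0.8, alpha_d=0.95, alpha_p=0.2,
                 delta=5.0, window=100):
        self.alpha_s = alpha_s  # smoothing of the input power
        self.alpha_d = alpha_d  # base smoothing of the noise estimate
        self.alpha_p = alpha_p  # smoothing of the presence probability
        self.delta = delta      # power-to-minimum ratio that flags speech
        self.window = window    # frames between minimum-tracker resets
        self.S = None           # smoothed power
        self.S_min = 0.0        # running minimum of the smoothed power
        self.S_tmp = 0.0        # minimum since the last reset
        self.p = 0.0            # speech presence probability
        self.noise = 0.0        # noise power estimate
        self.count = 0

    def update(self, power):
        """Feed |Y(k,l)|^2 for one frame and return the noise estimate."""
        if self.S is None:
            # Initialize every state variable from the first frame.
            self.S = self.S_min = self.S_tmp = self.noise = power
            return self.noise

        self.S = self.alpha_s * self.S + (1 - self.alpha_s) * power
        self.S_min = min(self.S_min, self.S)
        self.S_tmp = min(self.S_tmp, self.S)
        self.count += 1
        if self.count >= self.window:
            # Periodic reset so the minimum can follow rising noise floors.
            self.S_min = min(self.S_tmp, self.S)
            self.S_tmp = self.S
            self.count = 0

        # Speech is flagged when the power sits well above its recent minimum.
        present = 1.0 if self.S > self.delta * self.S_min else 0.0
        self.p = self.alpha_p * self.p + (1 - self.alpha_p) * present

        # Adaptation slows towards a standstill as the probability nears 1.
        a = self.alpha_d + (1 - self.alpha_d) * self.p
        self.noise = a * self.noise + (1 - a) * power
        return self.noise
```

One tracker per bin and per separated source would match the slide's "applied to each source of interest"; seeding it from the microphone spectrum corresponds to the initial estimate mentioned above.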
Interference Estimation. Source separation leaks because of: incomplete adaptation; inaccuracy in localization; reverberation; imperfect microphones. The interference is estimated from the other separated sources.
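One way to read "estimation from other separated sources" is that the leakage into a given source is taken as a fixed fraction of the summed power of all the other separated sources, and then added to the stationary noise estimate. The sketch below follows that reading; the leak factor `eta` and the function names are hypothetical, and the slides do not state the actual value or smoothing used.

```python
def leak_estimates(smoothed, eta=0.1):
    """Estimate the leakage into each separated source.

    smoothed[m][k] is the (recursively smoothed) power of separated
    source m at frequency bin k. The leakage into source m is assumed
    to be eta times the summed power of every other source.
    """
    n_bins = len(smoothed[0])
    totals = [sum(src[k] for src in smoothed) for k in range(n_bins)]
    return [[eta * (totals[k] - src[k]) for k in range(n_bins)]
            for src in smoothed]


def total_noise(stationary, leak):
    """Per-bin noise estimate: stationary part plus transient (leak) part."""
    return [s + l for s, l in zip(stationary, leak)]
```

For three sources, the leak estimate for the first is `eta` times the sum of the second and third, bin by bin.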
Suppression Rule. Ephraim and Malah spectral estimator. The gain is modified to take into account the probability of the source being present (Cohen).
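The slide does not say which variant of the estimator is used, so the sketch below assumes the log-spectral amplitude form of the Ephraim and Malah gain, a decision-directed a priori SNR, and a presence-weighted gain in the style of Cohen's OM-LSA (G = G_H1^p * G_min^(1-p)). All function names and the defaults `alpha` and `g_min` are illustrative.

```python
import math

EULER_GAMMA = 0.5772156649015329


def exp_integral_e1(x):
    """E1(x) = integral from x to infinity of exp(-t)/t dt, for x > 0."""
    if x <= 1.0:
        # Power series: -gamma - ln(x) - sum_{k>=1} (-x)^k / (k * k!)
        total, term = 0.0, 1.0
        for k in range(1, 40):
            term *= -x / k      # term is now (-x)^k / k!
            total += term / k
        return -EULER_GAMMA - math.log(x) - total
    # Continued fraction, evaluated from the tail, for x > 1.
    f = 0.0
    for n in range(60, 0, -1):
        f = n / (1.0 + n / (x + f))
    return math.exp(-x) / (x + f)


def decision_directed_xi(prev_clean_power, noise_power, gamma, alpha=0.98):
    """A priori SNR from the previous clean estimate and the current
    a posteriori SNR gamma = |Y|^2 / noise_power."""
    return (alpha * prev_clean_power / noise_power
            + (1.0 - alpha) * max(gamma - 1.0, 0.0))


def lsa_gain(xi, gamma):
    """Log-spectral amplitude gain, assuming the source is present."""
    v = max(gamma * xi / (1.0 + xi), 1e-10)  # guard against log(0)
    return xi / (1.0 + xi) * math.exp(0.5 * exp_integral_e1(v))


def presence_weighted_gain(xi, gamma, p, g_min=0.1):
    """Blend the gain with a floor g_min using presence probability p."""
    return lsa_gain(xi, gamma) ** p * g_min ** (1.0 - p)
```

With `p` close to 1 the result approaches the plain estimator gain; with `p` close to 0 it falls back to the floor `g_min`, which limits musical noise when the source is absent.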
Experimental Setup. Array of 8 inexpensive microphones on a Pioneer2 robot. Automatic localization. Noisy conditions. 350 ms reverberation time.
Results (Signal-to-Noise Ratio). Three voices were recorded separately, so a clean signal is available.
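Having a clean reference is what makes an SNR figure computable: everything in the processed output that differs from the reference can be counted as noise. The following is a generic time-domain definition given as a sketch; the slides do not specify the exact measure used, and `snr_db` is a hypothetical helper.

```python
import math


def snr_db(clean, processed):
    """SNR in dB of `processed` against the time-aligned `clean` reference.

    Both arguments are equal-length sequences of samples. Any difference
    between the two is treated as noise.
    """
    signal = sum(s * s for s in clean)
    noise = sum((p - s) ** 2 for s, p in zip(clean, processed))
    if noise == 0:
        return math.inf    # identical signals
    if signal == 0:
        return -math.inf   # silent reference
    return 10.0 * math.log10(signal / noise)
```

Comparing `snr_db` for the raw microphone input, the GSS output, and the post-filter output against the same reference gives the kind of before/after comparison this slide reports.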
Results (spectrograms). [Figure: spectrograms of the input, the GSS output, the post-filter output, and the reference]
Results (recognition with post-filter). Japanese isolated word recognition (SIG2 robot); 3 simultaneous sources; 200-word vocabulary; 90 degrees separation. Recognition rate by source position (columns as labeled on the slide: mixed, GSS only, GSS+pf): right 66%, 71%; left 15%, 21%; center 41%, 53%. 14% reduction in error rate.
Conclusion. Geometric Source Separation: real-time minimization of leakage. Source separation post-filter: interference estimated using the other sources. Future work: robustness to reverberation; better integration with speech recognition; using the post-filter to estimate ASR feature reliability.
Questions?
