Matching the Analysis Scheme to the Signal Fritz Menzer - PowerPoint PPT Presentation

Time-Frequency Analysis for Audio Workshop Matching the Analysis Scheme to the Signal Fritz Menzer (fritz.menzer@epfl.ch) Communication Systems, 5 th year Ecole Polytechnique F´ ed´ erale de Lausanne 15th April, 2004

Overview 1 Introduction 3 2 Perfect Reconstruction - who cares? 4 2.1 Definition of perfect reconstruction . . . . . . . . . . . . . . . . . . 4 2.2 Do we need perfect reconstruction? . . . . . . . . . . . . . . . . . . 5 3 Harmonic Band Wavelet Transform 7 3.1 Coefficient modeling . . . . . . . . . . . . . . . . . . . . . . . . . . 10 3.2 Advantages / Drawbacks . . . . . . . . . . . . . . . . . . . . . . . . 11 4 From HBWT to inharmonic sound modeling 12 4.1 Taking filters from different PR filterbanks . . . . . . . . . . . . . . 13 4.2 Why aliasing is not a problem . . . . . . . . . . . . . . . . . . . . . 14 4.3 Method Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 4.4 Sounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5 Time-Frequency Analysis and Granular Synthesis 19 5.1 Time-domain effects . . . . . . . . . . . . . . . . . . . . . . . . . . 25 5.2 Scale of all grains in a 1024-band full-tree wavelet decomposition . . 26 A References 27 2

1 Introduction • If you know what you’re looking at, you can examine it more precisely. 3

2 Perfect Reconstruction - who cares? 2.1 Definition of perfect reconstruction • Definition: Perfect Reconstruction (PR) method: method providing direct and inverse transforms T and T − 1 such that for any signal s , T − 1 ( T ( s )) = s • FFT based methods, Cosine Modulated Filterbanks and Wavelet transforms are usually PR methods. • Simple operations like filtering or distortion do not necessarily allow PR (i.e. it may be impossible to find T − 1 ). Example: Quantisation obviously does not allow to reconstuct the original signal perfectly. 4

2.2 Do we need perfect reconstruction? 1.5 150 1 0.5 100 0 −0.5 50 −1 −1.5 −2 0 0 5 10 15 20 0 5 10 15 20 1.5 80 1 60 0.5 0 40 −0.5 −1 20 −1.5 −2 0 0 5 10 15 20 0 5 10 15 20 samples frequency [kHz] Noise Noise, down- and upsampled by 4 5

Do we need perfect reconstruction? • Not needed for: – Modifying a signal – Handling noise – If the nature of the signal is known • Why use PR methods for compression? – Generality (ideally any signal can be treated) – Localising the source of errors! 6

3 Harmonic Band Wavelet Transform (Polotti and Evangelista, 2000) ... x ( n ) g 0 ( k ) φ ( k ) φ ( k ) DC Comp. ✲ ✲ ✲ ✲ ✲ ✲ ✲ ✲ P 2 2 ❄ ❄ ❄ ψ ( k ) ψ ( k ) ✲ ✲ ✲ ✲ ✲ ✲ 2 2 ❄ ❄ ... g 1 ( k ) φ ( k ) φ ( k ) Sinusoidal ✲ ✲ ✲ ✲ ✲ ✲ ✲ ✲ 2 2 P ❄ ❄ ❄ Part ψ ( k ) ψ ( k ) ✲ ✲ ✲ ✲ ✲ ✲ 2 2 ❄ ❄ ❄ ... ... ... ... g P − 1 ( k ) φ ( k ) φ ( k ) Sinusoidal ✲ ✲ ✲ ✲ ✲ ✲ ✲ ✲ 2 2 P ❄ ❄ ❄ Part ψ ( k ) ψ ( k ) ✲ ✲ ✲ ✲ ✲ ✲ 2 2 ❄ ❄ 7

10000 9000 8000 7000 6000 frequency [Hz] 5000 4000 3000 2000 1000 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 time [sec]

10000 9000 8000 7000 6000 frequency [Hz] 5000 4000 3000 2000 1000 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 time [sec]

� � 3.1 Coefficient modeling • Wavelet Transform • Model the scale residual sinusoidally • Model the wavelet coefficients using LPC 7 scale Φ ω | 4,0 ( ) | residual 6 | � Ψ 4,0 ( )| ω N =4 5 | � Ψ 3,0 ( )| ω n =3 4 3 Ψ ω | � 2,0 ( ) n =2 | 2 Ψ ω | � 1,0 ( ) | n =1 1 0 0 0.5 1 1.5 2 2.5 3 3.5 10

3.2 Advantages / Drawbacks + Meaningful adaptation of frequency and time resolution = ⇒ Visually better resolution + Reasonable model for the coefficients − Works only for monophonic, harmonic sounds − No model for the transients 11

4 From HBWT to inharmonic sound modeling 1800 1600 1400 1200 frequency [Hz] 1000 800 600 400 200 1 2 3 4 5 6 7 time [sec]

4.1 Taking filters from different PR filterbanks 1 st partial 2 nd partial 3 rd partial . . . ω 1 st partial 2 nd partial 3 rd partial . . . ω

4.2 Why aliasing is not a problem If a sinusoid of the form  ˆ   kπ sin P t + ϕ  is the input to a P-channel cosine modulated filterbank, only two bands will output nonzero coefficients: ˆ kπ P ) | � = 0 ⇔ k ∈ { ˆ k − 1 , ˆ | H k ( e j k } partial’s frequency ω = ⇒ there is no aliasing of the sinusoidal part, but only of the part that we model as noise! 14

4.3 Method Overview Analysis analyse signal → find N partials → determine filterbank ↓ calculate 2 N sets of filterbank coefficients + residual ↓ calculate wavelet transform (WT) of filterbank coefficients ↓ model the WT coefficients sinusoidally and with LPC Synthesis reconstruct WT coefficients ↓ perform inverse wavelet transform → get filterbank coefficients ↓ inverse filterbank ↓ add residual (or not) 17

4.4 Sounds • Original Gong • Reconstructed from the Filterbank Coefficients • Synthesized from model parameters • 1 octave pitch-shifted Gong • Time-stretched Gong • Sinusoidal-only Gong • First wavelet scale only • Harmonic Gong 18

5 Time-Frequency Analysis and Granular Synthesis • Any Time-Frequency Transform implements a sort of Granular Synthesis. • Each coefficient corresponds to a grain • Grains are played at precise instants (instead of randomly) • To produce a grain, set all coefficients to zero, except one that will be set to one. Then perform the inverse transform. 19

Windowed FFT (STFT) grain 2 x 10 −3 1.5 1 0.5 0 −0.5 −1 −1.5 −2 0 2 4 6 8 10 12 time [msec] play 20

Cosine Modulated Filterbank grain 0.15 0.1 0.05 0 −0.05 −0.1 −0.15 −0.2 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 time [msec] play 21

Full-tree wavelet “grain” 0.15 0.1 0.05 0 −0.05 −0.1 −0.15 −0.2 0 10 20 30 40 50 60 time [msec] play 22

HBWT grain (noise part) 0.15 0.1 0.05 0 −0.05 −0.1 −0.15 −0.2 0 2 4 6 8 10 12 14 16 time [msec] play 23

HBWT grain (sinusoidal part) 0.05 0.04 0.03 0.02 0.01 0 −0.01 −0.02 −0.03 −0.04 −0.05 0 20 40 60 80 100 120 140 160 time [msec] play 24

5.1 Time-domain effects 0.15 0.1 0.05 0 −0.05 −0.1 −0.15 −0.2 0 1 2 3 4 5 6 7 time [msec] 0.15 0.1 0.05 0 −0.05 −0.1 −0.15 −0.2 0 1 2 3 4 5 6 7 time [msec] Channel 8: one grain played continuously Channel 9: one grain played continuously 25

5.2 Scale of all grains in a 1024-band full-tree wavelet decomposition x 10 4 2.2 2 1.8 1.6 1.4 frequency [Hz] 1.2 1 0.8 0.6 0.4 0.2 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 time [sec] play 26

A References • Article on Harmonic Band Wavelet Transform by Polotti and Evangelista http://lcavwww.epfl.ch/publications/publications/2000/PolottiE00b.pdf • DAFx 2002 paper on adaptation to inharmonic sounds http://lcavwww.epfl.ch/publications/publications/2002/PolottiE02.pdf • Some material (presentation slides, Matlab functions and pure data objects) http://www.xsmusic.ch/ 27

Matching the Analysis Scheme to the Signal Fritz Menzer - PowerPoint PPT Presentation

Time-Frequency Analysis for Audio Workshop Matching the Analysis Scheme to the Signal Fritz Menzer (fritz.menzer@epfl.ch) Communication Systems, 5 th year Ecole Polytechnique F ed erale de Lausanne 15th April, 2004 Overview 1

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

Scheme Announcements Scheme Scheme is a Dialect of Lisp 4 Scheme is a Dialect of Lisp What

Tx Signal: 1000 Hz sine wave; Attenuation; Random noise with 0.5ms spike Tx Signal Noise Rx

Matching Bipartite Matching Input Given a (undirected) graph G = ( V , E ) Input Given a bipartite

What can Scheme learn from JavaScript? Scheme Workshop 2014 Andy Wingo Me and Scheme Guile

THE TRAINING LAYOFF SCHEME THE TRAINING LAYOFF SCHEME 1 October 2009 The Training Layoff Scheme

Government Pension Scheme (LGPS) Scheme Administration Defined Benefit Scheme National

Countryside Stewardship Scheme Farm Update North Events Overview and Update on scheme for 2018

Hokio Drainage Scheme Scheme Facts Scheme Assets. 4 floodgated culverts 45 km of

Speech Processing 15-492/18-492 Speech Synthesis Signal Processing Signal Manipulation Signal

Waveform Generation Fundamental part of signal processing is the signal. Within the

Sampling a Signal an analog signal together with some samples of the signal. The samples

Signal Types Recall even digital signals are just voltages Analog signal Continuous

Signal Types Recall even digital signals are just voltages Analog signal Continuous

Cloudius Systems presents: Writing a Modern Highly Scalable Application Where Linux Helps You,

Linear Prediction Analysis of Speech Sounds Berlin Chen 2004 References: 1. X. Huang et. al.,

Optimize Primary Care Teams to Meet Patients Medical AND Behavioral Needs A 12- month IHI

Vocoders 1 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass

Precision Measurement of Parity-violation in Deep Inelastic Scattering Over a Broad Kinematic

Instrumenting Your Business For Success With DevOps Robert Benefield Evolve Beyond, Ltd

Digital Cinematography Color Science Basics 11/5/15 2:56 PM 40211

A Study of Embedding Operations and Locations for Steganography in H.264 Video Andreas Neufeld

Sambuz

Useful Links

Newsletter

Mail Us