Tracking Perform ance of the MMax Conjugate Gradient Algorithm Bei - PowerPoint PPT Presentation

Tracking Perform ance of the MMax Conjugate Gradient Algorithm Bei Xie and Tam al Bose Wireless@VT Bradley Dept. of Electrical and Computer Engineering Virginia Tech

Outline  Motivation  Background  Conjugate Gradient (CG) Algorithm  Partial Update Methods  Partial Update CG Algorithm  Tracking Performance Analysis  Simulations  Summary

Motivation  How to reduce the computational complexity of an adaptive filter? Solutions: Using partial update (PU) methods.  What are the tracking performance by using partial update methods to CG?

Conjugate Gradient Algorithm  Solve system with form w R b Equivalent to find a w to minimize the cost function   1   T T J n n n n w ( ) w ( ) Rw ( ) w ( ) b 2 The gradient of the cost function is:      J n n w ( ) ( ) w R b w The residual vector is defined as:       n J n n w g ( ) w ( ) b R ( ) w

A line search method for minimizing the cost function has the form:     n n n w ( ) w ( 1 ) p ( ) p (n) is the direction vector CG chooses the direction is conjugately orthogonal to the previous directions  ,  T n m n m ( Rp ) ( ) 0 p

      J n n Now we find to minimize w ( 1 ) p ( )       J n n n w ( 1 ) ( ) p ( )          T T n n n n n R w p p p ( ) ( 1 ) ( ) ( ) ( ) b 0      T T n n n n R p ( ) b w ( 1 ) p ( ) g ( 1 )    n ( ) T T n n n n R R ( ) ( ) ( ) ( ) p p p p The residual vector is also equal to          n n n n n g ( ) b R w ( ) b R w ( 1 ) ( ) p ( )     n n n g ( 1 ) ( ) Rp ( ) The residual vector is orthogonal to the previous direction vectors,  ,  T n m n m ( ) ( ) 0 g p

CG chooses the direction in the form of    n n β n n p ( 1 ) g ( ) ( ) p ( ) Use Polak-Ribière (PR) method,   T   n n n ( ) ( 1 ) ( ) g g g  β n ( )   T n n g ( 1 ) g ( 1 ) PR method is chosen because it is a non-reset method and performs better for non-constant matrix R CG with PR method usually converges faster than Fletcher- Reeves (FR) method

CG Algorithm in adaptive filter system A basic adaptive filter system model is   T d n n v n * w ( ) ( ) ( ) x d(n) is the desired signal x (n)=[x(n),x(n-1),…,x(n-N+1)] T is the input data vector of an unknown system * ] T is the impulse response w * =[w 1 * ,w 2 * ,…,w N vector of the unknown system w * is constant for time-invariant system w * changes for time-varying sytem v(n) is a white noise

 To estimate the R and b in w R b The exponentially decaying data window is used n         n i T T n i i n n n R ( ) x ( ) x ( ) R ( 1 ) x ( ) x ( )  i 0 n         n i n d i i n d n n b ( ) ( ) x ( ) b ( 1 ) ( ) x ( )  i 0 λ is the forgetting factor

The CG algorithm in an adaptive filter system is summarized as: Initial conditions: w (0)= 0 , R (0)= 0 , p (1)= g (0)     T n n n n ( ) ( 1 ) ( ) ( ) R R x x  T n n p ( ) g ( 1 )          η is used to guarantee n ( ) , 0 . 5 T n n n R p ( ) ( ) p ( ) convergence     n n n ( ) ( 1 ) ( ) w w p             T n n n n n n n n n d n n n w g ( ) b ( ) R ( ) ( ) g ( 1 ) ( ) R ( ) p ( ) x ( ) ( ) x ( ) w ( 1 )   T   n n n g ( ) g ( 1 ) g ( )  β n ( )   T n n g ( 1 ) g ( 1 )    n n β n n ( 1 ) ( ) ( ) ( ) p g p

Partial Update (PU) Methods  Update part of the weights to save the computational complexity  Each update step, update M<N coefficients  Basic PU methods include periodic, sequential, stochastic, and MMax methods The periodic method: update the  weights at every S th iteration and copy the weights at the other iterations, where   N  M S    

The sequential method: choose the  subset of the weights in a round-robin fashion. The stochastic method: is a randomized  version of the sequential method. Usually a uniformly distributed random process will be applied. The MMax method: the elements of the  weight w are updated according to the position of the M largest elements of the input vector x (n).

Partial Update CG Algorithm The partial update CG algorithm in an adaptive filter system is summarized as:     T ˆ ˆ n n n n ˆ ˆ ( ) ( 1 ) ( ) ( ) R R x x  T n n p ( ) g ( 1 )          n ( ) , 0 . 5 T ˆ n n n p ( ) R ( ) p ( )     n n n ( ) ( 1 ) ( ) w w p           ˆ T n n n n n n d n n n ˆ g ( ) g ( 1 ) ( ) R ( ) p ( ) x ( ) ( ) x ( ) w ( 1 )   T   n n n g ( ) g ( 1 ) g ( )  β n ( )   T n n g ( 1 ) g ( 1 )    n n β n n p ( 1 ) g ( ) ( ) p ( )

 n n n ˆ ( ) ( ) ( ) x I x M   i n  ( ) 0 0 1   i n   0 ( )    n 2 ( ) I   M    0   i n    0 0 ( ) N N      i n M i n ( ) , ( ) 0 , 1 k k  k 1 The number of multiplications of CG is 3N 2 +10N+3 per sample The number of multiplications of PU CG is 2N 2 +M 2 +9N+M+3 per sample

The MMax method: the elements of the input x are chosen according to the position of the M largest elements of the input vector x (n)     if n n M 1 x ( ) max x ( ) ,    k l N l i n  1 ( ) k otherwise  0 SORTLINE method is used for comparison Mmax CG needs 2 + 2 log 2 N comparisons

Tracking Performance Analysis of PU CG Desired signal becomes:  x  T d n n n v n * ( ) ( ) w ( ) ( ) Time-varying system w * (n) uses a first-order Markov model      n n n * * w ( ) w ( 1 ) ( )  is very close to unity  ( n is process noise )

Assumptions: • The coefficient error w (n)- w *(n) is small and independent of the input signal x (n) at steady state • White noise v(n) is independent of the input signal x (n) and is independent of process noise η (n) • Input signal x (n) is independent of both v(n) and η (n)

At steady state, the MSE of PU CG with correlated input is     2 2   T E e n E d n n n ( ) ( ) x ( ) w ( )        2 1 ~ ~            T ˆ tr 2 2 1 R R R R R      v v     2   1 1      T E n n R x ( ) x ( )   ~ ~     T E n n n ˆ R x ( ) x ( ) ( 1 ) R ( ) ~ ~     T n n n n ˆ R ( ) R ( 1 ) x ( ) x ( )    ˆ T E n n ˆ ˆ R x ( ) x ( )

At steady state, the MSE of PU CG with white input is     N ( 1 )  2        E e n 2 2 2 2 4 ( ) ~ v v x x x   ˆ 1  2   tr 2 ( R )  x   2 1   tr 2 ( R ) x   ˆ tr 2 ( R ) x ˆ ~   tr 2 ( R ) ~ x   Variance of noise   E v n 2 2 ( ) v

For MMax method and white input,       2 2 2 2 , ~ x x x x ˆ κ <1, κ is close to 1      N 2 ( 1 ) 2       E e n tr 2 2 2 ( ) ( R )    v v x      2 1 1 For white process noise η (n)      N 2 ( 1 ) 2        E e n 2 2 2 2 ( )    v v x      2 1 1

Simulations System identification model The initial impulse response of unknown system is 16-order (N=16) FIR filter w*(n)=[0.01,0.02,-0.04,-0.08,0.15,-0.3,0.45, 0.6, 0.6,0.45,-0.3,0.15,-0.08,-0.04,0.02,0.01] T

The variance of the input noise σ v 2 =0.0001 Parameter λ =0.9 and η =0.6 White input, variance is 1  in Markov model is 0.9998

Tracking Performance of MMax CG 30 30 MMax CG, M=8,   =0.0001 MMax CG, M=4,   =0.0001 20 20 MMax CG, M=8,   =0.001 MMax CG, M=4,   =0.001 MMax CG, M=8,   =0.01 MMax CG, M=4,   =0.01 10 10 0 0 MSE(dB) MSE(dB) -10 -10 -20 -20 -30 -30 -40 -40 -50 -50 0 500 1000 1500 2000 2500 3000 3500 4000 0 500 1000 1500 2000 2500 3000 3500 4000 samples samples Comparison of MSE of Comparison of MSE of MMax CG for varying MMax CG for varying process noise η , M=4 process noise η , M=8

Tracking Perform ance of the MMax Conjugate Gradient Algorithm Bei - PowerPoint PPT Presentation

Tracking Perform ance of the MMax Conjugate Gradient Algorithm Bei Xie and Tam al Bose Wireless@VT Bradley Dept. of Electrical and Computer Engineering Virginia Tech Outline Motivation Background Conjugate Gradient (CG) Algorithm

Conjugate Gradient (CG) Majid Lesani Alireza Masoum Overview Backpropagation Gradient

Validation, Synthesis Validation, Synthesis and Perform ance Perform ance Evaluation of of

Conjugate gradient methods for stochastic Galerkin finite element saddle point matrices B T A

Tourism Exp o Perform a nce Im p rov em ent 30 May 2013 Pw C Hum a n Resource Serv ices (HRS)

Resources for Computational Linguistics Annotation Tools: RSTTool &MMAX Presentation by

Bryan Veal Annie Foong Intel R&D Perform ance Scalability of a Multi-core W eb Server

GeoCom putational I ntelligence and GeoCom putational I ntelligence and High-perform ance

Choosing Priors Probability Intervals 18.05 Spring 2014 Conjugate priors A prior is conjugate

Conjugate gradient training algorithm Steepest descent algorithm Definitions: So far: j

Todays Discussion Conjugate gradient algorithm To date: 1. Choose an initial weight vector w

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Gradient Analysis NMDS Indirect Gradient Analysis NMDS Direct Gradient Analysis Objective:

Overview Introduction Object Tracking Vehicle Tracking Theory & Implementation

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

SPEEDING UP CONJUGATE GRADIENT SOLVERS BY 10X Mathias Wagner, Developer Technology Engineer GTC

Minimization strategy for choice of the stopping index in conjugate gradient type methods for

Creating You Lift Yourself by Yourself Mr. Paritosh Pathak Strategic Networking Coach

Update on Carbon Reporting project Presentation to LowCVP Fuels Working Group 2 Draft

COVERAGE FOR THE COST OF MITIGATING DAMAGE Eric Friend, Traylor Bros. Inc. Lynette Thompson,

Strategies For Public Entities June 12, 2017 Topics Superintendents View Legal Chapter 78

Business Update: FY-20 May 2020 Key Highlights 2 Key Highlights 3 Key Highlights Q4-FY 20

The CAT Vehicle Testbed: A Simulator with Hardware in the Loop for Autonomous Vehicle

D IVIDE AND C ONQUER : Partitioning OSPF networks with SDN Marcel Caria, Tamal Das, and Admela

INSTITUTE OF DEVELOPMENT STUDIES KOLKATA DD 27/D, SECTOR I, SALT LAKE, KOLKATA 700 064 Admission to

Tracking Perform ance of the MMax Conjugate Gradient Algorithm Bei - PowerPoint PPT Presentation

Tracking Perform ance of the MMax Conjugate Gradient Algorithm Bei Xie and Tam al Bose Wireless@VT Bradley Dept. of Electrical and Computer Engineering Virginia Tech Outline Motivation Background Conjugate Gradient (CG) Algorithm

Conjugate Gradient (CG) Majid Lesani Alireza Masoum Overview Backpropagation Gradient

Validation, Synthesis Validation, Synthesis and Perform ance Perform ance Evaluation of of

Conjugate gradient methods for stochastic Galerkin finite element saddle point matrices B T A

Tourism Exp o Perform a nce Im p rov em ent 30 May 2013 Pw C Hum a n Resource Serv ices (HRS)

Resources for Computational Linguistics Annotation Tools: RSTTool &amp;MMAX Presentation by

Bryan Veal Annie Foong Intel R&amp;D Perform ance Scalability of a Multi-core W eb Server

GeoCom putational I ntelligence and GeoCom putational I ntelligence and High-perform ance

Choosing Priors Probability Intervals 18.05 Spring 2014 Conjugate priors A prior is conjugate

Conjugate gradient training algorithm Steepest descent algorithm Definitions: So far: j

Todays Discussion Conjugate gradient algorithm To date: 1. Choose an initial weight vector w

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Gradient Analysis NMDS Indirect Gradient Analysis NMDS Direct Gradient Analysis Objective:

Overview Introduction Object Tracking Vehicle Tracking Theory &amp; Implementation

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

SPEEDING UP CONJUGATE GRADIENT SOLVERS BY 10X Mathias Wagner, Developer Technology Engineer GTC

Minimization strategy for choice of the stopping index in conjugate gradient type methods for

Creating You Lift Yourself by Yourself Mr. Paritosh Pathak Strategic Networking Coach

Update on Carbon Reporting project Presentation to LowCVP Fuels Working Group 2 Draft

COVERAGE FOR THE COST OF MITIGATING DAMAGE Eric Friend, Traylor Bros. Inc. Lynette Thompson,

Strategies For Public Entities June 12, 2017 Topics Superintendents View Legal Chapter 78

Business Update: FY-20 May 2020 Key Highlights 2 Key Highlights 3 Key Highlights Q4-FY 20

The CAT Vehicle Testbed: A Simulator with Hardware in the Loop for Autonomous Vehicle

D IVIDE AND C ONQUER : Partitioning OSPF networks with SDN Marcel Caria, Tamal Das, and Admela

INSTITUTE OF DEVELOPMENT STUDIES KOLKATA DD 27/D, SECTOR I, SALT LAKE, KOLKATA 700 064 Admission to

Resources for Computational Linguistics Annotation Tools: RSTTool &MMAX Presentation by

Bryan Veal Annie Foong Intel R&D Perform ance Scalability of a Multi-core W eb Server

Overview Introduction Object Tracking Vehicle Tracking Theory & Implementation