discrete time chen fliess series for
play

Discrete-time ChenFliess Series for Learning and Adaptive Control - PowerPoint PPT Presentation

Discrete-time ChenFliess Series for Learning and Adaptive Control W. Steven Gray Old Dominion University, Norfolk, Virginia USA ACPMS Seminar August 21, 2020 Joint work with Luis A. Duffaut Espinosa and G. S. Venkatesh . Supported by


  1. Discrete-time Chen–Fliess Series for Learning and Adaptive Control ∗ W. Steven Gray Old Dominion University, Norfolk, Virginia USA ACPMS Seminar August 21, 2020 ∗ Joint work with Luis A. Duffaut Espinosa and G. S. Venkatesh . Supported by NSF grants CMMI-1839378 and CMMI-1839387.

  2. ACPMS Seminar Overview 1. Motivation - The canonical control problem 2. Chen series and Chen–Fliess series 3. Discrete-time analogues 4. Implementation of a learning unit 5. Application to adaptive control 2

  3. ACPMS Seminar 1. Motivation - The canonical control problem y u F � plant Fig. 1.1: Open-loop control • Suppose F is an operator mapping a set of input functions U to a set of output functions Y . • Select some desired y d ∈ Range( F ) ⊆ Y . • The canonical control problem is to determine a right inverse u = F − 1 [ y d ] ∈ U such that F [ u ] = ( F ◦ F − 1 )[ y d ] = y d . • Whenever F − 1 can be computed explicitly, this is called open-loop control. 3

  4. ACPMS Seminar • In many applications F is known to be realized by some finite dimensional state space model m � z = g 0 ( z ) + ˙ g i ( z ) u i , z (0) = z 0 i =1 y j = h j ( z ) , j = 1 , . . . , ℓ, where the g i are vector fields in local coordinates, and h j maps the state to the j -th output. • If ( g, h, z 0 ) are known, then under certain conditions (well defined relative degree), F − 1 exists on a neighborhood of z 0 and is computable. • But normally a state space model is only known approximately, so this option is not always feasible. 4

  5. ACPMS Seminar u e y d y H � F � + _ controller plant Fig. 1.2: Closed-loop control • A more practical approach is to design a mapping H : Y → U so that lim t → ∞ | y d ( t ) − y ( t ) | = 0. • This is closed-loop control via dynamic inversion. • In general closed-loop control is known to reduce the sensitivity of the output to the plant, ∂y/∂F . • The design of H still relies on some knowledge of F , usually a nominal state space model ( g, h, z 0 ). 5

  6. ACPMS Seminar u e y d y H � F � + _ plant adaptive � controller learning unit Fig. 1.3: Adaptive closed-loop control • When F is unknown, the idea is to learn F as the system operates and then tune H in real-time. This is known as adaptive control. • All such control systems are implemented in discrete-time. • The goal is to describe an adaptive control system whose implementation utilizes a discrete-time Chen–Fliess series. 6

  7. ACPMS Seminar 2. Chen series and Chen–Fliess series Definition 2.1: (Chen, 1952) Let X = { x 0 , x 1 , . . . , x m } . For any fixed u ∈ L m 1 [ t 0 , t 1 ] and t ∈ [ t 0 , t 1 ] one can associate the formal power series in R �� X �� � P [ u ]( t, t 0 ) = η E η [ u ]( t, t 0 ) , η ∈ X ∗ where the map E η : L m 1 [ t 0 , t 1 ] → C [ t 0 , t 1 ] is defined inductively by setting E ∅ [ u ] = 1 and letting � t E x i η [ u ]( t, t 0 ) = u i ( τ ) E η [ u ]( τ, t 0 ) dτ, t 0 with x i ∈ X , η ∈ X ∗ , and u 0 ( t ) := 1. Such a series is called a Chen series. Remark: For any fixed t ≥ 0, P [ u ]( t ) := P [ u ]( t, 0) is an exponential Lie series satisfying � � m d � dtP [ u ] = x 0 + x i u i P [ u ] , P [ u ](0) = 1 . i =1 7

  8. � � ACPMS Seminar u u v# � u v v t t t c t b t d t c t b t d t a t a 0 0 Fig. 2.1 The catenation of two inputs u and v at t = τ The set of functions � L m L m 1 (0) := 1 [0 , T ] 0 ≤ T < ∞ is a monoid under this catenation operator. Theorem 2.1: (Chen’s identity) Given ( u, v ) ∈ L m 1 [ t a , t b ] × L m 1 [ t c , t d ], τ ∈ [ t a , t b ], and t ∈ [ τ, τ + ( t d − t c )] it follows that P [ v ](( t − τ ) + t c , t c ) P [ u ]( τ, t a ) = P [ v # τ u ]( t, t a ) . 8

  9. ACPMS Seminar Remarks: • The set of Chen series G C ( X ) = { P [ u ]( t ) ∈ R �� X �� : u ∈ L m 1 [0 , T ] , 0 ≤ t ≤ T < ∞} defines a monoid under the Cauchy product. • P : L m 1 (0) → G C ( X ) acts as a monoid homomorphism. • G C ( X ) constitutes a group if the drift letter x 0 is omitted. Definition 2.2: (Fliess, 1981) For c ∈ R ℓ �� X �� , the corresponding Chen–Fliess series is � y ( t ) = F c [ u ]( t ) := ( c, η ) E η [ u ]( t ) η ∈ X ∗ � = ( c, η )( P [ u ]( t ) , η ) η ∈ X ∗ =: ( c, P [ u ]( t )) . 9

  10. ACPMS Seminar Remarks: • If there exists real numbers K, M ≥ 0 such that | ( c, η ) | ≤ KM | η | | η | ! , ∀ η ∈ X ∗ then the series defining F c converges. • F c defined is said to be realizable when there exists a state space model m � z = g 0 ( z ) + ˙ g i ( z ) u i , z ( t 0 ) = z 0 i =1 y j = h j ( z ) , j = 1 , 2 , . . . , ℓ, such that y j = F c j [ u ] = h j ( z ), j = 1 , 2 , . . . , ℓ . • In this case, for any word η = x i k · · · x i 1 ∈ X ∗ ( c j , η ) = L g η h j ( z 0 ) := L g i 1 · · · L g ik h j ( z 0 ) , where L g i h j is the Lie derivative of h j with respect to g i . 10

  11. ACPMS Seminar 3. Discrete-time analogues Here inputs are sequences from the normed linear space l m +1 ( N 0 ) := { ˆ u = (ˆ u ( N 0 ) , ˆ u ( N 0 + 1) , . . . ) : � ˆ u � ∞ < ∞} , ∞ u m ( N )] T , N ≥ N 0 . where ˆ u ( N ) := [ˆ u 0 ( N ) , ˆ u 1 ( N ) , . . . , ˆ u ∈ l m +1 Definition 3.1: Given any N ≥ N 0 and ˆ ( N 0 ), a discrete-time ∞ Chen series is defined as � S [ˆ u ]( N, N 0 ) = ηS η [ˆ u ]( N, N 0 ) , η ∈ X ∗ where N � S x i η [ˆ u ]( N, N 0 ) = u i ( k ) S η [ˆ ˆ u ]( k, N 0 ) k = N 0 with x i ∈ X , η ∈ X ∗ , and S ∅ [ˆ u ]( N, N 0 ) := 1. Remark: If N 0 = 0 then S [ˆ u ]( N, 0) is abbreviated as S [ˆ u ]( N ). 11

  12. ACPMS Seminar The discrete-time Chen series S [ˆ u ] satisfies a difference equation. For η = x i k · · · x i 1 ∈ X ∗ , define ˆ u i k ( N ) · · · ˆ u η ( N ) = ˆ u i 1 ( N ) and � c u ( N ) = u η ( N ) η. ˆ η ∈ X ∗ Example 3.1: If X = { x 1 } , then ˆ u x 1 ( N ) = ˆ u 1 ( N ) and ∞ u 1 ( N ) x 1 ) k = (1 − ˆ � u 1 ( N ) x 1 ) − 1 . c u ( N ) = (ˆ k =0 u ∈ l m +1 Theorem 3.1: For any ˆ ( N 0 ) and N ≥ N 0 , ∞ S [ˆ u ]( N + 1 , N 0 ) = c u ( N + 1) S [ˆ u ]( N, N 0 ) with S [ˆ u ]( N 0 , N 0 ) = c u ( N 0 ). In addition, ← − − N � S [ˆ u ]( N, N 0 ) = c u ( i ) . i = N 0 12

  13. ACPMS Seminar Remarks: • There is a discrete-time analogue of Chen’s identity. • Both l m +1 ∞ ,e (0) := l m +1 (0) ∪ { ˆ 0 } and the set of discrete-time Chen ∞ series, M C , form monoids. • S : l m +1 ∞ ,e (0) → M C is a monoid homomorphism. • Given c ∈ R ℓ �� X �� , the corresponding discrete-time Chen–Fliess series is defined as y ( N ) = ˆ � ˆ F c [ˆ u ]( N ) := ( c, η ) S η [ˆ u ]( N, N 0 ) η ∈ X ∗ = ( c, S [ˆ u ]( N, N 0 )) . • ˆ F c [ˆ u ] approximates its continuous-time counterpart, F c [ u ], with computable error bounds (Duffaut Espinosa, Ebrahimi-Fard, G., 2017). 13

  14. ACPMS Seminar Theorem 3.2: The monoid M C ( X ) has a faithful infinite dimensional ← − − − � N real representation Π given by Π( S [ˆ u ]( N )) = i =0 S ( i ), where S ( i ) is any matrix representation of the R -linear map on R �� X �� given by the catenation map C : d �→ c u ( i ) d . Example 3.2: If X = { x 1 } then for all i ≥ 0   1 0 0 0 · · ·   u 1 ( i ) ˆ 1 0 0 · · ·       u 2 ˆ 1 ( i ) u 1 ( i ) ˆ 1 0 · · · S ( i ) =     u 3 u 2   ˆ 1 ( i ) ˆ 1 ( i ) u 1 ( i ) ˆ 1 · · ·     . . . . ...   . . . . . . . . u ]( N )) = S ( N ) · · · S (0). and Π( S [ˆ Remark: The goal is to find a convenient monoid representation of M C ( X ) when X has more than one letter. 14

  15. � � ACPMS Seminar 4. Implementation of a learning unit y p discrete-time Chen-Fliess series � MSE parameter u estimator y Fig. 4.1 Learning unit based on a discrete-time Chen–Fliess series • The main idea is to approximate some unknown plant y = F c [ u ] by a truncated discrete-time Chen–Fliess series, y ( N ) = ˆ � F J ˆ c [ˆ u ]( N ) := ( c, η ) S η [ˆ u ]( N ) , η ∈ X ≤ J in order to predict future outputs. • All that is available to the learning unit is input-output data. 15

  16. ACPMS Seminar • The first step is to express ˆ F J c [ˆ u ]( N ) in the form of a regression y ( N ) = φ T ( N ) θ 0 , N ≥ 1 , ˆ where u ]( N )] T φ ( N ) = [ S η 1 [ˆ u ]( N ) S η 2 [ˆ u ]( N ) · · · S η l [ˆ θ 0 = [( c, η 1 ) ( c, η 2 ) · · · ( c, η l )] T with l = card( X ≤ J ) and assuming some fixed order ( η 1 , η 2 , . . . , η l ). • If an estimate of θ 0 is available at time N − 1, say ˆ θ ( N − 1), a corresponding prediction of ˆ y ( N ) is y p ( N ) := φ T ( N )ˆ θ ( N − 1) . ˆ • ˆ θ ( N − 1) can be generated using any textbook recursive MSE estimation algorithm. The objective is to find an analogous implementation for the regressor, φ ( N ). 16

Recommend


More recommend