Machine Learning for Signal Processing
Predicting and Estimation from Time Series
Bhiksha Raj Class 22. 14 Nov 2013
14 Nov 2013 11-755/18797 1
Time Series Bhiksha Raj Class 22. 14 Nov 2013 14 Nov 2013 - - PowerPoint PPT Presentation
Machine Learning for Signal Processing Predicting and Estimation from Time Series Bhiksha Raj Class 22. 14 Nov 2013 14 Nov 2013 11-755/18797 1 Administrivia No class on Tuesday.. Project Demos: 5 th December (Thursday). Before
14 Nov 2013 11-755/18797 1
14 Nov 2013 11-755/18797 2
automobile, if it is:
– Idling; or – Travelling at constant velocity; or – Accelerating; or – Decelerating
(SPL) in the sound
– The SPL is measured once per second
14 Nov 2013 11-755/18797 3
14 Nov 2013 11-755/18797 4
14 Nov 2013 11-755/18797 5
45 70 65 60 P(x|idle) P(x|decel) P(x|cruise) P(x|accel)
– Assuming all transitions from a state are equally probable
14 Nov 2013 11-755/18797 6
45 P(x|idle) Idling state 70 P(x|accel) Accelerating state 65 Cruising state 60 Decelerating state 0.5 0.5 0.33 0.33 0.33 0.33 0.33 0.25 0.25 0.25 0.33 0.25 I A C D I 0.5 0.5 A 1/3 1/3 1/3 C 1/3 1/3 1/3 D 0.25 0.25 0.25 0.25
14 Nov 2013 11-755/18797 7
Idling Accelerating Cruising Decelerating
0.25 0.25 0.25 0.25
– The observation modifies our belief in the state of the system
– Note, these don’t have to sum to 1 – In fact, since these are densities, any of them can be > 1
14 Nov 2013 11-755/18797 8
45 70 65 60 P(x|idle) P(x|decel) P(x|cruise) P(x|accel)
14 Nov 2013 11-755/18797 9
14 Nov 2013 11-755/18797 10
Idling Accelerating Cruising Decelerating
0.0 0.57 0.42 8.3 x 10-5
– P(idling|idling) = 0.5; – P(idling | deceleration) = 0.25 – P(idling at T=1| x0) = P(IT=0|x0) P(I|I) + P(DT=0|x0) P(I|D) = 2.1 x 10-5
– P(ST=1 | x0) = SST=0 P(ST=0 | x0) P(ST=1|ST=0)
14 Nov 2013 11-755/18797 11
I A C D I 0.5 0.5 A 1/3 1/3 1/3 C 1/3 1/3 1/3 D 0.25 0.25 0.25 0.25 I A C D
Idling Accelerating Cruising Decelerating 0.0 0.57 0.42 8.3 x 10-5
14 Nov 2013 11-755/18797 12
Idling Accelerating Cruising Decelerating
0.0 0.57 0.42 8.3 x 10-5 2.1x10-5 0.33 0.33 0.33 P(ST=1 | x0) = SST=0 P(ST=0 | x0) P(ST=1|ST=0)
14 Nov 2013 11-755/18797 13
45 70 65 60 P(x|idle) P(x|decel) P(x|cruise) P(x|accel)
14 Nov 2013 11-755/18797 14
– It is NOT a local decision based on x1 alone – Because of the Markov nature of the process, the state at T=0 affects the state at T=1
14 Nov 2013 11-755/18797 15
Idling Accelerating Cruising Decelerating
0.0 0.713 0.0014 0.285
14 Nov 2013 11-755/18797 16
Idling Accelerating Cruising Decelerating 0.0 0.713 0.0014 0.285 Idling Accelerating Cruising Decelerating 0.0 0.57 0.42 8.3 x 10-5
probability
states considers all observations x0 ... xT
– A natural outcome of the Markov nature of the model
for HMMs to within a normalizing constant
14 Nov 2013 11-755/18797 17
Predict the distribution of the state at T Update the distribution of the state at T after observing xT T=T+1
P(ST | x0:T-1) = SST-1 P(ST-1 | x0:T-1) P(ST|ST-1) P(ST | x0:T) = C. P(ST | x0:T-1) P(xT|ST)
PREDICT UPDATE
– P(x0:T,ST) = P(xT|ST) SST-1 P(x0:T-1, ST-1) P(ST|ST-1)
– P(ST|x0:T) = (SS’T P(x0:T,S’T))-1 P(x0:T,ST) = C P(x0:T,ST)
14 Nov 2013 11-755/18797 18
Predict the distribution of the state at T Update the distribution of the state at T after observing xT T=T+1 PREDICT UPDATE PREDICT UPDATE
P(ST | x0:T-1) = SST-1 P(ST-1 | x0:T-1) P(ST|ST-1) P(ST | x0:T) = C. P(ST | x0:T-1) P(xT|ST)
P(x0:T,ST) = P(xT|ST) SST-1 P(x0:T-1, ST-1) P(ST|ST-1)
P(x0:T-1,ST) = SST-1 P(x0:T-1, ST-1) P(ST|ST-1)
P(x0:T,ST) = P(xT|ST) P(x0:T-1,ST)
14 Nov 2013 11-755/18797 19
14 Nov 2013 11-755/18797 20
Estimate(ST) Predict the distribution of the state at T Update the distribution of the state at T after observing xT T=T+1
Estimate(ST) = argmax STP(ST | x0:T) P(ST | x0:T-1) = SST-1 P(ST-1 | x0:T-1) P(ST|ST-1) P(ST | x0:T) = C. P(ST | x0:T-1) P(xT|ST)
– P(xT|x0:T-1) = SST P(xT|ST) P(ST|x0:T-1)
14 Nov 2013 11-755/18797 21
Predict P(xT|x0:T-1) Predict the distribution of the state at T Update the distribution of the state at T after observing xT T=T+1 Predict xT
P(ST | x0:T-1) = SST-1 P(ST-1 | x0:T-1) P(ST|ST-1) P(ST | x0:T) = C. P(ST | x0:T-1) P(xT|ST)
14 Nov 2013 11-755/18797 22
14 Nov 2013 11-755/18797 23
14 Nov 2013 11-755/18797 24
– st is the state of the system at time t – et is a driving function, which is assumed to be random
the previous time instant and the driving term at the current time
– ot is the observation at time t – gt is the noise affecting the observation (also random)
the system and the noise
14 Nov 2013 11-755/18797 25
1 t t t
t t t
seen
– The state is the position of navlab or the star
way of knowing about the state
– Sensor readings (for navlab) or recorded image (for the telescope)
t t t
1 t t t
14 Nov 2013 11-755/18797 27
14 Nov 2013 11-755/18797 28
14 Nov 2013 11-755/18797 29
t t t
t t t
g t s g t t
) , ( : ) , (
g g g g g
14 Nov 2013 11-755/18797 30
t t t
t t t
g t s g t t
) , ( : ) , (
g g g g g ) ( ) ( ) 1 ( ) ( ) ( ) 1 ( ... ) 1 ( ) 1 ( | ) ( |
) , (
n n
t t t t t s g
t
g g g g
g
g
g
t t s g
t
| ) ( |
) , (
14 Nov 2013 11-755/18797 31
} { 1 } { 1 1
s s
1 t t t
14 Nov 2013 11-755/18797 32
P(s1 | O0,O1) C P(s1 | O0) P(O1|s1) P(s1| O0 ,O1) C P(s1| O0) P(O1| s1) Update after O1:
P(s0) P(s) P(s0 | O0) C p (s0)P(O0| s0) Update after O0: Prediction at time 1:
) | ( ) O | ( ) O | (
1 1 s
s s P s P s P
1 1
) | ( ) O | ( ) O | ( ds s s P s P s P
P(s0) p (s0) P(s0| O0) C P(s0) P(O0| s0) Prediction at time 0: P(s) s
p
0.2 0.3 0.4 0.1 1 2 3 1 2 3
t t t
1 t t t
Update after Ot:
Prediction at time t:
1
) | ( ) O | ( ) O | (
1 1
: 1 1
:
t
s t t t t
s s P s P s P
1 1 1
: 1 1
:
) | ( ) O | ( ) O | (
t t t t t
ds s s P s P s P
1
: t : t t t t
t t t
1 t t t
1
: t : t t t t
1 2 3
Initial state prob.
Parameters
1 t t s
t t t
1 t t t
) | ( } {
1
i s j s P T
t t ij
Transition prob Observation prob
1 2 3
14 Nov 2013 11-755/18797 36
t t t t
t t t t
1
e e e e
m e m e p e
1
5 . exp | | ) 2 ( 1 ) (
T d
P
g g g g
m g m g p g
1
5 . exp | | ) 2 ( 1 ) (
T d
P
14 Nov 2013 11-755/18797 37
T d
1
t t t t
1 t t t t
– Since the only uncertainty is from the noise
14 Nov 2013 11-755/18797 38
t t t t
g g
g g
t t t t t
14 Nov 2013 11-755/18797 39
g g
g g
t t t t
g g
14 Nov 2013 11-755/18797 40
) ( ) ( 5 . exp ) ( ) ( 5 . exp
1 2 1 1
Bs
s s R s s C
T T
m m
1 1 1 1 1 1 1 1
T T T
Not a good estimate --
14 Nov 2013 11-755/18797 41
g g
0 o
1 1 1 1 1 1 1 1
, ) ( ;
B B R
s R B B R s Gaussian
T T T g g g g
m
ˆ , ˆ ; ) | ( R s s Gaussian
P
14 Nov 2013 11-755/18797 42
e e
1 1 e e
t t t t t
t t t t
1
14 Nov 2013 11-755/18797 43
dx b Ax y b Ax y C x x C dx b Ax y Gaussian x Gaussian
y T x x T x y x x
) ( ) ( 5 . exp ) ( ) ( 5 . exp ) , ; ( ) , ; (
1 2 1 1
m m m
T x y x
Gaussian
14 Nov 2013 11-755/18797 44
y x x
T x y x
x x
g
T x y x
14 Nov 2013 11-755/18797 45
1 1 1
1 1 1 e e
1 1 1
e e
T
1 1 1 1 1
e e
t t t t
1
14 Nov 2013 11-755/18797 46
1 1 1 1 1 g g
1 1 1 1 : 1
T
1 1 1 1 1
e e
14 Nov 2013 11-755/18797 47
1 1 1 1 1 1 1 1
, ) ( ;
t T t t t T t t t t T t t t
B B R
s R B B R s Gaussian
g g g g
m
t t t t t
:
) | (
: 0 t t o
s P
T t t t t t t t t
1 1 1 :
e e
t t t t t
1 :
t t t t
1 t t t t
P(s0| O0) C P(s0) P(O0| s0)
1 1
) | ( ) O | ( ) O | ( ds s s P s P s P
P(s1| O0:1) C P(s1| O0) P(O1| s0)
1 1 2 1 : 1 1 : 2
) | ( ) O | ( ) O | ( ds s s P s P s P
P(s2| O0:2) C P(s2| O0:1) P(O2| s2) All distributions remain Gaussian P(s) P(st|st-1) P(Ot|st) P(s0) P(s)
a priori Transition prob. State output prob
t t t t
t t t t
1
14 Nov 2013 11-755/18797 49
e
1 1 :
t t t t t
) ( )] | ( [ ˆ
1 1 1 1 1 : g g g
m
t T t t t t T t t t t t
s R B B R
P mean s
14 Nov 2013 11-755/18797 50
) ( )] | ( [ ˆ
1 1 1 1 1 : g g g
m
t T t t t t T t t t t t
s R B B R
P mean s
– The slice in the figure is Gaussian
– Uncertainty is reduced
12 Nov 2013 11755/18797 51
) ), ( ( ) | (
1 1 xy xx T yx yy x xx yx y
C C C C x C C N x y P
m m
) , ( ) , , (
, , , , , ,
yy yx xy xx y x
x y
k k k k k k
C C C C N k P m m
14 Nov 2013 11-755/18797 52
1 1 1 1 1 1 1 1 1 1 1 1 1 1
B A B C A B B A B C B A B C B A A B B A B C B A A C B B A
T T T T T T T
14 Nov 2013 11-755/18797 53
Y X Z
YY T XY XY XX Z
1 1 1 1 1 1 1 1 1 1 1 1 1 1 XY XX T XY YY XX T XY XY XX T XY YY XY XX T XY XY XX XX T XY XY XX T XY YY XY XX XX Z
C C C C C C C C C C C C C C C C C C C C C C C C C C
14 Nov 2013 11-755/18797 54
Y X Z
YY T XY XY XX Z
Z Z T Z
1
) ( ) (
1 1 1 1 X XX YX Y XY XX T XY YY T X XX YX Y
X C C Y C C C C X C C Y m m m m
1 1 1 1 1 1 1 1 1 1 1 1 1 1 XY XX T XY YY XX T XY XY XX T XY YY XY XX T XY XY XX XX T XY XY XX T XY YY XY XX XX Z
C C C C C C C C C C C C C C C C C C C C C C C C C C
14 Nov 2013 11-755/18797 55
) ( ) ( 5 .
1 1 1 1 X XX YX Y XY XX T XY YY T X XX YX Y
X C C Y C C C C X C C Y m m m m
Z Z T Z
1
) ( ) ( 5 . exp
1 1 1 1 X XX YX Y XY XX T XY YY T X XX YX Y
X C C Y C C C C X C C Y K m m m m
XY XX T XY YY X XX YX Y
1 1
– The slice in the figure is Gaussian
– Uncertainty is reduced
12 Nov 2013 11755/18797 56
) ), ( ( ) | (
1 1 xy xx T yx yy x xx yx y
C C C C x C C N x y P
m m
) , ( ) , , (
, , , , , ,
yy yx xy xx y x
x y
k k k k k k
C C C C N k P m m
14 Nov 2013 11-755/18797 57
1 :
t
e e p g
g g 1
5 . exp | | ) 2 ( 1 ) (
T d
P Assuming g is 0 mean Dropping subscript t and o0:t-1 for brevity
O is a linear function of s
Hence O is also Gaussian
O O
14 Nov 2013 11-755/18797 58
1 :
t
g
g
T
1 : g
T t
s
,
14 Nov 2013 11-755/18797 59
O
) , ; ( ) (
g
g g Gaussian P
O O
O
14 Nov 2013 11-755/18797 60
O O
s s B
O
m
) , ; ( ) (
g
g g Gaussian P
s s
s
C C C C
, , , ,
R RB BR BRB
T T O g
g
T
s
,
O
14 Nov 2013 11-755/18797 61
O O
O
) , ; ( ) (
g
g g Gaussian P
R RB BR BRB
T T O g
14 Nov 2013 11-755/18797 62
1 : 1 : O O t t
T T T 1
g
14 Nov 2013 11-755/18797 63
XY XX T XY YY X XX YX Y
1 1
g
T
s
,
T T T T 1 1
g g
:
t
T T 1
g
14 Nov 2013 11-755/18797 64
Note that we are not computing g
t t
t
: 1 : 1
| | :
t T T T T
t
1 1 |
: 1
g g
T T
t
1 |
: 1
g
14 Nov 2013 11-755/18797 65
e
1 1 :
t t t t pred t t
t T t t t T t t t t T t t t T t t t t
R B B R s B B R B B R I
s
1 1 1 : 1
) ( ) ) ( ( | ˆ
g g
t t t t
1 t t t t
14 Nov 2013 11-755/18797 66
e
1 1 :
t t t t pred t t
t t T t t t T t t t t
1
g
T t t t t
1
e
t T t t t T t t t t T t t t T t t t
1 1
g g
14 Nov 2013 11-755/18797 67
e
1
t t
t t t t
T t t t t
1
e
t t t t t t
1
g T t t t T t t t
t t t t
1 t t t t
14 Nov 2013 11-755/18797 68
14 Nov 2013 11-755/18797 69
t t t t
t t t t
1
14 Nov 2013 11-755/18797 70
14 Nov 2013 11-755/18797 71
14 Nov 2013 11-755/18797 72
t t t
1 t t t
14 Nov 2013 11-755/18797 73
t t t
1 t t t