BUS41100 Applied Regression Analysis
Week 6: An Introduction to Time Series
Dependent data, autocorrelation, AR and periodic regression models

Max H. Farrell
The University of Chicago Booth School of Business
Time series data and dependence

Time-series data are simply a collection of observations gathered over time. For example, suppose y_1, . . . , y_T are
◮ annual GDP,
◮ quarterly production levels,
◮ weekly sales,
◮ daily temperature,
◮ 5-minute stock returns.
In each case, we might expect what happens at time t to be correlated with time t − 1.
Suppose we measure temperatures, daily, for several years. Which would work better as an estimate for today's temp:
◮ The average of the temperatures from the previous year?
◮ The temperature on the previous day?
How would this change if the readings were iid N(µ, σ²)? (See the simulation sketch below.)

Correlated errors require fundamentally different techniques.
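A minimal simulation sketch of this intuition. The "sticky" process below, its coefficient 0.7, its mean of 30, and its noise level are all invented for illustration; nothing here is estimated from data.

> ## "sticky" temperatures: each day is pulled toward yesterday's value
> set.seed(4)
> n <- 365
> temp <- numeric(n); temp[1] <- 30
> for (t in 2:n) temp[t] <- 0.3*30 + 0.7*temp[t-1] + rnorm(1, sd=5)
> mean((temp[2:n] - temp[1:(n-1)])^2)  # MSE: predict with yesterday
> mean((temp[2:n] - mean(temp))^2)     # MSE: predict with the average

Here yesterday wins. If instead the readings were iid N(µ, σ²), predicting with yesterday's value would have expected squared error 2σ², twice that of the long-run average, and the ranking would flip.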
Example: Y_t = average daily temp. at O'Hare, Jan-Feb 1997.

> weather <- read.csv("weather.csv")
> plot(weather$temp, xlab="day", ylab="temp", type="l",
+      col=2, lwd=2)

[Figure: line plot of temp (0 to 50) against day (0 to 60).]

◮ "sticky" sequence: today tends to be close to yesterday.
Example: Y_t = monthly U.S. beer production (Mi/barrels).

> beer <- read.csv("beer.csv")
> plot(beer$prod, xlab="month", ylab="beer", type="l",
+      col=4, lwd=2)

[Figure: line plot of beer production against month, over 72 months.]

◮ The same pattern repeats itself year after year.
> plot(rnorm(200), xlab="t", ylab="Y_t", type="l",
+      col=6, lwd=2)

[Figure: line plot of 200 iid N(0,1) draws against t.]

◮ It is tempting to see patterns even where they don't exist.
Checking for dependence

To see if Y_{t−1} would be useful for predicting Y_t, just plot them together and see if there is a relationship.

[Figure: "Daily Temp at O'Hare", scatterplot of temp(t) against temp(t−1), Corr = 0.72.]

◮ Correlation between Y_t and Y_{t−1} is called autocorrelation.
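Such a lagged scatterplot takes only a couple of lines of R. A sketch, reusing the weather data loaded above:

> n <- length(weather$temp)
> plot(weather$temp[1:(n-1)], weather$temp[2:n],
+      xlab="temp(t-1)", ylab="temp(t)")
> cor(weather$temp[2:n], weather$temp[1:(n-1)])  # approx 0.72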
We can plot Y_t against Y_{t−ℓ} to see ℓ-period lagged relationships.

[Figure: scatterplots of temp(t) against temp(t−2), Lag 2 Corr = 0.46, and against temp(t−3), Lag 3 Corr = 0.21.]

◮ It appears that the correlation is getting weaker with increasing ℓ.
Autocorrelation

To summarize the time-varying dependence, compute lag-ℓ correlations for ℓ = 1, 2, 3, . . .

In general, the autocorrelation function (ACF) for Y is

    r(ℓ) = cor(Y_t, Y_{t−ℓ})

For our O'Hare temperature data:

> print(acf(weather$temp))
     0     1     2     3     4     5     6     7     8
  1.00  0.71  0.44  0.20  0.07  0.09  0.04 -0.01 -0.09
     9    10    11    12    13    14    15    16    17
 -0.10 -0.07  0.03  0.05 -0.01 -0.06 -0.06  0.00  0.10
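The formula can be checked by hand. A sketch (acf() centers both copies of the series at the full-sample mean and uses a common denominator, so its values differ slightly from plain cor()):

> y <- weather$temp
> n <- length(y)
> cor(y[2:n], y[1:(n-1)])  # close to the lag-1 acf value of 0.71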
R's acf function shows the ACF visually.

[Figure: ACF plot for weather$temp, lags 0 to 15.]

It provides a visual summary of our data dependence.
(Blue lines mark "statistical significance" for the acf values.)
The beer data shows an alternating dependence structure, which causes time series oscillations.

[Figure: ACF plot for beer$prod, lags 0 to 30, with spikes alternating between positive and negative.]
An acf plot for iid normal data shows no significant correlation.

[Figure: ACF plot for rnorm(40).]

. . . but what about next time?
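The blue bands sit at roughly ±2/√T, so any single lag computed from iid data will cross them about 5% of the time purely by chance. A quick simulation sketch of that false-positive rate:

> set.seed(2)
> mean(replicate(1000, {
+   r <- acf(rnorm(40), plot=FALSE)$acf[-1]  # drop the lag-0 value of 1
+   mean(abs(r) > 2/sqrt(40))                # share of lags outside the bands
+ }))  # roughly 0.05

With many lags on one plot, a few spuriously "significant" spikes are therefore expected even for iid data.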
Autoregression

The autoregressive model of order one holds that

    AR(1): Y_t = β_0 + β_1 Y_{t−1} + ε_t,   ε_t ~ iid N(0, σ²).

This is just an SLR model of Y_t regressed onto lagged Y_{t−1}.
◮ Y_t depends on errors going all the way back to the beginning, but the whole past is captured by only Y_{t−1}.

It assumes all of our standard regression model conditions.
◮ The residuals should look iid and be uncorrelated with Ŷ_t.
◮ All of our previous diagnostics and transforms still apply.
AR(1): Y_t = β_0 + β_1 Y_{t−1} + ε_t

Again, Y_t depends on the past only through Y_{t−1}.
◮ Previous lag values (Y_{t−2}, Y_{t−3}, . . .) do not help predict Y_t if you already know Y_{t−1}.

Think about daily temperatures:
◮ If I want to guess tomorrow's temperature (without the help of a meteorologist!), it is sensible to base my prediction on today's temperature, ignoring yesterday's.

Other examples: consumption, stock prices, . . .
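It is easy to simulate an AR(1) and watch this stickiness appear. A minimal sketch; β_0 = 5, β_1 = 0.7, and σ = 2 are chosen purely for illustration:

> set.seed(1)
> n <- 200
> y <- numeric(n)
> y[1] <- 5/(1 - 0.7)  # start near the long-run mean beta0/(1 - beta1)
> for (t in 2:n) y[t] <- 5 + 0.7*y[t-1] + rnorm(1, sd=2)
> plot(y, type="l", xlab="t", ylab="Y_t")
> acf(y)  # geometric decay: r(l) is roughly 0.7^l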
For the O'Hare temperatures, there is a clear autocorrelation.

> tempreg <- lm(weather$temp[2:59] ~ weather$temp[1:58])
> summary(tempreg)  ## abbreviated output

Coefficients:
                   Estimate Std. Error t value Pr(>|t|)
(Intercept)         6.70580    2.51661   2.665   0.0101 *
weather$temp[1:58]  0.72329    0.09242   7.826  1.5e-10 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 8.79 on 56 degrees of freedom
Multiple R-squared: 0.5224, Adjusted R-squared: 0.5138
F-statistic: 61.24 on 1 and 56 DF, p-value: 1.497e-10

◮ The autoregressive term (b_1 ≈ 0.7) is highly significant!
We can check residuals for any "left-over" correlation.

> acf(tempreg$residuals)

[Figure: ACF plot for tempreg$residuals, lags 0 to 15.]

◮ Looks like we've got a good fit.
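With a clean fit, one-step-ahead forecasting just plugs today's value into the fitted line. A sketch that refits the model with named columns so that predict() works cleanly (tempreg above used inline subscripts):

> n <- length(weather$temp)
> d <- data.frame(temp=weather$temp[2:n], lag1=weather$temp[1:(n-1)])
> ar1 <- lm(temp ~ lag1, data=d)
> predict(ar1, newdata=data.frame(lag1=weather$temp[n]),
+         interval="prediction")  # forecast (with interval) for day n+1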
For the beer data, the autoregressive term is also highly significant.

> beerreg <- lm(beer$prod[2:72] ~ beer$prod[1:71])
> summary(beerreg)  ## abbreviated output

Coefficients:
                Estimate Std. Error t value Pr(>|t|)
(Intercept)     10.64818    3.56983   2.983  0.00395 **
beer$prod[1:71]  0.69960    0.08748   7.997 2.02e-11 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 14.08 on 69 degrees of freedom
Multiple R-squared: 0.481, Adjusted R-squared: 0.4735
F-statistic: 63.95 on 1 and 69 DF, p-value: 2.025e-11
But residuals show a clear pattern of left-over autocorrelation.

> acf(beerreg$residuals)

[Figure: ACF plot for beerreg$residuals, lags 0 to 30, with a strong oscillating pattern.]

◮ We'll talk later about how to model this type of pattern . . . (a preview sketch follows below).
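As a preview of the periodic regression models named in this week's title, one standard remedy is to add month-of-year dummies next to the lag term. A sketch only: it assumes beer$prod is monthly with observation 1 falling in January, an alignment not stated anywhere in the data.

> n <- length(beer$prod)
> month <- factor(((1:n - 1) %% 12) + 1)  # hypothetical month index
> beerreg2 <- lm(beer$prod[2:n] ~ beer$prod[1:(n-1)] + month[2:n])
> acf(beerreg2$residuals)  # check whether the oscillation is gone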
Many different types of series may be written as an AR(1).

    AR(1): Y_t = β_0 + β_1 Y_{t−1} + ε_t

The value of β_1 is key!
◮ If |β_1| > 1, the series explodes.
◮ If |β_1| = 1, we have a random walk.
◮ If |β_1| < 1, the values are mean reverting.

Not only does the behavior of the series depend on β_1, but so does the sampling distribution of b_1! (A simulation sketch of the three regimes follows.)
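A simulation sketch of the three regimes, with β_0 = 0 and unit-variance errors chosen for simplicity:

> sim.ar1 <- function(b1, n=200) {
+   y <- numeric(n)
+   for (t in 2:n) y[t] <- b1*y[t-1] + rnorm(1)
+   y
+ }
> set.seed(3)
> par(mfrow=c(1,3))
> plot(sim.ar1(1.05), type="l", main="explodes")
> plot(sim.ar1(1.00), type="l", main="random walk")
> plot(sim.ar1(0.80), type="l", main="mean reverting")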
Exploding series

For an AR term β_1 > 1, the Y_t's move exponentially far from Y_1.

[Figure: simulated series xs with β_1 = 1.05, growing past 80,000 by t = 200.]

◮ What does prediction mean here?
Autocorrelation of an exploding series is high for a long time.

[Figure: ACF plot for the exploding series xs, lags 0 to 40.]