Lecture 3: Euler-Equation Estimation Simon Gilchrist Boston Univerity and NBER EC 745 Fall, 2013
Euler equation tests This lecture considers some further implications and tests of the “standard” asset pricing model: for any gross return R t +1 , we have the equation: � βu ′ ( C t +1 ) � = 1 , E t R t +1 u ′ ( C t ) Hence for any gross return R t +1 , we have � βu ′ ( C t +1 ) �� � R t +1 − R f = 0 . E t t +1 u ′ ( C t ) These equations hold for any asset which you can buy or sell (at the margin): stocks, bonds, options, commodities, etc. Key intuition: assets are priced according to their covariance with consumption growth. Assets which pay off when consumption growth is high (in good times) are risky, hence they will need to have high expected returns for investors to want to hold them.
Euler equation methods Earlier lecture studies the implication of the Euler equation, plus an assumption that consumption growth is iid and dividends = consumption. Euler equation methods requires very few assumptions - no need to specify the shocks, the technology (as you would in a GE model), or the endowment process. The logic of Euler equation methods applies to models others than the CRRA representative agent model. Example: Suppose utility depends on c t and other observable x t , say U ( c t , x t ) , then we have M t +1 = βU 1 ( c t +1 , x t +1 ) , U 1 ( c t , x t ) and E t ( M t +1 R t +1 ) = 1 can similarly be tested.
Euler equation tests Specify u ( C ) = C 1 − γ 1 − γ . For any variable Z t known at time t , we have: � � � − γ � C t +1 R t +1 − 1 = 0 , Z t E t β C t Since Z t known at t : � � �� � − γ � C t +1 E t Z t β R t +1 − 1 = 0 C t Take unconditional expectations: � � �� � − γ � C t +1 E Z t β R t +1 − 1 = 0 . C t This is a set of restrictions that the theory imposes on the joint distribution of consumption growth, returns and Z t .
Alternative interpretation Express the Euler equation as: � βu ′ ( C t +1 ) � R t +1 − 1 = ε t +1 , u ′ ( C t ) where � βu ′ ( C t +1 ) � � βu ′ ( C t +1 ) � ε t +1 = R t +1 − 1 − E t R t +1 − 1 u ′ ( C t ) u ′ ( C t ) By construction ε t +1 is a forecast error – the difference between realized and expected values. Therefore it is uncorrelated with any variable known at time t : E ( Z t ε t +1 ) = 0 No need for distributional assumption, except that returns and consumption growth must be stationary.
Instrument Relevance Any stationary variable Z t realized at time t is a valid instrument. if Z t is uncorrelated with future returns and consumption, then it does not add any restriction since � � �� � − γ � C t +1 0 = E Z t β R t +1 − 1 C t � � � − γ � C t +1 = E ( Z t ) E R t +1 − 1 β . C t Instrument relevance: Z t must forecast future returns or future consumption growth.
Empirical Implementation This equation holds for any returns R t +1 , Z t , so we have NK equations, where N = # of returns and K = # of “instruments” Z t , and we have 2 parameters ( β, γ ). Intuitively, we simply pick β and γ to set the sample version of these equations equal to zero, i.e. T � � � − γ 1 � C t +1 � R t +1 − 1 = 0 . Z t β T C t t =1
Exact identification: Let Z t = 1 and apply to excess aggregate stock returns: choose γ such that T � − γ � 1 � C t +1 � t +1 − R f � R e = 0 . t +1 T C t t =1 Note: this may not be possible! i.e. with the data there may not be a γ s.t. this holds. Let Z t = 1 find β and γ to match both stock returns and the risk-free rate: T � � � − γ � C t +1 1 � R e β t +1 − 1 = 0 , T C t t =1 T � � − γ � 1 � C t +1 � R f β t +1 − 1 = 0 . T C t t =1 Two eqns in two unknowns. Again, this may not be possible to find both β and γ in sample.
Over-identification If more than equations, then of course (generically) cannot set all the equations equal to zero. In practice minimize the weighted errors: β,γ g ′ min T W T g T , � � � − γ � � T C t +1 where g T = 1 t =1 Z t β R t +1 − 1 and W T is a T C t weighting matrix. Can pick W T = I, or W T so that moments have the same scale, or choose W in a statistically optimal way. Formula for W T is the inverse of the asymptotic variance of the moment vector: ��� − 1 � � T � � − γ 1 � C t +1 � W = V ar β R t +1 − 1 T C t t =1
Over-identifying tests Overidentification: if NK > # of parameters the system is over-identified. � � � − γ � � T C t +1 Test that 1 t =1 Z t β R t +1 − 1 is close to zero 0 . T C t
GMM general formulas Express a model as E [ f ( x t , b )] = 0 The general GMM estimate ˆ b is given by: a T g T (ˆ b ) = 0 , where T g T ( b ) ≡ 1 � f ( x t , b ) , T t =1 and a T is a matrix that defines which linear combination of g T ( b ) will be set to zero.
Asymptotics The asymptotic distribution of the GMM estimate is: √ T (ˆ b − b ) → N [0 , ( ad ) − 1 aSa ′ ( ad ) − 1 ′ ] , where � ∂f � = ∂g T ( b ) d ≡ E ∂b ′ ( x t , b ) , ∂b ′ a ≡ a T , ∞ � E [ f ( x t , b ) f ( x t − j , b ) ′ ] . S ≡ j = −∞ Hansen (1982) gives the sampling distribution of the moments g T ( b ) : √ Tg T (ˆ b ) → N [0 , ( I − d ( ad ) − 1 a ) S ( I − d ( ad ) − 1 a ) ′ ] .
Efficieny The efficient estimate is obtained by setting: a = d ′ S − 1 In this case, √ T (ˆ b − b ) → N [0 , ( d ′ S − 1 d ) − 1 ] The quadratic objective function provides a test of over-identifying restrictions: TJ T = Tg T (ˆ b ) ′ S − 1 g T (ˆ b ) = χ 2 ( NK − x )
Testing parameter restrictions Suppose we estimate a model that imposes R restrictions on b . We can test these restrictions using: T ( J T,R − J T ) = χ 2 ( R ) . where J T,R is the value of objective estimated under the restriction. Comment: when imposing the restrictions we should use the unrestricted weighting matrix. Otherwise the test statistic has a non-central χ 2 distribution. The test statistic is constructed as a quadratic function of the difference between the two moment vectors: ( g T,R − g T ) ′ W T ( g T,R − g T ) If we used alternative weighting matrices the difference in the J statistics is not guaranteed to be positive.
Ordinary Least Squares β solves: Min β E T [( y t − β ′ x t ) 2 ] In this case: x t ( y t − β ′ x t ) = x t ε t , f ( x t , β ) = g T ( β ) = E T [ x t ( y t − β ′ x t )] , a T = I, = − E T ( x t x ′ t ) , d ˆ E T [ x t x ′− 1 β = E T ( x t y t ) t ∞ 1 var (ˆ � T E T [ x t x ′− 1 t − j )] E T [ x t x ′− 1 E ( ε t x t x ′ t − j ε ′ β ) = [ t t j = −∞
An implication of Euler equations With Euler equations we have T g T ( b ) = 1 � Z t ε t +1 ( b ) T t =1 We then choose b to minimize the quadratic g T ( b ) W T g T This is equivalent to choosing b so that d ( g T ( b )) ′ W T g T = 0 db In this case, the optimal weighting matrix is W T = V − 1 g T where � g T ( b ) g T ( b ) ′ � V g T = E
Variance-covariance matrix Note that in Euler equation case: � � E ( f ( x t , b ) f ( x t − s , b )) = E Z t ε t +1 ε ′ t − s +1 Z ′ t − s If we confine attention to one period returns then ε t is serially uncorrelated and this exprssion is zero for s � = 0 . In this case, the variance-covariance matrix can be constructed as T = 1 � � g T ( b ) g T ( b ) ′ � Z t ε t +1 ε ′ t +1 Z ′ E t T t =1 If we use multi-period returns then we induce serial correlation and we need a Newey-West style estimator of the variance of the moment matrix.
Recommend
More recommend