where ht+n is the habit stock, and all other variables are as usually defined. Note that here “habit formation” refers to something different from the colloquial usage of the term: it is not about developing behavioral routines or rules of thumb for saving. Rather, it refers to preferences in which past consumption affects current utility. Utility is decreasing in the habit stock (uh<0): higher past consumption raises the reference point against which current consumption is compared, making any given level of current consumption less satisfying. This utility function is not “time separable” in the original sense, but once habits are included as a state variable it becomes time-separable. The DBC is
To clarify the workings of the Envelope theorem in the case with two state variables, let’s define a function vt that represents value for any choice of ct, not just the optimal one:
The first order condition for (5) with respect to ct is (dropping arguments for brevity and denoting the derivative of f with respect to x at time t as ftx):
The intuition is as follows. Note first that if utility is not affected by habits, then vt+1h=0 and equation (9) reduces to the usual first order condition for consumption, which tells us that increasing consumption by ϵ today and reducing it by Rϵ in the next period must not change expected discounted utility. With habits, an increase in consumption today has a consequence beyond its effect on tomorrow’s resources mt+1: tomorrow’s habit stock will be changed as well. An increase in consumption today of size ϵ increases the size of the habit stock which tomorrow’s consumption is compared to, and therefore reduces tomorrow’s utility by an amount corresponding to the marginal utility of higher habits tomorrow vt+1h. Since vt+1h is negative (higher habits make utility lower), this tells us that the RHS of equation (9) will be a larger positive number than it would be without habits. This means that the level of ct that satisfies the first order condition will be a lower number (higher marginal utility) than before. Hence, habits increase the willingness to delay spending, and increase the saving rate.
Note that the first order condition also implies that
Now consider the total derivative of vt(mt,ht,ct(mt,ht)) with respect to mt. (To reduce clutter, I will write dct(mt,ht)/dmt as dct/dmt). The chain rule tells us that
The Envelope theorem is the shortcut way to obtain this conclusion. The clearest way to use the theorem is by taking the partial derivatives of the vt function with respect to each of its three arguments, using the Chain Rule to take into account the possible dependency of ht and ct on mt:
where the Envelope theorem is what tells you that the ∂vt/∂ct term is equal to zero because you are evaluating the function at ct=ct(mt,ht) (and ∂ht/∂mt is zero by the assumed structure of the problem in which ht is predetermined).
There is a potentially confusing thing about doing it this way, however: when you reach an expression like (14) it is tempting to think to yourself as follows: “ct is a function of mt, and ht+1=ct is also indirectly a function of mt, so the chain rule tells me that when I take the derivative in (14) I need to keep track of these.” In fact, you must treat ∂ct/∂mt and ∂ht+1/∂mt as zero here. The reason is that this is a partial derivative with respect to mt. The dependence of ct (and indirectly ht+1) on mt has already been taken care of in the two terms in (13) that were equal to zero. The confusion here is caused largely by the fact that partial differentiation is an area where standard mathematical notation is basically confusing and poorly chosen.[1]
The shortest way to obtain the end result is, as in the single variable problem, to start with Bellman’s equation and take the partial derivative with respect to mt directly (treating the problem as though ct were a constant):
The intuition for this is as follows. The marginal value of wealth must be equal to the marginal value associated with a tiny bit more consumption. In the presence of habits, the extra consumption yields extra utility today utc but affects value next period by vt+1h (which is a negative number), the discounted consequence of which from today’s perspective is the βvt+1h term.
In a problem with two state variables, the Envelope theorem can be applied to each state (and indeed in general must be applied in order to solve the model).
Again let’s start the brute force way by working through the total derivative of vt. For this problem, the total derivative (again denoting dct(mt,ht)/dht as dct/dht) is:
since ht appears directly only in the u(ct,ht) part of vt(mt,ht,ct). And once again, the shortest way to the answer is to treat ct as though it were a constant in the value function, which yields
so marginal utility grows at rate 1/Rβ. Note that if we assume α=0 so that habits do not matter, we again obtain the standard result that u′(ct)=Rβu′(ct+1).
Now make the final assumption that f(z)=z1−ρ/(1−ρ), implying of course that f′(z)=z−ρ. Equation (28) can be rewritten
where (31) follows from (30) because ct+1/ct=1+(ct+1−ct)/ct≈1+Δlogct+1 and ct−1/ct=(ct−(ct−ct−1))/ct≈1−Δlogct, and (34) follows from (33) because for small η and ϵ, (1+η)/(1+ϵ)≈1+η−ϵ.
Google the string “partial differentiation confusing OCW” to find a fuller description of the problems of standard notation on partial differentiation.