Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Consumption Models with Habit Formation

Authors
Affiliations
Johns Hopkins University
Econ-ARK
Johns Hopkins University
Econ-ARK

1The Problem

Consider a consumer whose goal at date tt is to solve the problem (this section is a simplified version of Carroll (2000)):

maxn=0Ttβnu(ct+n,ht+n)\max \sum_{n=0}^{T-\tNow} \Discount^{n} \uFunc(c_{t+n},\habit_{t+n})

where ht+n\habit_{t+n} is the habit stock, and all other variables are as usually defined. Note that here “habit formation” refers to something different from the colloquial usage of the term: it is not about developing behavioral routines or rules of thumb for saving. Rather, it refers to preferences in which past consumption affects current utility. Utility is decreasing in the habit stock (uh<0\uFunc^{h}<0): higher past consumption raises the reference point against which current consumption is compared, making any given level of current consumption less satisfying. This utility function is not “time separable” in the original sense, but once habits are included as a state variable it becomes time-separable. The DBC is

mt+1=(mtct)R+yt+1.\mRat_{t+1} = (\mRat_{t}-c_{t})\Rfree + y_{t+1}.

However, when habits affect utility we must also specify a process that describes how habits evolve over time. Our assumption will be:

FirstYearVersion=true:

ht+1=ct.\habit_{t+1} = c_{t}.

FirstYearVersion=false:

ht+1=ht+λ(ctht).\habit_{t+1} = \habit_{t} + \lambda (c_{t} - \habit_{t}).

Bellman’s equation for this problem is therefore

vt(mt,ht)=max{ct}  u(ct,ht)+βvt+1((mtct)R+yt+1,ct).\vFunc_{t}(\mRat_{t},\habit_{t}) = \max_{\{c_{t}\}} ~~\uFunc(c_{t},\habit_{t}) + \Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},c_{t}).

To clarify the workings of the Envelope theorem in the case with two state variables, let’s define a function vt\underline{\vFunc}_{t} that represents value for any choice of ctc_{t}, not just the optimal one:

vt(mt,ht,ct)=u(ct,ht)+βvt+1((mtct)R+yt+1,ct)\underline{\vFunc}_{t}(\mRat_{t},\habit_{t},c_{t}) = \uFunc(c_{t},\habit_{t}) +\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},c_{t})

and define the function ct(mt,ht)\mathbf{c}_{t}(\mRat_{t},\habit_{t}) as the choice of ctc_{t} that solves the maximization (5), so that we have

vt(mt,ht)=vt(mt,ht,ct(mt,ht)).\vFunc_{t}(\mRat_{t},\habit_{t}) = \underline{\vFunc}_{t}(\mRat_{t},\habit_{t},\mathbf{c}_{t}(\mRat_{t},\habit_{t})).

2Optimality Conditions

2.1The First Order Condition

The first order condition for (5) with respect to ctc_{t} is (dropping arguments for brevity and denoting the derivative of ff with respect to xx at time tt as ftxf^{x}_{t}):

0=utc+β(vt+1hRvt+1m)0 = \uFunc^{c}_{t} + \Discount \left(\vFunc^{h}_{t+1} - \Rfree \vFunc^{\mRat}_{t+1} \right)

or, equivalently,

utc=β(Rvt+1mvt+1h).\uFunc^{c}_{t} = \Discount \left(\Rfree \vFunc_{t+1}^{\mRat}- \vFunc_{t+1}^{h}\right).

The intuition is as follows. Note first that if utility is not affected by habits, then vt+1h=0\vFunc^{h}_{t+1}=0 and equation (9) reduces to the usual first order condition for consumption, which tells us that increasing consumption by ϵ\epsilon today and reducing it by Rϵ\Rfree \epsilon in the next period must not change expected discounted utility. With habits, an increase in consumption today has a consequence beyond its effect on tomorrow’s resources mt+1\mRat_{t+1}: tomorrow’s habit stock will be changed as well. An increase in consumption today of size ϵ\epsilon increases the size of the habit stock which tomorrow’s consumption is compared to, and therefore reduces tomorrow’s utility by an amount corresponding to the marginal utility of higher habits tomorrow vt+1h\vFunc^{h}_{t+1}. Since vt+1h\vFunc^{h}_{t+1} is negative (higher habits make utility lower), this tells us that the RHS of equation (9) will be a larger positive number than it would be without habits. This means that the level of ctc_{t} that satisfies the first order condition will be a lower number (higher marginal utility) than before. Hence, habits increase the willingness to delay spending, and increase the saving rate.

Note that the first order condition also implies that

dvtdct=0\frac{d \underline{\vFunc}_{t}}{d c_{t}} = 0

when evaluated at ct=ct(mt,ht)c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t}).

2.2Envelope Conditions

Now consider the total derivative of vt(mt,ht,ct(mt,ht))\underline{\vFunc}_{t}(\mRat_{t},\habit_{t},\mathbf{c}_{t}(\mRat_{t},\habit_{t})) with respect to mt\mRat_{t}. (To reduce clutter, I will write dct(mt,ht)/dmtd \mathbf{c}_{t}(\mRat_{t},\habit_{t})/d \mRat_{t} as dct/dmtd \mathbf{c}_{t}/d \mRat_{t}). The chain rule tells us that

dvtdmt=dctdmtutc+dhtdmt=0uth+β((dctdmt)(vt+1hRvt+1m)+Rvt+1m)=(dctdmt)(utc+β(vt+1hRvt+1m))=0+βRvt+1m\begin{aligned} \frac{d \underline{\vFunc}_{t}}{d \mRat_{t}} & = \frac{d \mathbf{c}_{t}}{d \mRat_{t}}\uFunc^{c}_{t}+\overbrace{\frac{d \habit_{t}}{d \mRat_{t}}}^{=0} \uFunc^{h}_{t} + \Discount \left((\frac{d \mathbf{c}_{t}}{d \mRat_{t}})(\vFunc^{h}_{t+1}-\Rfree \vFunc_{t+1}^{\mRat}) + \Rfree \vFunc_{t+1}^{\mRat} \right) \\ & = \left(\frac{d \mathbf{c}_{t}}{d \mRat_{t}}\right) \underbrace{(\uFunc_{t}^{c}+\Discount\left(\vFunc_{t+1}^{h}-\Rfree \vFunc_{t+1}^{\mRat}\right))}_{=0}+\Discount \Rfree \vFunc_{t+1}^{\mRat} \end{aligned}

where the underbraced term vanishes at ct=ct(mt,ht)c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t}) by the first-order condition (8). Thus we have that

vtm=dvtdmtct=ct(mt,ht)=βRvt+1m.\begin{aligned} \vFunc_{t}^{\mRat} & = \frac{d \underline{\vFunc}_{t}}{d \mRat_{t}}|_{c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})} \\ & = \Discount \Rfree \vFunc^{\mRat}_{t+1}. \end{aligned}

The Envelope theorem is the shortcut way to obtain this conclusion. The clearest way to use the theorem is by taking the partial derivatives of the vt\underline{\vFunc}_{t} function with respect to each of its three arguments, using the Chain Rule to take into account the possible dependency of ht\habit_{t} and ctc_{t} on mt\mRat_{t}:

vt(mt,ht)=vt(mt,ht,ct(mt,ht))vtm=vtmt+vththtmt=0+vtct=0ctmt+vtctcththtmt=0\begin{aligned} \vFunc_{t}(\mRat_{t},\habit_{t}) & = \underline{\vFunc}_{t}(\mRat_{t},\habit_{t},\mathbf{c}_{t}(\mRat_{t},\habit_{t})) \\ \vFunc_{t}^{\mRat} & = \frac{\partial \underline{\vFunc}_{t}}{\partial \mRat_{t}} + \frac{\partial \underline{\vFunc}_{t}}{\partial \habit_{t}}\underbrace{\frac{\partial \habit_{t}}{\partial \mRat_{t}}}_{=0} + \underbrace{\frac{\partial \underline{\vFunc}_{t}}{\partial c_{t}}}_{=0}\frac{\partial \mathbf{c}_{t}}{\partial \mRat_{t}}+\frac{\partial \underline{\vFunc}_{t}}{\partial c_{t}}\frac{\partial c_{t}}{\partial \habit_{t}}\underbrace{\frac{\partial \habit_{t}}{\partial \mRat_{t}}}_{=0} \end{aligned}

where the Envelope theorem is what tells you that the vt/ct\partial \underline{\vFunc}_{t}/\partial c_{t} term is equal to zero because you are evaluating the function at ct=ct(mt,ht)c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t}) (and ht/mt\partial \habit_{t}/\partial \mRat_{t} is zero by the assumed structure of the problem in which ht\habit_{t} is predetermined).

Now writing out vt/mt\partial \underline{\vFunc}_{t}/\partial \mRat_{t}, (13) becomes

vtm=mt[βvt+1((mtct)R+yt+1,ht+1)]\vFunc^{\mRat}_{t} = \frac{\partial}{\partial \mRat_{t}}\left[\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},\habit_{t+1})\right]

which the envelope theorem says is equivalent to

vtm=βRvt+1m.\vFunc^{\mRat}_{t} = \Discount \Rfree \vFunc_{t+1}^{\mRat}.

There is a potentially confusing thing about doing it this way, however: when you reach an expression like (14) it is tempting to think to yourself as follows: “ctc_{t} is a function of mt\mRat_{t}, and ht+1=ct\habit_{t+1}=c_{t} is also indirectly a function of mt\mRat_{t}, so the chain rule tells me that when I take the derivative in (14) I need to keep track of these.” In fact, you must treat ct/mt\partial c_{t}/\partial \mRat_{t} and ht+1/mt\partial \habit_{t+1}/\partial \mRat_{t} as zero here. The reason is that this is a partial derivative with respect to mt\mRat_{t}. The dependence of ctc_{t} (and indirectly ht+1\habit_{t+1}) on mt\mRat_{t} has already been taken care of in the two terms in (13) that were equal to zero. The confusion here is caused largely by the fact that partial differentiation is an area where standard mathematical notation is basically confusing and poorly chosen.[1]

The shortest way to obtain the end result is, as in the single variable problem, to start with Bellman’s equation and take the partial derivative with respect to mt\mRat_{t} directly (treating the problem as though ctc_{t} were a constant):

vt(mt,ht)=u(ct,ht)+βvt+1((mtct)R+yt+1,ht+1)vtm(mt,ht)=βRvt+1m(mt+1,ht+1).\begin{aligned} \vFunc_{t}(\mRat_{t},\habit_{t}) & = \uFunc(c_{t},\habit_{t})+\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},\habit_{t+1}) \\ \vFunc_{t}^{\mRat}(\mRat_{t},\habit_{t}) & = \Discount \Rfree \vFunc_{t+1}^{\mRat}(\mRat_{t+1},\habit_{t+1}). \end{aligned}

Whichever way you do it, substituting (15) into the FOC equation (9) gives

vtm=utc+βvt+1h.\vFunc_{t}^{\mRat} = \uFunc^{c}_{t}+\Discount \vFunc_{t+1}^{h}.

The intuition for this is as follows. The marginal value of wealth must be equal to the marginal value associated with a tiny bit more consumption. In the presence of habits, the extra consumption yields extra utility today utc\uFunc^{c}_{t} but affects value next period by vt+1h\vFunc^{h}_{t+1} (which is a negative number), the discounted consequence of which from today’s perspective is the βvt+1h\Discount \vFunc^{h}_{t+1} term.

2.3Envelope Theorem for ht\habit_{t}

In a problem with two state variables, the Envelope theorem can be applied to each state (and indeed in general must be applied in order to solve the model).

Again let’s start the brute force way by working through the total derivative of vt\underline{\vFunc}_{t}. For this problem, the total derivative (again denoting dct(mt,ht)/dhtd \mathbf{c}_{t}(\mRat_{t},\habit_{t})/d {h}_{t} as dct/dhtd \mathbf{c}_{t}/d \habit_{t}) is:

dvtdht=dctdhtutc+uth+β(dht+1dhtvt+1h+dmt+1dhtvt+1m)=dctdhtutc+uth+β(dht+1dctdctdhtvt+1h+dmt+1dctdctdhtvt+1m)=uth+dctdht(utc+β(vt+1hRvt+1m))=0\begin{aligned} \frac{d \underline{\vFunc}_{t}}{d \habit_{t}} & = \frac{d \mathbf{c}_{t}}{d \habit_{t}} \uFunc^{c}_{t} + \uFunc^{h}_{t} + \Discount \left( \frac{d \habit_{t+1}}{d \habit_{t}}\vFunc_{t+1}^{h} + \frac{d \mRat_{t+1}}{d \habit_{t}} \vFunc_{t+1}^{\mRat}\right) \\ & = \frac{d \mathbf{c}_{t}}{d \habit_{t}} \uFunc^{c}_{t} + \uFunc^{h}_{t} + \Discount \left(\frac{d \habit_{t+1}}{d \mathbf{c}_{t}}\frac{d \mathbf{c}_{t}}{d \habit_{t}} \vFunc_{t+1}^{h} + \frac{d \mRat_{t+1}}{d \mathbf{c}_{t}}\frac{d \mathbf{c}_{t}}{d \habit_{t}} \vFunc_{t+1}^{\mRat} \right) \\ & = \uFunc^{h}_{t} + \frac{d \mathbf{c}_{t}}{d \habit_{t}} \underbrace{\left(\uFunc^{c}_{t} + \Discount (\vFunc_{t+1}^{h} - \Rfree \vFunc_{t+1}^{\mRat} )\right)}_{= 0} \end{aligned}

where again the underbraced term vanishes at ct=ct(mt,ht)c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t}) by (8). Thus we have

vth=dvtdhtct=ct(mt,ht)=uth.\begin{aligned} \vFunc_{t}^{h} & = \frac{d \underline{\vFunc}_{t}}{d \habit_{t}}|_{c_{t} = \mathbf{c}_{t}(\mRat_{t},\habit_{t})} \\ & = \uFunc_{t}^{h}. \end{aligned}

Turning now to more direct use of the envelope theorem, the Chain Rule tells us

vth=vtmtmtht=0+vtht+vtctctht\vFunc_{t}^{h} = \frac{\partial \underline{\vFunc}_{t}}{\partial \mRat_{t}} \overbrace{\frac{\partial \mRat_{t}}{\partial \habit_{t}}}^{=0} + \frac{\partial \underline{\vFunc}_{t}}{\partial \habit_{t}} + \frac{\partial \underline{\vFunc}_{t}}{\partial c_{t}}\frac{\partial \mathbf{c}_{t}}{\partial \habit_{t}}

while the Envelope theorem once again says vt/ct=0\partial \underline{\vFunc}_{t}/\partial c_{t} = 0 at ct=ct(mt,ht)c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t}) so we obtain

vth=vtht=uth\begin{aligned} \vFunc_{t}^{h} & = \frac{\partial \underline{\vFunc}_{t}}{\partial \habit_{t}} \\ & = \uFunc_{t}^{h} \end{aligned}

since ht\habit_{t} appears directly only in the u(ct,ht)\uFunc(c_{t},\habit_{t}) part of vt(mt,ht,ct)\underline{\vFunc}_{t}(\mRat_{t},\habit_{t},c_{t}). And once again, the shortest way to the answer is to treat ctc_{t} as though it were a constant in the value function, which yields

vt(mt,ht)=u(ct,ht)+βvt+1((mtct)R+yt+1,ct)vth(mt,ht)=uth.\begin{aligned} \vFunc_{t}(\mRat_t,\habit_{t}) & = \uFunc(c_{t},\habit_{t})+\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},c_{t}) \\ \vFunc_{t}^{h}(\mRat_{t},\habit_{t}) & = \uFunc_{t}^{h}. \end{aligned}

From (17) this implies that

vtm=utc+βut+1h.\vFunc_{t}^{\mRat} = \uFunc^{c}_{t}+\Discount \uFunc_{t+1}^{h}.

Roll this equation forward one period and substitute into equation (15) to obtain:

utc+βut+1h=Rβ[ut+1c+βut+2h]\uFunc^{c}_{t} + \Discount \uFunc_{t+1}^{h} = \Rfree \Discount \left[\uFunc^{c}_{t+1}+\Discount \uFunc^{h}_{t+2}\right]

Note that if ut+1h=ut+2h=0\uFunc_{t+1}^{h}=\uFunc_{t+2}^{h}=0 so that habits have no effect on utility, (24) again is solved by the standard time-separable Euler equation.

Now assume that the utility function takes the specific form

u(c,h)=f(cαh)\uFunc(c,h) = \mathbf{f}(c-\alpha h)

which implies derivatives of

uc=fuh=αf.\begin{aligned} \uFunc^{c} & = \mathbf{f}^{\prime} \\ \uFunc^{h} & = -\alpha \mathbf{f}^{\prime}. \end{aligned}

Substituting these into equation (24) we obtain,

ftαβft+1=Rβ[ft+1αβft+2]\mathbf{f}_{t}^{\prime}-\alpha \Discount \mathbf{f}_{t+1}^{\prime} = \Rfree\Discount[\mathbf{f}_{t+1}^{\prime} -\alpha \Discount \mathbf{f}_{t+2}^{\prime}]

Now assume that there is a solution in which marginal utility of consumption grows at a constant rate over time, ft=kft+1\mathbf{f}_{t}^{\prime}=k \mathbf{f}_{t+1}^{\prime} and substitute into (27)

ft+1(kαβ)=Rβ[ft+2(kαβ)]kft+2(kαβ)=Rβ[ft+2(kαβ)]k=Rβ\begin{aligned} \mathbf{f}_{t+1}^{\prime}(k-\alpha \Discount) & = \Rfree\Discount[\mathbf{f}_{t+2}^{\prime}(k-\alpha \Discount)] \\ k \mathbf{f}_{t+2}^{\prime}(k-\alpha \Discount) & = \Rfree\Discount[\mathbf{f}_{t+2}^{\prime}(k-\alpha \Discount)] \\ k & = \Rfree\Discount \end{aligned}

so marginal utility grows at rate 1/Rβ1/\Rfree\Discount. Note that if we assume α=0\alpha=0 so that habits do not matter, we again obtain the standard result that u(ct)=Rβu(ct+1)\uFunc^{\prime}(c_{t}) = \Rfree\Discount \uFunc^{\prime}(c_{t+1}).

Now make the final assumption that f(z)=z1ρ/(1ρ)\mathbf{f}(z) = z^{1-\CRRA}/(1-\CRRA), implying of course that f(z)=zρ\mathbf{f}^{\prime}(z) = z^{-\CRRA}. Equation (28) can be rewritten

1=Rβ(zt+1/zt)ρ1 = \Rfree\Discount (z_{t+1}/z_{t})^{-\CRRA}

Now expand zt+1/ztz_{t+1}/z_{t}

ct+1αctctαct1=ct+1/ctα1αct1/ct\frac{c_{t+1}-\alpha c_{t}}{c_{t}-\alpha c_{t-1}} = \frac{c_{t+1}/c_{t}-\alpha}{1-\alpha c_{t-1}/c_{t}}
1+Δlogct+1α1α+αΔlogct\approx \frac{1+\Delta \log c_{t+1}-\alpha}{1 - \alpha + \alpha \Delta \log c_{t}}
=1α+Δlogct+11α+αΔlogct= \frac{1-\alpha+\Delta \log c_{t+1}}{1 - \alpha + \alpha \Delta \log c_{t}}
=1+(Δlogct+1)/(1α)1+(α/(1α))Δlogct= \frac{1+(\Delta \log c_{t+1})/(1-\alpha)}{1 + (\alpha/(1-\alpha)) \Delta \log c_{t}}
1+(11α)(Δlogct+1αΔlogct)\approx 1+\left(\frac{1}{1-\alpha}\right)\left(\Delta \log c_{t+1} - \alpha \Delta \log c_{t} \right)

where (31) follows from (30) because ct+1/ct=1+(ct+1ct)/ct1+Δlogct+1c_{t+1}/c_{t} = 1+(c_{t+1}-c_{t})/c_{t} \approx 1+\Delta \log c_{t+1} and ct1/ct=(ct(ctct1))/ct1Δlogctc_{t-1}/c_{t} = (c_{t}-(c_{t}-c_{t-1}))/c_{t} \approx 1 - \Delta \log c_{t}, and (34) follows from (33) because for small η\eta and ϵ\epsilon, (1+η)/(1+ϵ)1+ηϵ(1+\eta)/(1+\epsilon) \approx 1+\eta-\epsilon.

Substituting (34) into (29) gives

1Rβ(1+(11α)(Δlogct+1αΔlogct))ρ0log[Rβ]ρ(11α)(Δlogct+1αΔlogct)Δlogct+1(1α)ρ1(rϑ)+αΔlogct.\begin{aligned} 1 & \approx \Rfree\Discount (1+\left(\frac{1}{1-\alpha}\right)\left(\Delta \log c_{t+1} - \alpha \Delta \log c_{t} \right))^{-\CRRA} \\ 0 & \approx \log [\Rfree\Discount] - \CRRA \left(\frac{1}{1-\alpha}\right)\left(\Delta \log c_{t+1} - \alpha \Delta \log c_{t} \right) \\ \Delta \log c_{t+1} & \approx (1-\alpha)\CRRA^{-1}(\rfree - \timeRate) + \alpha \Delta \log c_{t}. \end{aligned}

Thus, this formulation of habit formation implies that the growth rate of consumption is serially correlated.

Footnotes
  1. Google the string “partial differentiation confusing OCW” to find a fuller description of the problems of standard notation on partial differentiation.

References
  1. Carroll, C. D. (2000). Solving Consumption Models with Multiplicative Habits. Economics Letters, 68(1), 67–77. 10.1016/S0165-1765(00)00223-8