Consumption Models with Habit Formation

1The Problem¶

Consider a consumer whose goal at date $t$ is to solve the problem (this section is a simplified version of Carroll (2000)):

\max \sum_{n=0}^{T-\tNow} \Discount^{n} \uFunc(c_{t+n},\habit_{t+n})

(1)

where $\habit_{t+n}$ is the habit stock, and all other variables are as usually defined. Note that here “habit formation” refers to something different from the colloquial usage of the term: it is not about developing behavioral routines or rules of thumb for saving. Rather, it refers to preferences in which past consumption affects current utility. Utility is decreasing in the habit stock ( $\uFunc^{h}<0$ ): higher past consumption raises the reference point against which current consumption is compared, making any given level of current consumption less satisfying. This utility function is not “time separable” in the original sense, but once habits are included as a state variable it becomes time-separable. The DBC is

\mRat_{t+1} = (\mRat_{t}-c_{t})\Rfree + y_{t+1}.

(2)

However, when habits affect utility we must also specify a process that describes how habits evolve over time. Our assumption will be:

FirstYearVersion=true:

\habit_{t+1} = c_{t}.

(3)

FirstYearVersion=false:

\habit_{t+1} = \habit_{t} + \lambda (c_{t} - \habit_{t}).

(4)

Bellman’s equation for this problem is therefore

\vFunc_{t}(\mRat_{t},\habit_{t}) = \max_{\{c_{t}\}} ~~\uFunc(c_{t},\habit_{t}) + \Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},c_{t}).

(5)

To clarify the workings of the Envelope theorem in the case with two state variables, let’s define a function $\underline{\vFunc}_{t}$ that represents value for any choice of $c_{t}$ , not just the optimal one:

\underline{\vFunc}_{t}(\mRat_{t},\habit_{t},c_{t}) = \uFunc(c_{t},\habit_{t}) +\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},c_{t})

(6)

and define the function $\mathbf{c}_{t}(\mRat_{t},\habit_{t})$ as the choice of $c_{t}$ that solves the maximization (5), so that we have

\vFunc_{t}(\mRat_{t},\habit_{t}) = \underline{\vFunc}_{t}(\mRat_{t},\habit_{t},\mathbf{c}_{t}(\mRat_{t},\habit_{t})).

(7)

2Optimality Conditions¶

2.1The First Order Condition¶

The first order condition for (5) with respect to $c_{t}$ is (dropping arguments for brevity and denoting the derivative of $f$ with respect to $x$ at time $t$ as $f^{x}_{t}$ ):

0 = \uFunc^{c}_{t} + \Discount \left(\vFunc^{h}_{t+1} - \Rfree \vFunc^{\mRat}_{t+1} \right)

(8)

or, equivalently,

\uFunc^{c}_{t} = \Discount \left(\Rfree \vFunc_{t+1}^{\mRat}- \vFunc_{t+1}^{h}\right).

(9)

The intuition is as follows. Note first that if utility is not affected by habits, then $\vFunc^{h}_{t+1}=0$ and equation (9) reduces to the usual first order condition for consumption, which tells us that increasing consumption by $\epsilon$ today and reducing it by $\Rfree \epsilon$ in the next period must not change expected discounted utility. With habits, an increase in consumption today has a consequence beyond its effect on tomorrow’s resources $\mRat_{t+1}$ : tomorrow’s habit stock will be changed as well. An increase in consumption today of size $\epsilon$ increases the size of the habit stock which tomorrow’s consumption is compared to, and therefore reduces tomorrow’s utility by an amount corresponding to the marginal utility of higher habits tomorrow $\vFunc^{h}_{t+1}$ . Since $\vFunc^{h}_{t+1}$ is negative (higher habits make utility lower), this tells us that the RHS of equation (9) will be a larger positive number than it would be without habits. This means that the level of $c_{t}$ that satisfies the first order condition will be a lower number (higher marginal utility) than before. Hence, habits increase the willingness to delay spending, and increase the saving rate.

Note that the first order condition also implies that

\frac{d \underline{\vFunc}_{t}}{d c_{t}} = 0

(10)

when evaluated at $c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})$ .

2.2Envelope Conditions¶

Now consider the total derivative of $\underline{\vFunc}_{t}(\mRat_{t},\habit_{t},\mathbf{c}_{t}(\mRat_{t},\habit_{t}))$ with respect to $\mRat_{t}$ . (To reduce clutter, I will write $d \mathbf{c}_{t}(\mRat_{t},\habit_{t})/d \mRat_{t}$ as $d \mathbf{c}_{t}/d \mRat_{t}$ ). The chain rule tells us that

\begin{aligned} \frac{d \underline{\vFunc}_{t}}{d \mRat_{t}} & = \frac{d \mathbf{c}_{t}}{d \mRat_{t}}\uFunc^{c}_{t}+\overbrace{\frac{d \habit_{t}}{d \mRat_{t}}}^{=0} \uFunc^{h}_{t} + \Discount \left((\frac{d \mathbf{c}_{t}}{d \mRat_{t}})(\vFunc^{h}_{t+1}-\Rfree \vFunc_{t+1}^{\mRat}) + \Rfree \vFunc_{t+1}^{\mRat} \right) \\ & = \left(\frac{d \mathbf{c}_{t}}{d \mRat_{t}}\right) \underbrace{(\uFunc_{t}^{c}+\Discount\left(\vFunc_{t+1}^{h}-\Rfree \vFunc_{t+1}^{\mRat}\right))}_{=0}+\Discount \Rfree \vFunc_{t+1}^{\mRat} \end{aligned}

(11)

where the underbraced term vanishes at $c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})$ by the first-order condition (8). Thus we have that

\begin{aligned} \vFunc_{t}^{\mRat} & = \frac{d \underline{\vFunc}_{t}}{d \mRat_{t}}|_{c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})} \\ & = \Discount \Rfree \vFunc^{\mRat}_{t+1}. \end{aligned}

(12)

The Envelope theorem is the shortcut way to obtain this conclusion. The clearest way to use the theorem is by taking the partial derivatives of the $\underline{\vFunc}_{t}$ function with respect to each of its three arguments, using the Chain Rule to take into account the possible dependency of $\habit_{t}$ and $c_{t}$ on $\mRat_{t}$ :

\begin{aligned} \vFunc_{t}(\mRat_{t},\habit_{t}) & = \underline{\vFunc}_{t}(\mRat_{t},\habit_{t},\mathbf{c}_{t}(\mRat_{t},\habit_{t})) \\ \vFunc_{t}^{\mRat} & = \frac{\partial \underline{\vFunc}_{t}}{\partial \mRat_{t}} + \frac{\partial \underline{\vFunc}_{t}}{\partial \habit_{t}}\underbrace{\frac{\partial \habit_{t}}{\partial \mRat_{t}}}_{=0} + \underbrace{\frac{\partial \underline{\vFunc}_{t}}{\partial c_{t}}}_{=0}\frac{\partial \mathbf{c}_{t}}{\partial \mRat_{t}}+\frac{\partial \underline{\vFunc}_{t}}{\partial c_{t}}\frac{\partial c_{t}}{\partial \habit_{t}}\underbrace{\frac{\partial \habit_{t}}{\partial \mRat_{t}}}_{=0} \end{aligned}

(13)

where the Envelope theorem is what tells you that the $\partial \underline{\vFunc}_{t}/\partial c_{t}$ term is equal to zero because you are evaluating the function at $c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})$ (and $\partial \habit_{t}/\partial \mRat_{t}$ is zero by the assumed structure of the problem in which $\habit_{t}$ is predetermined).

Now writing out $\partial \underline{\vFunc}_{t}/\partial \mRat_{t}$ , (13) becomes

\vFunc^{\mRat}_{t} = \frac{\partial}{\partial \mRat_{t}}\left[\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},\habit_{t+1})\right]

(14)

which the envelope theorem says is equivalent to

\vFunc^{\mRat}_{t} = \Discount \Rfree \vFunc_{t+1}^{\mRat}.

(15)

There is a potentially confusing thing about doing it this way, however: when you reach an expression like (14) it is tempting to think to yourself as follows: “ $c_{t}$ is a function of $\mRat_{t}$ , and $\habit_{t+1}=c_{t}$ is also indirectly a function of $\mRat_{t}$ , so the chain rule tells me that when I take the derivative in (14) I need to keep track of these.” In fact, you must treat $\partial c_{t}/\partial \mRat_{t}$ and $\partial \habit_{t+1}/\partial \mRat_{t}$ as zero here. The reason is that this is a partial derivative with respect to $\mRat_{t}$ . The dependence of $c_{t}$ (and indirectly $\habit_{t+1}$ ) on $\mRat_{t}$ has already been taken care of in the two terms in (13) that were equal to zero. The confusion here is caused largely by the fact that partial differentiation is an area where standard mathematical notation is basically confusing and poorly chosen.^[1]

The shortest way to obtain the end result is, as in the single variable problem, to start with Bellman’s equation and take the partial derivative with respect to $\mRat_{t}$ directly (treating the problem as though $c_{t}$ were a constant):

\begin{aligned} \vFunc_{t}(\mRat_{t},\habit_{t}) & = \uFunc(c_{t},\habit_{t})+\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},\habit_{t+1}) \\ \vFunc_{t}^{\mRat}(\mRat_{t},\habit_{t}) & = \Discount \Rfree \vFunc_{t+1}^{\mRat}(\mRat_{t+1},\habit_{t+1}). \end{aligned}

(16)

Whichever way you do it, substituting (15) into the FOC equation (9) gives

\vFunc_{t}^{\mRat} = \uFunc^{c}_{t}+\Discount \vFunc_{t+1}^{h}.

(17)

The intuition for this is as follows. The marginal value of wealth must be equal to the marginal value associated with a tiny bit more consumption. In the presence of habits, the extra consumption yields extra utility today $\uFunc^{c}_{t}$ but affects value next period by $\vFunc^{h}_{t+1}$ (which is a negative number), the discounted consequence of which from today’s perspective is the $\Discount \vFunc^{h}_{t+1}$ term.

2.3Envelope Theorem for $\habit_{t}$ ¶

In a problem with two state variables, the Envelope theorem can be applied to each state (and indeed in general must be applied in order to solve the model).

Again let’s start the brute force way by working through the total derivative of $\underline{\vFunc}_{t}$ . For this problem, the total derivative (again denoting $d \mathbf{c}_{t}(\mRat_{t},\habit_{t})/d {h}_{t}$ as $d \mathbf{c}_{t}/d \habit_{t}$ ) is:

\begin{aligned} \frac{d \underline{\vFunc}_{t}}{d \habit_{t}} & = \frac{d \mathbf{c}_{t}}{d \habit_{t}} \uFunc^{c}_{t} + \uFunc^{h}_{t} + \Discount \left( \frac{d \habit_{t+1}}{d \habit_{t}}\vFunc_{t+1}^{h} + \frac{d \mRat_{t+1}}{d \habit_{t}} \vFunc_{t+1}^{\mRat}\right) \\ & = \frac{d \mathbf{c}_{t}}{d \habit_{t}} \uFunc^{c}_{t} + \uFunc^{h}_{t} + \Discount \left(\frac{d \habit_{t+1}}{d \mathbf{c}_{t}}\frac{d \mathbf{c}_{t}}{d \habit_{t}} \vFunc_{t+1}^{h} + \frac{d \mRat_{t+1}}{d \mathbf{c}_{t}}\frac{d \mathbf{c}_{t}}{d \habit_{t}} \vFunc_{t+1}^{\mRat} \right) \\ & = \uFunc^{h}_{t} + \frac{d \mathbf{c}_{t}}{d \habit_{t}} \underbrace{\left(\uFunc^{c}_{t} + \Discount (\vFunc_{t+1}^{h} - \Rfree \vFunc_{t+1}^{\mRat} )\right)}_{= 0} \end{aligned}

(18)

where again the underbraced term vanishes at $c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})$ by (8). Thus we have

\begin{aligned} \vFunc_{t}^{h} & = \frac{d \underline{\vFunc}_{t}}{d \habit_{t}}|_{c_{t} = \mathbf{c}_{t}(\mRat_{t},\habit_{t})} \\ & = \uFunc_{t}^{h}. \end{aligned}

(19)

Turning now to more direct use of the envelope theorem, the Chain Rule tells us

\vFunc_{t}^{h} = \frac{\partial \underline{\vFunc}_{t}}{\partial \mRat_{t}} \overbrace{\frac{\partial \mRat_{t}}{\partial \habit_{t}}}^{=0} + \frac{\partial \underline{\vFunc}_{t}}{\partial \habit_{t}} + \frac{\partial \underline{\vFunc}_{t}}{\partial c_{t}}\frac{\partial \mathbf{c}_{t}}{\partial \habit_{t}}

(20)

while the Envelope theorem once again says $\partial \underline{\vFunc}_{t}/\partial c_{t} = 0$ at $c_{t}=\mathbf{c}_{t}(\mRat_{t},\habit_{t})$ so we obtain

\begin{aligned} \vFunc_{t}^{h} & = \frac{\partial \underline{\vFunc}_{t}}{\partial \habit_{t}} \\ & = \uFunc_{t}^{h} \end{aligned}

(21)

since $\habit_{t}$ appears directly only in the $\uFunc(c_{t},\habit_{t})$ part of $\underline{\vFunc}_{t}(\mRat_{t},\habit_{t},c_{t})$ . And once again, the shortest way to the answer is to treat $c_{t}$ as though it were a constant in the value function, which yields

\begin{aligned} \vFunc_{t}(\mRat_t,\habit_{t}) & = \uFunc(c_{t},\habit_{t})+\Discount \vFunc_{t+1}((\mRat_{t}-c_{t})\Rfree+y_{t+1},c_{t}) \\ \vFunc_{t}^{h}(\mRat_{t},\habit_{t}) & = \uFunc_{t}^{h}. \end{aligned}

(22)

From (17) this implies that

\vFunc_{t}^{\mRat} = \uFunc^{c}_{t}+\Discount \uFunc_{t+1}^{h}.

(23)

Roll this equation forward one period and substitute into equation (15) to obtain:

\uFunc^{c}_{t} + \Discount \uFunc_{t+1}^{h} = \Rfree \Discount \left[\uFunc^{c}_{t+1}+\Discount \uFunc^{h}_{t+2}\right]

(24)

Note that if $\uFunc_{t+1}^{h}=\uFunc_{t+2}^{h}=0$ so that habits have no effect on utility, (24) again is solved by the standard time-separable Euler equation.

Now assume that the utility function takes the specific form

\uFunc(c,h) = \mathbf{f}(c-\alpha h)

(25)

which implies derivatives of

\begin{aligned} \uFunc^{c} & = \mathbf{f}^{\prime} \\ \uFunc^{h} & = -\alpha \mathbf{f}^{\prime}. \end{aligned}

(26)

Substituting these into equation (24) we obtain,

\mathbf{f}_{t}^{\prime}-\alpha \Discount \mathbf{f}_{t+1}^{\prime} = \Rfree\Discount[\mathbf{f}_{t+1}^{\prime} -\alpha \Discount \mathbf{f}_{t+2}^{\prime}]

(27)

Now assume that there is a solution in which marginal utility of consumption grows at a constant rate over time, $\mathbf{f}_{t}^{\prime}=k \mathbf{f}_{t+1}^{\prime}$ and substitute into (27)

\begin{aligned} \mathbf{f}_{t+1}^{\prime}(k-\alpha \Discount) & = \Rfree\Discount[\mathbf{f}_{t+2}^{\prime}(k-\alpha \Discount)] \\ k \mathbf{f}_{t+2}^{\prime}(k-\alpha \Discount) & = \Rfree\Discount[\mathbf{f}_{t+2}^{\prime}(k-\alpha \Discount)] \\ k & = \Rfree\Discount \end{aligned}

(28)

so marginal utility grows at rate $1/\Rfree\Discount$ . Note that if we assume $\alpha=0$ so that habits do not matter, we again obtain the standard result that $\uFunc^{\prime}(c_{t}) = \Rfree\Discount \uFunc^{\prime}(c_{t+1})$ .

Now make the final assumption that $\mathbf{f}(z) = z^{1-\CRRA}/(1-\CRRA)$ , implying of course that $\mathbf{f}^{\prime}(z) = z^{-\CRRA}$ . Equation (28) can be rewritten

1 = \Rfree\Discount (z_{t+1}/z_{t})^{-\CRRA}

(29)

Now expand $z_{t+1}/z_{t}$

\frac{c_{t+1}-\alpha c_{t}}{c_{t}-\alpha c_{t-1}} = \frac{c_{t+1}/c_{t}-\alpha}{1-\alpha c_{t-1}/c_{t}}

(30)

\approx \frac{1+\Delta \log c_{t+1}-\alpha}{1 - \alpha + \alpha \Delta \log c_{t}}

(31)

= \frac{1-\alpha+\Delta \log c_{t+1}}{1 - \alpha + \alpha \Delta \log c_{t}}

(32)

= \frac{1+(\Delta \log c_{t+1})/(1-\alpha)}{1 + (\alpha/(1-\alpha)) \Delta \log c_{t}}

(33)

\approx 1+\left(\frac{1}{1-\alpha}\right)\left(\Delta \log c_{t+1} - \alpha \Delta \log c_{t} \right)

(34)

where (31) follows from (30) because $c_{t+1}/c_{t} = 1+(c_{t+1}-c_{t})/c_{t} \approx 1+\Delta \log c_{t+1}$ and $c_{t-1}/c_{t} = (c_{t}-(c_{t}-c_{t-1}))/c_{t} \approx 1 - \Delta \log c_{t}$ , and (34) follows from (33) because for small $\eta$ and $\epsilon$ , $(1+\eta)/(1+\epsilon) \approx 1+\eta-\epsilon$ .

Substituting (34) into (29) gives

\begin{aligned} 1 & \approx \Rfree\Discount (1+\left(\frac{1}{1-\alpha}\right)\left(\Delta \log c_{t+1} - \alpha \Delta \log c_{t} \right))^{-\CRRA} \\ 0 & \approx \log [\Rfree\Discount] - \CRRA \left(\frac{1}{1-\alpha}\right)\left(\Delta \log c_{t+1} - \alpha \Delta \log c_{t} \right) \\ \Delta \log c_{t+1} & \approx (1-\alpha)\CRRA^{-1}(\rfree - \timeRate) + \alpha \Delta \log c_{t}. \end{aligned}

(35)

Thus, this formulation of habit formation implies that the growth rate of consumption is serially correlated.

Footnotes¶

Google the string “partial differentiation confusing OCW” to find a fuller description of the problems of standard notation on partial differentiation.
↩

References¶

Carroll, C. D. (2000). Solving Consumption Models with Multiplicative Habits. Economics Letters, 68(1), 67–77. 10.1016/S0165-1765(00)00223-8