8.4 Moving average models
Rather than using past values of the forecast variable in a regression, a moving average model uses past forecast errors in a regression-like model. yt=c+et+θ1et−1+θ2et−2+⋯+θqet−q, where et is white noise. We refer to this as an MA(q) model. Of course, we do not observe the values of et, so it is not really a regression in the usual sense.
Notice that each value of yt can be thought of as a weighted moving average of the past few forecast errors. However, moving average models should not be confused with the moving average smoothing we discussed in Chapter 6. A moving average model is used for forecasting future values, while moving average smoothing is used for estimating the trend-cycle of past values.

Figure 8.6: Two examples of data from moving average models with different parameters. Left: MA(1) with yt=20+et+0.8et−1. Right: MA(2) with yt=et−et−1+0.8et−2. In both cases, et is normally distributed white noise with mean zero and variance one.
Figure 8.6 shows some data from an MA(1) model and an MA(2) model. Changing the parameters θ1,…,θq results in different time series patterns. As with autoregressive models, the variance of the error term et will only change the scale of the series, not the patterns.
It is possible to write any stationary AR(p) model as an MA(∞) model. For example, using repeated substitution, we can demonstrate this for an AR(1) model: yt=ϕ1yt−1+et=ϕ1(ϕ1yt−2+et−1)+et=ϕ21yt−2+ϕ1et−1+et=ϕ31yt−3+ϕ21et−2+ϕ1et−1+etetc.
Provided −1<ϕ1<1, the value of ϕk1 will get smaller as k gets larger. So eventually we obtain yt=et+ϕ1et−1+ϕ21et−2+ϕ31et−3+⋯, an MA(∞) process.
The reverse result holds if we impose some constraints on the MA parameters. Then the MA model is called “invertible”. That is, we can write any invertible MA(q) process as an AR(∞) process.
Invertible models are not simply introduced to enable us to convert from MA models to AR models. They also have some mathematical properties that make them easier to use in practice.
The invertibility constraints are similar to the stationarity constraints.
- For an MA(1) model: −1<θ1<1.
- For an MA(2) model: −1<θ2<1, θ2+θ1>−1, θ1−θ2<1.
More complicated conditions hold for q≥3. Again, R will take care of these constraints when estimating the models.