Open access peer-reviewed chapter

ARIMA Models with Time-Dependent Coefficients: Official Statistics Examples

Written By

Guy Mélard

Submitted: 26 July 2022 Reviewed: 27 October 2022 Published: 30 November 2022

DOI: 10.5772/intechopen.108789

From the Edited Volume

Time Series Analysis - New Insights

Edited by Rifaat Abdalla, Mohammed El-Diasty, Andrey Kostogryzov and Nikolay Makhutov


Abstract

About 25 years ago, effective methods for dealing with time series models that vary with time appeared in the statistical literature. Except in a few cases, they have never been used for economic statistics. In this chapter, we consider autoregressive integrated moving average (ARIMA) models with time-dependent coefficients (tdARIMA) applied to monthly industrial production series. We start with a small-size study of time-dependent integrated autoregressive (tdARI) models on Belgian series, compared to standard ARI models with constant coefficients. Then, a second, bigger illustration is given on 293 U.S. industrial production time series with tdARIMA models. We employ the software package Tramo to obtain linearized series and model specifications, and we build both the ARIMA models with constant coefficients (cARIMA) and the tdARIMA models using specialized software. In these tdARIMA models, we use the simplest specification for each coefficient: a simple regression with respect to time. Surprisingly, for a large part of the series, there are statistically significant slopes, indicating that the tdARIMA models fit the series better than the cARIMA models.

Keywords

  • nonstationary process
  • time series
  • time-dependent model
  • time-varying model
  • local stationarity

1. Introduction

About 25 years ago, effective methods for dealing with time series models that vary with time appeared in the statistical literature. Except in a few cases, like Van Bellegem and von Sachs [1] for marginal heteroscedasticity in financial data or Kapetanios et al. [2], they are not used for economic statistics. In this chapter, we consider autoregressive integrated moving average (ARIMA) models with time-dependent coefficients (tdARIMA), which provide a natural alternative to standard ARIMA models. Several theories for parametric estimation in that context appeared in the last 25 years, including Dahlhaus' approach based on locally stationary processes, see Dahlhaus [3, 4]. To simplify the presentation of the method in Section 2, we first focus on integrated autoregressive (ARI) models before going to the general case of ARIMA. Section 3 is devoted to illustrations on official time series, more precisely industrial production series. We start with a small-size study on Belgian monthly industrial production and show an improvement of time-dependent integrated autoregressive (tdARI) models with respect to standard ARI models with constant coefficients. Then, a second, bigger illustration of tdARIMA models is given on 293 U.S. industrial production time series, already used by Proietti and Lütkepohl [5] with a different objective. We employ the software package Tramo of Gómez and Maravall [6] to obtain linearized series and model specifications, and we build both ARIMA models with constant coefficients (cARIMA) and tdARIMA models based on the Tramo specifications. This is done with specialized software, since no existing package can cope with these tdARIMA models. In these tdARIMA models, we use the simplest specification for each coefficient: a simple regression with respect to time, hence two parameters, a constant and a slope. Indeed, this is the closest departure from constancy, and it seems natural in an evolving world. We will see that, for a large part of the series, there are statistically significant slopes, indicating that the tdARIMA models fit the series better than the cARIMA models. In a second step, since many of the slopes introduced as additional parameters are not significantly different from 0, they are omitted one by one, starting with the least significant one, until all the remaining slopes are significantly different from 0 at the 5% level. Most of the summary results are then improved. Section 4 contains our conclusions.


2. Methods

We consider the well-known class of multiplicative seasonal ARIMA models, see e.g. Gómez and Maravall and Box et al. [6, 7]. Models with time-dependent coefficients often appear in econometrics, but not in the form of ARIMA models. For very long time series, there is no reason why the coefficients should stay constant. They can be supposed to vary slowly with time, although breaks could also be considered. This is the reason why linear (or other) functions of time replace the constant coefficients here. Time series models with time-varying coefficients have been studied mainly from a theoretical point of view. In addition to [3, 4], several papers [8, 9, 10] provide conditions for the asymptotic properties, hence the justification for statistical inference; otherwise, our tests on the slopes would have no foundation. These conditions are of course enforced in the estimation procedure.

2.1 The model

To illustrate a simple ARIMA model with a time-dependent coefficient, we can consider the ARMA(1,1) model. Let the series be denoted by $y = (y_1, y_2, \ldots, y_n)$. Then a tdARMA(1,1) model is described by the following equation:

$$y_t = \phi_t^{(n)} y_{t-1} + e_t - \theta_t^{(n)} e_{t-1}, \qquad (1)$$

where the $e_t$ are independent random variables with mean zero and standard deviation $\sigma$, and the time-dependent coefficients $\phi_t^{(n)}$ and $\theta_t^{(n)}$ depend on the time $t$, on $n$, the length of the series, and on a small number of parameters stored in an $m \times 1$ vector $\beta$. The simplest specification for $\phi_t^{(n)}$, for example, is as follows:

$$\phi_t^{(n)}(\beta) = \phi + \frac{1}{n-1}\left(t - \frac{n+1}{2}\right)\phi', \qquad (2)$$

where $\phi$ is an intercept and $\phi'$ is a slope, with a similar expression for $\theta_t^{(n)}(\beta)$ using two other parameters $\theta$ and $\theta'$. The vector $\beta$ contains all the parameters to be estimated, those in $\phi_t^{(n)}(\beta)$ (here $\phi$ and $\phi'$) and in $\theta_t^{(n)}(\beta)$ ($\theta$ and $\theta'$), but not the scale factor $\sigma$, which is estimated separately. For the corresponding cARIMA model, there is of course no slope, i.e., $\phi' = \theta' = 0$. For a lag $k$ instead of 1, we add a subscript $k$ to the coefficient symbols.
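
As an illustration, the linear specification of Eq. (2) can be coded as in the following sketch; this is our own minimal Python rendering with illustrative parameter values, not code from the software used in this chapter.

```python
import numpy as np

def td_coefficient(t, n, intercept, slope):
    """Linear time-dependent coefficient of Eq. (2):
    intercept + slope * (t - (n + 1) / 2) / (n - 1),
    for a time index t (scalar or array) in 1, ..., n."""
    t = np.asarray(t, dtype=float)
    return intercept + slope * (t - (n + 1) / 2.0) / (n - 1.0)

# Illustrative values: an AR(1) coefficient drifting from about 0.5 to 0.7 over n = 120 months
n = 120
phi_t = td_coefficient(np.arange(1, n + 1), n, intercept=0.6, slope=0.2)
print(phi_t[0], phi_t[-1])   # 0.5 and 0.7; the intercept 0.6 is the mid-sample value
```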

Let us now consider a general tdARMA(p, q) model. It is defined by the equation

$$y_t = \sum_{k=1}^{p} \phi_{tk}^{(n)}(\beta)\, y_{t-k} + e_t - \sum_{k=1}^{q} \theta_{tk}^{(n)}(\beta)\, e_{t-k}, \qquad (3)$$

where the coefficients $\phi_{tk}^{(n)}(\beta)$, $k = 1, \ldots, p$, and $\theta_{tk}^{(n)}(\beta)$, $k = 1, \ldots, q$, are deterministic functions of $t$ and, possibly, of $n$. The $e_t$, $t = 1, 2, \ldots$, are as before. We suppose that the number of additional parameters is small. In practice, for economic time series, linear or exponential functions of time, as in Eq. (2), seem to be enough instead of constant coefficients, but there is no problem in using other functions, up to a point. In other cases, see Alj et al. [11], periodic functions can be considered. In practice, we suppose that the coefficients are constant before the first observation.
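
To fix ideas, here is a small Python sketch of ours, with arbitrary parameter values, that simulates a tdARMA(1,1) series following Eqs. (1) and (2); it only illustrates the model equations, not the estimation procedure used in the chapter.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_tdarma11(n, phi0, phi1, theta0, theta1, sigma=1.0):
    """Simulate y_t = phi_t y_{t-1} + e_t - theta_t e_{t-1}, as in Eq. (1),
    with phi_t and theta_t linear in time as in Eq. (2);
    y_0 is set to 0 and e_0 is drawn like the other innovations."""
    t = np.arange(1, n + 1)
    phi = phi0 + phi1 * (t - (n + 1) / 2) / (n - 1)
    theta = theta0 + theta1 * (t - (n + 1) / 2) / (n - 1)
    e = rng.normal(0.0, sigma, size=n + 1)     # e[0] plays the role of e_0
    y = np.zeros(n + 1)
    for i in range(1, n + 1):
        y[i] = phi[i - 1] * y[i - 1] + e[i] - theta[i - 1] * e[i - 1]
    return y[1:]

y = simulate_tdarma11(n=300, phi0=0.6, phi1=0.3, theta0=0.4, theta1=-0.2)
```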

Adding marginal heteroscedasticity should also be tried. Van Bellegem and von Sachs [1] had already shown the usefulness of a time-dependent variance. Indeed, there is no reason why the innovation standard deviation should be constant. We replace $e_{t-k}$, $k = 0, 1, \ldots, q$, in Eq. (3) with $g_{t-k}^{(n)}(\beta)\, e_{t-k}$, where $g_t^{(n)}(\beta)$ is a (strictly positive) deterministic function of $t$ and, possibly, of $n$, depending on the parameters, so that the standard deviation becomes $g_t^{(n)}(\beta)\,\sigma > 0$. Adding $g_t^{(n)}(\beta)$ is also covered by Azrak and Mélard [8, 12]. In practice, we used an exponential function of time for $g_t^{(n)}(\beta)$.

Since the series are nonstationary, we also need to consider regular and seasonal differences, $\nabla$ and $\nabla_s$, where $s$ is the seasonal period ($s = 12$ for monthly data), applied to the observations, possibly after a square root or logarithmic transformation. Furthermore, the series are not seasonally adjusted, so the so-called multiplicative seasonal models of Box et al. [7] are also needed.

2.2 The estimation method

For any tdARIMA model, we can estimate the parameters by maximizing the logarithm of the Gaussian likelihood. Time Series Expert [13], and more precisely its computational engine ANSECH, is used for that purpose. It is based on an exact algorithm for the computation of the Gaussian likelihood [14] and an implementation of a Levenberg-Marquardt nonlinear least-squares algorithm. Under some very general conditions [8, 12], it is shown that the quasi-maximum likelihood estimator $\hat{\beta}$ converges to the true value of $\beta$ and is asymptotically normal, more precisely $\sqrt{n}\,(\hat{\beta} - \beta) \xrightarrow{D} N(0, V^{-1})$ when $n \to \infty$, where $\xrightarrow{D}$ indicates convergence in distribution and $V^{-1}$ is the asymptotic covariance matrix. Moreover, $V$ can be estimated as a by-product of estimation. Let us denote its estimator by $\hat{V}_n$. The Student $t$ statistics shown in the next section make use of the standard errors deduced from the estimation of $V$. Using the asymptotic covariance matrix, it is also possible to design a Wald test for a subset $b$ of $r$ among the $m$ parameters in $\beta$, for example, to test that all the slopes are equal to 0, using a $\chi^2$ distribution. Let $R$ be the $r \times m$ restriction matrix composed of the rows of the $m \times m$ identity matrix that correspond to the parameters in the subset $b$, so that $b = R\beta$. The Wald statistic for testing that $b = 0$ is then $n\,\hat{b}'\,(R\,\hat{V}_n^{-1}\,R')^{-1}\,\hat{b}$, where $\hat{b}$ is the estimate of $b$ and $'$ indicates transposition. Under the null hypothesis, the statistic converges in distribution to a $\chi^2$ distribution with $r$ degrees of freedom when $n \to \infty$.
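
The following Python sketch only renders the Wald statistic just described in a schematic way (the actual computation is performed inside the estimation software); the function and argument names are ours.

```python
import numpy as np
from scipy import stats

def wald_test_slopes(beta_hat, V_hat, slope_index, n):
    """Wald test that the subset b = R beta (here, the slopes) is zero.
    V_hat estimates V, where sqrt(n) (beta_hat - beta) -> N(0, V^{-1}),
    and slope_index lists the positions of the slope parameters in beta."""
    beta_hat = np.asarray(beta_hat, dtype=float)
    m = beta_hat.size
    R = np.eye(m)[slope_index, :]                  # r x m restriction matrix
    b_hat = R @ beta_hat                           # estimated slopes
    cov_b = R @ np.linalg.inv(V_hat) @ R.T         # asympt. covariance of sqrt(n) b_hat
    W = n * b_hat @ np.linalg.solve(cov_b, b_hat)  # Wald statistic
    p_value = stats.chi2.sf(W, df=len(slope_index))
    return W, p_value

# Hypothetical usage with m = 4 parameters, the slopes in positions 1 and 3:
# W, p = wald_test_slopes(beta_hat, V_hat, slope_index=[1, 3], n=372)
```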

Note that centering of time around its mean (n + 1)/2 in Eq. (2) improves the statistical properties of the estimators by reducing the amount of correlation between their elements and that the factor 1/(n–1) is there to avoid explosive behavior when n.

Note also that the conditions for convergence and asymptotic normality are satisfied in the present case because a sufficient condition [15] is that the AR and MA polynomials have their roots outside the unit circle at all times and that condition is checked during estimation.
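
That condition can be illustrated with a small sketch of ours (not the check implemented in the estimation software): for each time $t$, the roots of the AR or MA polynomial are computed and compared with the unit circle.

```python
import numpy as np

def roots_outside_unit_circle(coeffs_t):
    """For every time t, check that the polynomial 1 - c_1 z - ... - c_p z^p
    built from row t of `coeffs_t` (an (n, p) array of AR or MA coefficients)
    has all its roots strictly outside the unit circle."""
    for row in np.atleast_2d(np.asarray(coeffs_t, dtype=float)):
        # np.roots expects coefficients from the highest degree to the constant term
        poly = np.concatenate(([1.0], -row))[::-1]
        if np.any(np.abs(np.roots(poly)) <= 1.0):
            return False
    return True

# Hypothetical example: an AR(1) coefficient drifting from 0.5 to 0.7 stays admissible
n = 120
phi_t = 0.6 + 0.2 * (np.arange(1, n + 1) - (n + 1) / 2) / (n - 1)
print(roots_outside_unit_circle(phi_t[:, None]))   # True
```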

An asymptotic theory for locally stationary processes due to Dahlhaus [3, 4] can also be used. There seems to exist only one software implementation, the R package LSTS (for Locally Stationary Time Series) by Olea et al. [16], that supports the estimation of locally stationary ARMA models, see also Palma et al. [17]. Since it does not cope with the multiplicative seasonal models necessary to deal with seasonally unadjusted time series, we have preferred to rely on the approach of Azrak and Mélard [8], implemented in Time Series Expert, for the estimation. See Azrak and Mélard [18] for a comparison of the existing theories.

2.3 The datasets

In the first empirical analysis, the number of series is limited, and simple pure autoregressive models are used. The purpose is to show the basic elements of the methodology. We used a dataset of indices of monthly Belgian industrial production for the period 1985-1994, by branch of activity, 26 in all. Nine years are used for fitting the models and the tenth year is used to compute ex-post forecasts and the mean absolute percentage error (MAPE). An automatic procedure is applied to fit ARIMA models, and we retained the 20 series out of 26 for which pure integrated autoregressive, or $ARI(p, d)(P, D)_{12}$, models are fitted to the series of 108 observations. Recall that these models are defined by the following equation:

$$\phi_p(L)\,\Phi_P(L^s)\,\nabla^d\,\nabla_{12}^D\, y_t = e_t, \qquad (4)$$

where $L$ is the lag operator, such that $L y_t = y_{t-1}$, and $\phi_p(L)$ and $\Phi_P(L^s)$ are, respectively, the regular and the seasonal autoregressive polynomials, of degrees $p$ and $12P$ in $L$. The model can include transformations and interventions (additive or on the differenced series), which are not detailed here. The fit is characterized by the value of the SBIC criterion. To obtain time-dependent ARI, or tdARI, models, slope parameters are added for each of the existing coefficients, like $\phi'$ for $\phi$ in Eq. (2). The models therefore have coefficients that are linear functions of time. For models in multiplicative seasonal form, the product of the regular and seasonal polynomials is first computed and slope parameters are added at each lag, but only for lags smaller than 14, for practical reasons. For example, for the $AR(2)(1)_{12}$ model, with the polynomial in the lag operator $L$

$$\left(1 - \phi_1 L - \phi_2 L^2\right)\left(1 - \Phi_1 L^{12}\right) = 1 - \phi_1 L - \phi_2 L^2 - \Phi_1 L^{12} + \phi_1\Phi_1 L^{13} + \phi_2\Phi_1 L^{14}, \qquad (5)$$

the specification is
$$1 - \phi_{t1}^{(n)} L - \phi_{t2}^{(n)} L^2 - \phi_{t,12}^{(n)} L^{12} - \phi_{t,13}^{(n)} L^{13} + \phi_2\Phi_1 L^{14},$$
where $\phi_{t1}^{(n)}$ is as in Eq. (2), and
$$\phi_{t2}^{(n)} = \phi_2 + \frac{1}{n-1}\left(t - \frac{n+1}{2}\right)\phi_2', \qquad
\phi_{t,12}^{(n)} = \Phi_1 + \frac{1}{n-1}\left(t - \frac{n+1}{2}\right)\phi_{12}',$$
$$\phi_{t,13}^{(n)} = \phi_1\Phi_1 + \frac{1}{n-1}\left(t - \frac{n+1}{2}\right)\phi_{13}',$$
with seven parameters, instead of the full form
$$1 - \phi_{t1}^{(n)} L - \phi_{t2}^{(n)} L^2 - \phi_{t,12}^{(n)} L^{12} - \phi_{t,13}^{(n)} L^{13} - \phi_{t,14}^{(n)} L^{14}$$
that would involve 10 parameters in all. This is enforced to restrict the number of parameters and avoid numerical problems. Note that the factor $1/(n-1)$ is there only for the asymptotic theory and will be omitted in practice.
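
As an illustration of how the multiplicative polynomial is expanded before slopes are attached to each lag, the following Python sketch (ours, not part of the estimation software; coefficient values are arbitrary) computes the product polynomial of Eq. (5) and lists its nonzero lags.

```python
import numpy as np

def expand_seasonal_ar(phi, Phi, s=12):
    """Expand the product of a regular AR polynomial 1 - phi_1 L - ... - phi_p L^p
    and a seasonal AR polynomial 1 - Phi_1 L^s - ... - Phi_P L^(sP);
    returns the coefficients of the product polynomial in L, constant term first."""
    regular = np.concatenate(([1.0], -np.asarray(phi, dtype=float)))
    seasonal = np.zeros(s * len(Phi) + 1)
    seasonal[0] = 1.0
    seasonal[s::s] = -np.asarray(Phi, dtype=float)
    return np.convolve(regular, seasonal)

# Illustrative values for the AR(2)(1)_12 case of Eq. (5): the product involves lags 1, 2, 12, 13, 14
prod = expand_seasonal_ar(phi=[0.5, -0.3], Phi=[0.6])
print(np.nonzero(prod[1:])[0] + 1)     # [ 1  2 12 13 14]
# Slopes are attached only to lags <= 13, so the td specification has
# 3 constants + 4 slopes = 7 parameters, instead of 10 for the full form.
```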

In the second empirical analysis, we use a big dataset of U.S. industrial production time series, already used by Proietti and Lütkepohl [5] for assessing transformations in forecasting. See http://www.federalreserve.gov/releases/g17/ipdisk/ip_nsa.txt. There are now 293 time series covering at least January 1986 to December 2018. Most series start earlier, and a few even start in 1919. The models were fitted up to December 2016, leaving the remaining months to compare the data with the ex-post forecasts, using either a fixed forecast origin with several horizons or rolling forecasts, each for a given horizon.
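
The two forecast evaluation schemes just described (a fixed origin in December 2016 with horizons 1 to 12, and rolling origins at a fixed horizon) can be sketched in Python as follows; this is an illustrative sketch, not the code used in the study, and `forecast_fn` is a hypothetical placeholder for whatever fitted model produces the h-step-ahead forecasts.

```python
import numpy as np

def mape(actual, forecast):
    """Mean absolute percentage error, in percent."""
    actual, forecast = np.asarray(actual, float), np.asarray(forecast, float)
    return 100.0 * np.mean(np.abs((actual - forecast) / actual))

def rolling_mape(series, forecast_fn, first_origin, horizon):
    """MAPE of h-step-ahead forecasts made from the rolling origins
    first_origin, first_origin + 1, ... (indices into `series`).
    `forecast_fn(history, h)` must return the h-step-ahead forecast computed
    from `history` only; refitting the model at each origin is up to the caller."""
    errors = []
    for o in range(first_origin, len(series) - horizon + 1):
        f = forecast_fn(series[:o], horizon)
        a = series[o + horizon - 1]
        errors.append(abs((a - f) / a))
    return 100.0 * np.mean(errors)

# Fixed-origin evaluation: forecasts for horizons 1 to 12 made from December 2016,
# compared with the 12 observations of 2017 via mape(actual_2017, forecasts_2017).
```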

We employ the software package Tramo described by Gómez and Maravall [6] to obtain partially linearized series by removing outliers and trading day effects. Indeed, the presence of outliers and trading day effects can distort the analysis, as could be seen in preliminary analyses. Selecting the cARIMA models in an automated way is also done using Tramo. Then we replace the constant coefficients with linear functions of $t$, as in Eq. (2), for each lag-$k$ coefficient in the model with $k \le 13$, giving tdARIMA models. At this stage, we do not omit nonsignificant parameters. The cARIMA and tdARIMA models are fitted using the same specialized software package, ANSECH, included in Time Series Expert, to facilitate the comparison. See Figure 1 for a schematic representation of the whole automatic procedure. For more complex time dependency, an automatic selection procedure like the one proposed by Van Bellegem and Dahlhaus [19] is possible.

Figure 1.

Schematic representation for the whole automatic treatment.

We compare the results of the tdARIMA versus the cARIMA models using the following criteria (a sketch of how some of these quantities can be computed is given after the list):

  • Is the highest t statistic of the td parameters, the slopes, in absolute value, larger than 1.96?

  • Is the p-value of the global $\chi^2$ statistic for the Wald test on the slopes smaller than 0.05?

  • Is the tdARIMA SBIC smaller than the corresponding cARIMA SBIC?

  • Is the tdARIMA residual standard deviation smaller than the corresponding cARIMA one?

  • Is the tdARIMA p-value of the Ljung-Box (LB) statistic for residual autocorrelation (with 48 lags) larger than the corresponding cARIMA one?

  • Is the tdARIMA MAPE (in percent), for 1 year (2017) and all horizons from 1 to 12, smaller than the corresponding cARIMA one?

  • Similarly, for rolling forecasts from December 2016, for horizons 1, 3, 6, and 12, is the tdARIMA MAPE (in percent) smaller than the corresponding cARIMA one?
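
As a rough illustration of how some of these descriptive criteria can be obtained from a fitted model's log-likelihood and residuals, here is a minimal Python sketch of ours (the MAPE was sketched above); it is not part of the software chain used in the study, and the Ljung-Box p-value uses 48 degrees of freedom without the usual correction for the number of estimated parameters, since the statistic is only used descriptively here.

```python
import numpy as np
from scipy import stats

def sbic(loglik, n_params, n_obs):
    """Schwarz Bayesian information criterion used to compare the fits."""
    return -2.0 * loglik + n_params * np.log(n_obs)

def ljung_box(residuals, lags=48):
    """Ljung-Box statistic on the residual autocorrelations and its chi-square
    p-value with `lags` degrees of freedom."""
    e = np.asarray(residuals, dtype=float)
    e = e - e.mean()
    n = len(e)
    k = np.arange(1, lags + 1)
    acf = np.array([np.sum(e[j:] * e[:-j]) for j in k]) / np.sum(e ** 2)
    q = n * (n + 2) * np.sum(acf ** 2 / (n - k))
    return q, stats.chi2.sf(q, df=lags)
```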

In an early stage of this project, the data were limited to 2016, no correction for outliers or trading days was applied, and only fixed-origin forecasts were considered. This gave worse results, which were indicative but not conclusive.

Note that one can object to the use of the Ljung-Box test statistic to compare models, especially here because there is no foundation for its limiting behavior under tdARIMA models. Like the other criteria, we use it as a descriptive indicator.


3. Empirical results

3.1 Two examples

Before showing the results, we consider two examples to justify the recourse to the class of tdARIMA models that is the object of this chapter.

The first example is taken from the first dataset: the index of land transportation (series TRTER) for Belgium, for the period from January 1985 to December 1994, see Figure 2. Two additive interventions were automatically considered, in May 1986 and in February 1992, respectively. Let I8605 and I9202 denote the corresponding binary variables. Otherwise, the series is taken in square roots and seasonally differenced. Let us denote by TRTER_TF the series transformed by all these operations. It is equal to $\nabla_{12}\left(\sqrt{TRTER_t} - b_{I8605}\, I8605_t - b_{I9202}\, I9202_t\right)$, up to a normalizing factor, and is shown in Figure 3. The partial autocorrelations of that series, shown in Figure 4, exhibit a truncation after lag 12. The usual Box-Jenkins analysis leads to the suggestion of a seasonal AR model. Adding time dependency to the AR coefficient then leads to the following model:

Figure 2.

The original series, the index of land transportation (TRTER).

Figure 3.

The transformed series (TRTER_TF).

Figure 4.

The partial autocorrelation function of the transformed series.

$$\left(1 - \Phi_{t,12} L^{12}\right)\left[\nabla_{12}\left(\sqrt{TRTER_t} - b_{I8605}\, I8605_t - b_{I9202}\, I9202_t\right) - \mu\right] = e_t,$$

where $\Phi_{t,12}$ is estimated by $-0.686 + 6.47 \times 10^{-3}\,(t - 60.5)$, and the estimates of $b_{I8605}$, $b_{I9202}$, and $\mu$ are, respectively, equal to $-30.9$, $24.8$, and $4.23$. The standard error corresponding to the slope of $\Phi_{t,12}$ is equal to $1.70 \times 10^{-3}$, so the associated Student statistic is equal to 3.8, hence the slope is significantly different from 0. To explain that significance, let us look at the partial autocorrelation at lag 12 of the transformed series TRTER_TF: for the first 4 years it is $-0.488$ and for the last 4 years it is $-0.391$. That explains the significantly positive slope of $\Phi_{t,12}$.

The second example is taken from the second dataset: the U.S. production index of clothing (B51212) for the period from January 1986 to January 2019, see Figure 5.

Figure 5.

The original series, the U.S. production index of clothing (B51212).

Tramo has adjusted the series for outliers and proposed a logarithmic transformation and both a regular and a seasonal difference, giving the transformed series B51212DS shown in Figure 6.

Figure 6.

The transformed B51212 series (B51212DS).

Subsequently, Tramo has suggested modeling the series by a seasonal ARIMA model with a regular autoregressive polynomial of degree 3 and a seasonal moving average. We fitted that model using ANSECH and obtained

$$\left(1 + 0.035 L - 0.142 L^2 - 0.249 L^3\right)\nabla\nabla_{12}\log\left(B51212_t\right) = \left(1 - 0.850 L^{12}\right) e_t.$$

Then, we replaced the constant coefficients with linear functions of time and the constant innovation variance with an exponential function of time. Omitting one by one the nonsignificant parameters at the 5% level, we finally obtained the following heteroscedastic model, but with constant autoregressive and moving average polynomials:

$$\left(1 + 0.041 L - 0.142 L^2 - 0.228 L^3\right)\nabla\nabla_{12}\log\left(B51212_t\right) = \left(1 - 0.855 L^{12}\right) g_t\, e_t,$$

where $g_t$ is given by $g_t = \exp\left(0.001753\,(t - 193)\right)$.

3.2 First empirical analysis

In the first experiment, the number of series is small and simple pure integrated autoregressive models are used.

Table 1 shows the main results, including those tdAR coefficients for which the test of zero slope leads to a rejection at the 5% level, and the corresponding t-statistic.

| Branch (Name) | Orders (p, d)(P, D) | ARI SBIC | ARI MAPE | tdARI SBIC | tdARI MAPE | Parameter | t-Value |
|---|---|---|---|---|---|---|---|
| Food, beverages (ALIBOR) | (3, 0)(0, 1) | 655 | 4.0 | 669 | 3.9 | none | |
| Other extraction of minerals (AUEXTR) | (2, 0)(0, 1) | 904 | 10.5 | 910 | 9.4 | AR2 | 2.8 |
| Wood-processing, furniture (BOIME) | (3, 1)(1, 1) | 778 | 6.4 | 792 | 5.9 | AR12 | 2.6 |
| Hosiery (BONNE) | (2, 0)(1, 1) | 702 | 9.1 | 692 | 5.5 | AR1, AR2, AR12 | 2.9, 8.7, 11.0 |
| Commerce (COMME) | (3, 0)(1, 1) | 553 | 1.7 | 557 | 2.4 | AR12 | 3.7 |
| Construction (CONST) | (3, 0)(0, 1) | 827 | 8.0 | 830 | 23.6 | AR2 | 2.8 |
| Petrol derivatives (DERPE) | (1, 1)(0, 0) | 888 | 5.2 | 891 | 5.2 | none | |
| Petrol distribution (DISPE) | (1, 0)(1, 0) | 941 | 10.8 | 951 | 10.7 | none | |
| Metal processing (FABME) | (2, 0)(0, 1) | 682 | 6.6 | 681 | 7.2 | AR2 | 2.1 |
| Manufacture of textiles (FILAT) | (2, 1)(0, 1) | 739 | 5.5 | 749 | 6.4 | none | |
| Gas production/distribution (GAZ) | (3, 0)(1, 1) | 897 | 2.1 | 891 | 2.0 | AR2, AR3 | 2.8, 4.5 |
| Construction materials (MATCO) | (1, 1)(1, 1) | 809 | 4.3 | 807 | 5.7 | AR13 | 3.3 |
| Nonmetallic manufacturing (NONFE) | (2, 1)(1, 1) | 826 | 5.5 | 830 | 9.4 | AR12 | 3.6 |
| Paper/paperboard industry (PAPCA) | (3, 0)(1, 1) | 746 | 5.9 | 758 | 5.8 | AR2 | 2.1 |
| Iron and steel (SIDER) | (3, 0)(1, 1) | 833 | 5.9 | 836 | 9.2 | AR1, AR12 | 4.0, 3.6 |
| Manufacture of tobacco (TABAC) | (3, 0)(1, 1) | 778 | 10.9 | 791 | 13.2 | AR3, AR12 | 3.1, 5.4 |
| Aviation (TRAER) | (3, 1)(1, 1) | 746 | 8.8 | 753 | 8.7 | AR12 | 6.3 |
| Maritime transportation (TRMAR) | (2, 1)(1, 1) | 732 | 3.6 | 740 | 2.6 | AR12, AR13 | 2.3, 3.6 |
| Land transportation (TRTER) | (0, 0)(1, 1) | 854 | 12.1 | 849 | 12.5 | AR12 | 2.6 |
| Manufacture of clothing (VETEM) | (2, 1)(0, 1) | 757 | 26.7 | 767 | 26.7 | none | |

Table 1.

For each branch of the economy, the table gives the orders (p, d)(P, D) of the model, the SBIC and the MAPE for the standard ARI model and for the tdARI model, the statistically significant slopes (ARk denotes $\phi_k'$), and the corresponding t-values.

For example, the $AR(2)(1)_{12}$ model of Eq. (5), shown in Section 2.3, is used for nonmetallic manufacturing, in addition to a regular and a seasonal difference. In that case, the standard ARI model is better than the tdARI model for SBIC (826 versus 830) and also provides better forecasts (MAPE = 5.5% versus 9.4%), although there is one significant slope at lag 12, with a Student statistic of 3.6. Even if tdARI models are not systematically better, they often produce better forecasts and sometimes show a better fit, or at least some statistically significant slope parameters at the 5% level. All the nonsignificant slopes were left in the models, which can explain why the SBIC criterion was generally worse for the tdARI models. Since this analysis was promising, we were led to consider a bigger dataset.

3.3 Second empirical analysis

The second empirical analysis bears on the 293 seasonally unadjusted time series of the U.S. industrial production dataset. We will show three tables of results. Table 2 presents a summary of the model specifications resulting from Tramo. For example, for 280 series out of 293, a regular difference was used, which accounts for 96% of the dataset. As said above, we preserved these specifications and all parameters in our cARIMA and tdARIMA models. For the tdARIMA models, slopes were added to all autoregressive and moving average coefficients at lags less than or equal to 13.

| Specification | # series | % | Note |
|---|---|---|---|
| Levels vs. logs | 70/223 | 24/76 | |
| Regular diff. vs. none | 280/13 | 96/4 | 4% with 2 differences |
| Seasonal diff. vs. none | 279/14 | 95/5 | 0% with 2 differences |
| Stationary vs. nonstationary | 0/293 | 0/100 | |
| Airline model vs. other | 84/209 | 29/71 | |
| Outliers vs. none | | 79/21 | 0–20, on average 2.63 |
| Trading day effect vs. none | | 47/53 | |
| Easter effect vs. none | | 40/60 | |
| ARMA parameters vs. none | | 100/0 | 1–7, on average 3.12 |

Table 2.

Summary of the model selections made by Tramo on the 293 U.S. industrial production series.

Table 3 is based on the initial tdARIMA models, possibly with nonsignificant parameters. We show the percentages for each criterion across the 293 series. For example, more than 50% of the series had at least one slope with a Student t value greater than 1.96 in absolute value. If we use the Wald test, which gives a sounder overall view, the hypothesis that all slopes are null is rejected at the 5% level for more than 44% of the series. If the series were randomly drawn from cARIMA processes, we should expect 5% of rejections, on average. Of course, because of the multiple-testing argument, the Student tests on the slopes would give a higher proportion of rejections; a simulation study will confirm this later. The use of the Wald test in the present context is therefore essential. Some results are partially disappointing but can be explained: only about 4% of the series have a smaller SBIC for the tdARIMA model, but this is mainly due to the useless parameters. About one-half of the series have a smaller residual standard deviation, and for more than 57% the test on residual autocorrelation, based on the Ljung-Box test with 48 lags, gives a better result.

| Criteria | Percentage | Notes | Percentage if td significant |
|---|---|---|---|
| Highest absolute t statistic of td parameters > 1.96 | 50.17 | (*) | |
| p-value of global test of stationarity < 0.05 | 44.71 | | 100.00 |
| tdARIMA SBIC < cARIMA SBIC | 04.10 | (**) | 9.16 |
| tdARIMA residual std. dev. < cARIMA | 49.15 | | 61.83 |
| tdARIMA LB p-value > cARIMA | 57.68 | | 54.20 |
| tdARIMA forecasting MAPE < cARIMA | 47.44 | | 45.04 |
| tdARIMA h = 1 rolling forecasts MAPE < cARIMA | 32.76 | | 31.30 |
| tdARIMA h = 3 rolling forecasts MAPE < cARIMA | 32.42 | | 29.01 |
| tdARIMA h = 6 rolling forecasts MAPE < cARIMA | 37.88 | | 40.46 |
| tdARIMA h = 12 rolling forecasts MAPE < cARIMA | 36.86 | | 37.40 |

Table 3.

For each criterion, the percentages of improvement from cARIMA models to tdARIMA models are given over the 293 U.S. series.

Notes: (*) Statistically significant slope parameters at the 5% level, (**) Nonsignificant parameters were not omitted.

If we retain only the series for which the Wald test rejected the constancy of the coefficients, the percentage of smaller SBIC values for the tdARIMA models is only slightly higher, at about 9%, and the percentage of smaller residual standard deviations reaches 61%. The percentage for the Ljung-Box test is lower; indeed, the theory for that test was never developed for tdARIMA models. Forecasting performance was evaluated using the MAPE criterion. For fixed-origin forecasts, about 47% of the series have a smaller MAPE for the tdARIMA models than for the cARIMA models. Among the series for which time dependency is retained, only 45% benefit from better forecasts. For rolling forecasts at various horizons, the percentages are even smaller, in particular for horizons of 1 and 3 months, and they are about the same whether or not the Wald test rejects constancy. That means that, even if the introduction of time dependency improves the fit, it does not improve the forecasts. Recall that, at this stage, the tdARIMA models may have many statistically nonsignificant slopes.

For Table 4, starting from the full tdARIMA models of Table 3, we omitted, one by one, the least significant slope at the 5% level, see Figure 1. In the end, all remaining slopes are thus significantly different from 0. This was done in an automated way in order to avoid mistakes. We will refer to these models as parsimonious tdARIMA models. Of course, the cARIMA models are the same as before, essentially those given by Tramo but estimated with more digits of accuracy. We notice that the percentage of series with at least one statistically significant slope, 54.61%, differs slightly from the percentage of rejections of the Wald test on all the slopes, 54.27%. Indeed, for one series (G325A4, chemicals except pharmaceuticals and medicines), there are two slightly significant slopes, but the global test does not reject their nullity, although the p-value is close to 0.05. In any case, these percentages of improved tdARIMA models are slightly higher than in Table 3.

| Criteria | Percentage | Notes | Percentage if td significant |
|---|---|---|---|
| Highest absolute t statistic of td parameters > 1.96 | 54.61 | (*) | |
| p-value of global test of stationarity < 0.05 | 54.27 | | 100.00 |
| tdARIMA SBIC < cARIMA SBIC | 18.77 | (**) | 34.59 |
| tdARIMA residual std. dev. < cARIMA | 38.23 | | 70.44 |
| tdARIMA LB p-value > cARIMA | 27.65 | | 50.31 |
| tdARIMA forecasting MAPE < cARIMA | 28.33 | | 51.57 |
| tdARIMA h = 1 rolling forecasts MAPE < cARIMA | 20.48 | | 37.74 |
| tdARIMA h = 3 rolling forecasts MAPE < cARIMA | 17.75 | | 32.70 |
| tdARIMA h = 6 rolling forecasts MAPE < cARIMA | 18.77 | | 34.59 |
| tdARIMA h = 12 rolling forecasts MAPE < cARIMA | 21.84 | | 40.25 |

Table 4.

For each criterion, the percentages of improvement from cARIMA models to tdARIMA models are given over the 293 U.S. series. The last column contains percentages conditional on the rejection of the nullity of all the slopes by the Wald test.

Notes: (*) Statistically significant slope parameters at the 5% level, (**) Contrary to Table 3, nonsignificant slope parameters were omitted one by one until all the remaining ones were statistically significant.

The fitting results are partially better, with more than 18% of the series having a smaller SBIC for the tdARIMA models (34% if we condition on the rejection of the Wald test). Some results are worse, however: 38% for the residual standard deviation instead of 49% for the fully parameterized models (respectively 70% and 61%, if we condition on the rejection of the Wald test), and 27% for the residual autocorrelation instead of 57% for the full models (respectively 50% and 54%, if we condition on the rejection of the Wald test).

Strangely, the forecasting performance with a fixed origin is worse for the parsimonious models than for the full models, with the percentage of improvement of tdARIMA models with respect to cARIMA models equal to 28% instead of 47% (respectively 51% and 45%, if we condition on the rejection of the Wald test). That means that the omitted slopes seem to contribute to the forecasting performance, but that, among the series with time-dependent coefficients, about one-half provide better forecasts. The picture for rolling forecasts is again worse for the parsimonious models, with smaller percentages of improvement in the range of 17-22%, depending on the horizon, instead of 32-37% for the full models, but again similar under the condition of rejection of the Wald test (respectively 32-40% instead of 29-40%). Surprisingly, the percentages are systematically higher for horizons of 6 and 12 months than for those of 1 and 3 months.

One can object that introducing time dependency can lead to some overfitting: a certain proportion of the tests of nullity of the slopes $\phi_k'$ or $\theta_k'$ can lead to rejection, about 5% when there is only one slope, and more otherwise.

To try to answer that natural question, we artificially generated 320 series using cARIMA models, with the same length of 372 observations, again leaving out the last 12 values. We used an airline model for that purpose, instead of the large variety of models fitted by Tramo-Seats. Then we added time dependency and proceeded exactly as before (a sketch of how such artificial series can be generated is given after Table 5). The results are shown in Table 5. The percentage of 14.06 for the first criterion (instead of the expected 5%) shows that our rough examination of the largest |t| value should indeed be replaced by a simultaneous test on the td parameters, as we did. For SBIC, there are many superfluous parameters, as could be guessed. But about one-half of the tdARIMA models give smaller residual standard deviations, less residual autocorrelation, and smaller forecast errors than their cARIMA counterparts, as expected.

| Criteria | Percentage, artificial series | Percentage, U.S. series |
|---|---|---|
| Highest absolute t statistic of td parameters > 1.96 | 14.06 | 50.17 |
| tdARIMA SBIC < cARIMA SBIC | 00.00 | 04.10 |
| tdARIMA residual std. dev. < cARIMA | 42.81 | 49.15 |
| tdARIMA LB p-value > cARIMA | 43.75 | 57.68 |
| tdARIMA forecasting MAPE < cARIMA | 47.81 | 47.44 |

Table 5.

For each criterion, the percentages of improvement from cARIMA models to tdARIMA models are given over the 320 artificial series. The last column contains the corresponding percentages obtained for the U.S. series, taken from Table 3.
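
For readers who want to reproduce a similar exercise, here is a sketch of ours of how artificial airline-model series can be generated; the MA parameter values, the burn-in length, and the random seed are illustrative assumptions, not the settings used in the study.

```python
import numpy as np

rng = np.random.default_rng(12345)

def simulate_airline(n, theta=0.4, Theta=0.6, sigma=1.0, s=12):
    """Simulate an 'airline' ARIMA(0,1,1)(0,1,1)_s series:
    (1 - L)(1 - L^s) y_t = (1 - theta L)(1 - Theta L^s) e_t."""
    burn = 5 * s                       # burn-in to attenuate start-up effects
    e = rng.normal(0.0, sigma, size=n + burn)
    # moving-average part: w_t = e_t - theta e_{t-1} - Theta e_{t-s} + theta Theta e_{t-s-1}
    w = e.copy()
    w[1:] -= theta * e[:-1]
    w[s:] -= Theta * e[:-s]
    w[s + 1:] += theta * Theta * e[:-s - 1]
    y = np.cumsum(w)                   # invert the regular difference (1 - L)
    for k in range(s):                 # invert the seasonal difference (1 - L^s)
        y[k::s] = np.cumsum(y[k::s])
    return y[burn:]

series = simulate_airline(372)         # length 372, as the fitted part of the U.S. series
```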

The results show that, for a majority of series, there is (i) at least one statistically significant slope parameter at the 5% level, (ii) rejection of the nullity of all the slopes using a Wald test, which provides better-founded results than the t-tests, (iii) a smaller residual standard deviation, and (iv) less residual autocorrelation. This is true for the full tdARIMA model specifications but also, at least partly, for the more parsimonious tdARIMA models obtained by omitting, one by one, the statistically nonsignificant slopes. At least it is true conditionally on significant time dependency, i.e., when the Wald test rejects the constancy of the coefficients.

The results for the SBIC criterion are not good. For the full tdARIMA models, an explanation is the presence of nonsignificant slope parameters; the problem remains, however, for the parsimonious models. The only unsatisfactory aspect of tdARIMA models is that they fail to improve the forecasts for a majority of the series. Indeed, the results confirm that only about one-third of the "time-dependent series", i.e., those series which have at least one statistically significant slope parameter, provide better forecasts with a tdARIMA model than with a cARIMA model.

We had already observed similar results with slightly shorter series of the big dataset, when the outliers and trading day effects were not handled. Consequently, the presence of outliers or trading day effects is not the cause of the better fits of the tdARIMA models, as we had feared. A common feature is nevertheless that the forecasts are not improved by replacing the cARIMA models with tdARIMA models. This is surprising, although we know that a better fit is not a guarantee of better forecasts. It should be investigated why the forecasts from tdARIMA models seem to be worse for the U.S. series. Of course, it could be due to a global change in 2016.


4. Conclusions

It took several decades to go from ARIMA models with constant coefficients to suitable and powerful generalizations with coefficients that vary deterministically with time. We have shown the usefulness of the approach for dealing with official statistics time series, which generally have a seasonal component.

We used linear functions of time. We do not expect functions other than linear ones to be useful, given the inconvenience of adding many parameters, except perhaps if we exploit the fact that, since 2019, most of the series in the dataset are available before 1986, often since 1972, and sometimes earlier.

Finally, one weak point in the analysis is that the detection of outliers and trading day effects, and hence the linearization of the time series, is based on cARIMA models. If the time dependency of the coefficients becomes serious for very long official time series, it would be worth trying to extend the features of Tramo to tdARIMA models, e.g., to detect outliers simultaneously with the estimation of the time-dependent coefficients of the ARIMA model.

On the other hand, it would also be interesting to be able to conclude that traditional cARIMA models are enough to forecast very long time series and that no substantial gain can be obtained by considering tdARIMA models.

It could be interesting to repeat the analysis with other datasets, quarterly or preferably monthly, like those maintained by Eurostat. Good candidates would be in industry, trade, and services: short-term business statistics, production, turnover, etc. A U.S. database like FRED (https://research.stlouisfed.org/econ/mccracken/fred-databases/) could also be considered.


Acknowledgments

I thank Agustín Maravall (for his help in producing linearized time series in Tramo), Rajae Azrak (for her contributions to the theory), Ahmed Ben Amara (for his contribution to a very first version of a part of the program chain which includes Tramo-Seats and Microsoft Excel Visual Basic modules, in addition to our specialized code for estimating tdARIMA models), and Dario Buono, Team Leader of Methodology, Eurostat, Unit B1 (for exchanges, suggestions, and encouragement on previous versions of this chapter). Finally, I thank the editors for their remarks and Mrs. Karla Skuliber, the author service manager, for her efficiency.


Conflict of interest

I declare there is no conflict of interest.

References

  1. Van Bellegem S, von Sachs R. Forecasting economic time series with unconditional time-varying variance. International Journal of Forecasting. 2004;20:611-627. DOI: 10.1016/j.ijforecast.2003.10.002
  2. Kapetanios G, Marcellino M, Venditti F. Large time-varying parameter VARs: A nonparametric approach. Journal of Applied Econometrics. 2019;34:1027-1049. DOI: 10.1002/jae.2722
  3. Dahlhaus R. Maximum likelihood estimation and model selection for locally stationary processes. Journal of Nonparametric Statistics. 1996;6:171-191. DOI: 10.1080/10485259608832670
  4. Dahlhaus R. Fitting time series models to nonstationary processes. Annals of Statistics. 1997;25:1-37. DOI: 10.1214/aos/1034276620
  5. Proietti T, Lütkepohl H. Does the Box-Cox transformation help in forecasting macroeconomic time series? International Journal of Forecasting. 2013;29:88-99. DOI: 10.1016/j.ijforecast.2012.06.001
  6. Gómez V, Maravall A. Automatic modelling methods for univariate series. In: Peña D, Tiao GC, Tsay RS, editors. A Course in Time Series Analysis. New York: Wiley; 2001. pp. 171-201. DOI: 10.1002/9781118032978.ch7
  7. Box GEP, Jenkins GM, Reinsel GC, Ljung GM. Time Series Analysis, Forecasting and Control. 5th ed. New York: Wiley; 2015. xxvi+669 p
  8. Azrak R, Mélard G. Asymptotic properties of quasi-maximum likelihood estimators for ARMA models with time-dependent coefficients. Statistical Inference for Stochastic Processes. 2006;9:279-330. DOI: 10.1007/s11203-005-1055-6
  9. Dahlhaus R. A likelihood approximation for locally stationary processes. Annals of Statistics. 2000;28:1762-1794. DOI: 10.1214/aos/1015957480
  10. Dahlhaus R. Locally stationary processes. In: Subba Rao T, Subba Rao S, Rao CR, editors. Handbook of Statistics, Volume 30: Time Series Analysis: Methods and Applications. Amsterdam: Elsevier; 2012. pp. 145-159. DOI: 10.1016/B978-0-444-53858-1.00013-2
  11. Alj A, Azrak R, Ley C, Mélard G. Asymptotic properties of QML estimators for VARMA models with time-dependent coefficients. Scandinavian Journal of Statistics. 2017;44:617-635. DOI: 10.1111/sjos.12268
  12. Azrak R, Mélard G. Asymptotic properties of conditional least-squares estimators for array time series. Statistical Inference for Stochastic Processes. 2021;24:525-547. DOI: 10.1007/s11203-021-09242-8
  13. Mélard G, Pasteels J-M. User’s Manual of Time Series Expert (TSE Version 2.3). Brussels: Institut de Statistique, Université Libre de Bruxelles; 1998. Available from: https://dipot.ulb.ac.be/dspace/retrieve/829842/TSE23E.PDF [Accessed: October 17, 2022]
  14. Mélard G. The likelihood function of a time-dependent ARMA model. In: Anderson OD, Perryman MR, editors. Applied Time Series Analysis. Amsterdam: North-Holland; 1982. pp. 229-239
  15. Mélard G. An indirect proof for the asymptotic properties of VARMA model estimators. Econometrics and Statistics. 2022;21:96-111. DOI: 10.1016/j.ecosta.2020.12.004
  16. Olea R, Palma W, Rubio P. Package LSTS. 2015. Available from: https://cran.r-project.org/web/packages/LSTS/LSTS.pdf [Accessed: October 17, 2022]
  17. Palma W, Olea R, Ferreira G. Estimation and forecasting of locally stationary processes. Journal of Forecasting. 2013;32:86-96. DOI: 10.1002/for.1259
  18. Azrak R, Mélard G. Autoregressive models with time-dependent coefficients - a comparison between several approaches. Stats. 2022;5:784-804. DOI: 10.3390/stats5030046
  19. Van Bellegem S, Dahlhaus R. Semiparametric estimation by model selection for locally stationary processes. Journal of the Royal Statistical Society Series B. 2006;68:721-746. DOI: 10.1111/j.1467-9868.2006.00564.x
