ssarima() - State-Space ARIMA

Ivan Svetunkov

2025-07-01

SSARIMA stands for “State-space ARIMA” or “Several Seasonalities ARIMA”. Both names show what happens in the heart of the function: it constructs ARIMA in a state-space form and allows to model several (actually more than several) seasonalities. ssarima() is a function included in smooth package. This vignette covers ssarima() and auto.ssarima() functions. For more details about the underlying model, read (Svetunkov and Boylan 2019).

As usual, we will use data from Mcomp package, so it is advised to install it.

Let’s load the necessary packages:

require(smooth)

The default call constructs ARIMA(0,1,1):

ssarima(AirPassengers, h=12, silent=FALSE)
## Time elapsed: 0.01 seconds
## Model estimated using ssarima() function: SSARIMA(0,1,1)
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 700.9087
## ARMA parameters of the model:
##        Lag 1
## MA(1) 0.4064
## 
## Sample size: 144
## Number of estimated parameters: 2
## Number of degrees of freedom: 142
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1405.817 1405.902 1411.757 1411.968

Some more complicated model can be defined using parameter orders the following way:

ssarima(AirPassengers, orders=list(ar=c(0,1),i=c(1,0),ma=c(1,1)), lags=c(1,12), h=12)
## Time elapsed: 0.04 seconds
## Model estimated using ssarima() function: SSARIMA(0,1,1)[1](1,0,1)[12]
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 555.2182
## ARMA parameters of the model:
##       Lag 12
## AR(1) 1.0919
##         Lag 1 Lag 12
## MA(1) -0.4177 -0.192
## 
## Sample size: 144
## Number of estimated parameters: 4
## Number of degrees of freedom: 140
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1118.437 1118.724 1130.316 1131.031

This would construct seasonal ARIMA(0,1,1)(1,0,1)\(_{12}\).

We could try selecting orders manually, but this can also be done automatically via auto.ssarima() function:

auto.ssarima(AirPassengers, h=12)
## Time elapsed: 1.57 seconds
## Model estimated using ssarima() function: SSARIMA(0,1,3)[1](0,1,0)[12]
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 557.8991
## ARMA parameters of the model:
##         Lag 1
## MA(1) -0.3838
## MA(2)  0.1251
## MA(3) -0.2212
## 
## Sample size: 144
## Number of estimated parameters: 4
## Number of degrees of freedom: 140
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1123.798 1124.086 1135.678 1136.393

Automatic order selection in SSARIMA with optimised initials does not work well and in general is not recommended. This is partially because of the possible high number of parameters in some models and partially because of potential overfitting of first observations when non-zero order of AR is selected:

auto.ssarima(AirPassengers, h=12, initial="backcasting")
## Time elapsed: 1.63 seconds
## Model estimated using ssarima() function: SSARIMA(0,1,3)[1](0,1,0)[12]
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 557.8991
## ARMA parameters of the model:
##         Lag 1
## MA(1) -0.3838
## MA(2)  0.1251
## MA(3) -0.2212
## 
## Sample size: 144
## Number of estimated parameters: 4
## Number of degrees of freedom: 140
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1123.798 1124.086 1135.678 1136.393
auto.ssarima(AirPassengers, h=12, initial="optimal")
## Time elapsed: 10.13 seconds
## Model estimated using ssarima() function: SSARIMA(0,1,2)[1](0,1,0)[12] with drift
## With optimal initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 554.0551
## Intercept/Drift value: 0.3658
## ARMA parameters of the model:
##         Lag 1
## MA(1) -0.3267
## MA(2)  0.0118
## 
## Sample size: 144
## Number of estimated parameters: 17
## Number of degrees of freedom: 127
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1142.110 1146.967 1192.597 1204.667

As can be seen from the example above the model with optimal initials takes more time and we end up with a different model than in the case of backcasting.

A power of ssarima() function is that it can estimate SARIMA models with multiple seasonalities. For example, SARIMA(0,1,1)(0,0,1)_6(1,0,1)_12 model can be estimated the following way:

ssarima(AirPassengers, orders=list(ar=c(0,0,1),i=c(1,0,0),ma=c(1,1,1)), lags=c(1,6,12), h=12, silent=FALSE)

It probably does not make much sense for this type of data, it would make more sense on high frequency data (for example, taylor series from forecast package). However, keep in mind that multiple seasonal ARIMAs are very slow in estimation and are very capricious. So it is really hard to obtain an appropriate and efficient multiple seasonal ARIMA model. To tackle this issue, I’ve developed an alternative ARIMA model for multiple seasonalities, called msarima().

Now let’s introduce some artificial exogenous variables:

x <- cbind(rnorm(length(AirPassengers),50,3),rnorm(length(AirPassengers),100,7))

If we save model:

ourModel <- auto.ssarima(AirPassengers, h=12, holdout=TRUE, xreg=x)

we can then reuse it:

ssarima(AirPassengers, model=ourModel, h=12, holdout=FALSE, xreg=x, interval=TRUE)
## Time elapsed: 0 seconds
## Model estimated using ssarima() function: SSARIMAX(0,1,3)[1](0,1,0)[12] with drift
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 558.4561
## Intercept/Drift value: 0.5096
## ARMA parameters of the model:
##         Lag 1
## MA(1) -0.3096
## MA(2)  0.1363
## MA(3) -0.2296
## 
## Sample size: 144
## Number of estimated parameters: 3
## Number of degrees of freedom: 141
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1122.912 1123.084 1131.822 1132.248

Finally, we can combine several SARIMA models:

ssarima(AirPassengers, h=12, holdout=FALSE, interval=TRUE, combine=TRUE)
## Time elapsed: 0.01 seconds
## Model estimated using ssarima() function: SSARIMA(0,1,1)
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 700.9087
## ARMA parameters of the model:
##        Lag 1
## MA(1) 0.4064
## 
## Sample size: 144
## Number of estimated parameters: 2
## Number of degrees of freedom: 142
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1405.817 1405.902 1411.757 1411.968

MSARIMA

While SSARIMA is flexible, it is not fast. In fact, it cannot handle high frequency data well and most probably will take ages to estimate the parameter and produce forecasts. This is because of the transition matrix, which becomes huge in case of multiple seasonalities. The MSARIMA model (Multiple Seasonal ARIMA) is formulated in a different state-space form, which reduces the size of transition matrix, significantly reducing the computational time for cases with high frequency data.

There are auto.msarima() and msarima() function in the package, that do things similar to auto.ssarima() and ssarima(). Here’s just one example of what can be done with it:

msarima(AirPassengers, orders=list(ar=c(0,0,1),i=c(1,0,0),ma=c(1,1,1)),lags=c(1,6,12),h=12, silent=FALSE)
## Time elapsed: 0.06 seconds
## Model estimated using msarima() function: SARIMA(0,1,1)[1](0,0,1)[6](1,0,1)[12]
## With backcasting initialisation
## Distribution assumed in the model: Normal
## Loss function type: likelihood; Loss function value: 550.0145
## ARMA parameters of the model:
##       Lag 12
## AR(1)  1.116
##         Lag 1  Lag 6  Lag 12
## MA(1) -0.4028 0.0926 -0.3002
## 
## Sample size: 144
## Number of estimated parameters: 5
## Number of degrees of freedom: 139
## Information criteria:
##      AIC     AICc      BIC     BICc 
## 1110.029 1110.464 1124.878 1125.958

The forecasts of the two models might differ due to the different state space form. The detailed explanation of MSARIMA is given in Chapter 9 of ADAM textbook.

References

Svetunkov, Ivan, and John E. Boylan. 2019. State-space ARIMA for supply-chain forecasting.” International Journal of Production Research 0 (0): 1–10. https://doi.org/10.1080/00207543.2019.1600764.