Time series analysis and forecasting with ECOTOOL

Authors: Diego J. Pedregal ^aff001
Authors place of work: ETSI Industriales, Universidad de Castilla-La Mancha, Ciudad Real, Spain ^aff001
Published in the journal: PLoS ONE 14(10)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0221238

Summary

This paper presents ECOTOOL, a new free MATLAB toolbox that embodies several routines for identification, validation and forecasting of dynamic models. The toolbox includes a wide range of exploratory, descriptive and diagnostic statistical tools with visual support, designed in easy-to-use Graphical User Interfaces. It also incorporates complex automatic procedures for identification, exact maximum likelihood estimation and outlier detection for many types of models available in the literature (like multi-seasonal ARIMA models, transfer functions, Exponential Smoothing, Unobserved Components, VARX). ECOTOOL is the outcome of a long period of programming effort with the aim of producing a user friendly toolkit such that, just a few lines of code written in MATLAB are able to perform a comprehensive analysis of time series. The toolbox is supplied with an in-depth documentation system and online help and is available on the internet. The paper describes the main functionalities of the toolbox, and its power is shown working on several real examples.

Keywords:

Carbon dioxide – Seasons – Polynomials – Electricity – Random walk – Graphical user interfaces – Transfer functions

Introduction

The rapid development of Information and Communication Technologies has open the door to the use of massive amounts of data in virtually any area of science and industry. When trying to forecast such amount of information, there is an imperative need for methods and tools capable of providing automatic, general, efficient and reliable solutions in many different contexts.

One of such tools for time series analysis and forecasting is ECOTOOL, a new MATLAB toolbox introduced in this paper. It includes routines for well-know methods, like regression, ARIMA(X), Transfer Functions, VAR(X), ExponenTial Smoothing (ETS), but it also includes less common methods, mainly Unobserved Components models (UC). It offers abundant descriptive and diagnostic statistics in friendly Graphical User Interfaces (GUI), automatic procedures of identification and outliers detection, auxiliar functions for building calendar typical intervention variables, additional tools to make routine operations easy and a long list of demos.

There are some commercial and open-source pieces of software similar and complementary to ECOTOOL: Econometrics and GARCH official MATLAB toolboxes [1], CAPTAIN [2], SCA [3], TRAMO/SEATS [4], forecast package in R [5], gretl [6], STAMP [7], Eviews [8], SAS [9], Stata [10], etc.

ECOTOOL gathers in a single toolbox many flexible features scattered among other packages that served as inspiration. Many users might find similarities with other pieces of software, mainly CAPTAIN, SCA, TRAMO/SEATS and the forecast package. Most of such similarities have to do with visualisation interfaces, treatment of missing data, automatic identification methods, outliers detection procedures, diagnostic testing, etc.

But, despite all the similarities with previous work, there are some unique features of ECOTOOL that very rarely are found elsewhere, to the author knowledge. Six most important are: i) automatic identification routines for multi seasonal ARIMA models; ii) estimation of UC and ETS models with inputs modeled as linear transfer functions (also linear regressions, as they are particular cases of transfer functions); iii) estimation of most models by Exact Maximum Likelihood, this is usual for many types of models, but rather unusual for ETS, see e.g. [11]; iv) automatic outliers detection in UC and ETS models; v) particular handy ways to specify transfer function and ARIMA models, since they are directly passed to ECOTOOL as string variables very similar to the analytical expression written on a paper (see the Toolbox overview section); vi) forecasting transfer function models with stochastic inputs taking into account the uncertainty around univariate forecasts of input variables.

Further advantages of ECOTOOL, shared with other packages, is that it is integrated with the rest of toolboxes available in MATLAB as a unified environment and it is freely available at https://github.com/djpt999/ECOTOOL.

The methods implemented in ECOTOOL have potential applications in many areas of research, from economics to engineering, including many other areas related to life and social sciences or environmental sciences, see e.g., [2, 12–19]

The rest of the paper is organised as follows. Next section shows the main methods available in ECOTOOL. Then, a quick guide to the most important features of the toolbox is presented. Afterwards, several real examples are discussed in depth to show the capabilities of ECOTOOL. Finally, the paper concludes with some relevant remarks.

Models and methods implemented in ECOTOOL

ECOTOOL offers the possibility of identifying, estimating, testing and forecasting time series by methods with different degrees of complexity, from pure univariate to full multivariate systems. The next sub-sections show the full list of methods implemented.

Naïve methods

The simplest methods, commonly used as benchmarks, are a set of naïve methods, i.e., mere rules of thumb that needs just simple computations or no computations at all. They are strictly correct under some severe restricting assumptions, but usually most of them have no meaning at all for many time series.

Let’s call y_t, t = 1, 2, …, T a time series; s the seasonal period; l the forecasting horizon and ⌊x⌋ the integer part of x. Then, the l steps ahead forecasts conditional on all information available up to time T (i.e., y ^ T + l) is estimated by each of the naïve methods as shown in Table 1. It is assumed in general that if more sophisticated methods are better in forecasting terms, they should outperform all these naïve options. Mean forecasts are just the mean of the in-sample data; RW forecasts are just the last observation propagated into the future; seasonal RW is the repetition of the last seasonal cycle available; mean seasonal RW is the mean of the last seasonal cycle of data; drift is a linear forecast with a slope calculated by joining the last and the first observations; mean drift is similar but the slope is based on the mean seasonal RW forecast.

**Tab. 1. Naïve methods implemented in function modelNAIVE.**

All these naïve methods are implemented in the function modelNAIVE.

ARIMA

The ARIMA models implemented in ECOTOOL in the function modelTF are rather general [12], since it allows for multiple seasonal polynomials. This is what some authors have called multi-seasonal ARIMA models. The general formulation is in Eq (1), where z_t is a stationary time series; B is the back-shift operator such that B^l z_t = z_t−l; θ Q i ( B s i ) and ϕ P i ( B s i ) are moving average and autoregressive polynomials of order Q_i and P_i, respectively, in which the exponent of the backshift operator in each summand is a multiple of the seasonal frequency, s_i (i = 1, 2, …, k); and a_t is a Gaussian serially independent white noise with zero mean and constant variance.

Time series z_t in Eq (1) is assumed stationary, but usually is the result of applying the differencing operators to a non stationary time series y_t, as in Eq (2).

ECOTOOL offers an automatic identification procedure inspired in [5] applied to multi seasonal ARIMA models. It is effectively a much more complex process than [14] in the sense that identification is fully automatic and that it allows for moving average terms. As far as the author is concerned, this is the first time such identification procedure is implemented successfully in a time series package for multi seasonal models (see the ‘Case studies’ section below).

Unobserved Components (UC)

ECOTOOL allows for UC models known as a Basic Structural Model of Harvey in the function modelUC [13]. The formulation of this model is shown in Eq (3), where a time series y_t is decomposed into a long term trend (T_t), a seasonal component (S_t) and an irregular component (I_t).

Particular formulations are possible by selecting different alternatives of each component. For trend models, the possibilities are condensed in Eq (4), where T t * is referred to as the trend ‘slope’, α is a parameter between zero and one, and η_t and η t * are independent white noise sequences with variances σ η 2 and σ η * 2, respectively.

This model is called Generalised Random Walk Trend and subsumes the following specific cases implemented in ECOTOOL: i) Random Walk (RW), by eliminating the second equation and α = 1; ii) Integrated Random Walk (IRW) with α = 1 and σ η 2 = 0; iii) Smoothed Random Walk (SRW) with σ η 2 = 0; and iv) Local Linear Trend (LLT) with α = 1, see e.g., [13], [2] and [20].

Seasonal components are included as the so called dummy seasonality [13], that depends on a single parameter, namely the variance of white noise ω_t, see Eq (5).

The full UC model is built by assembling all the State Space models for the components. As a matter of fact, Eq (3) is the observation equation of a State Space system and the block concatenation of all models for the trend in Eq (4) and seasonal in Eq (5) form the transition equation. The Kalman Filter and other recursive algorithm provides the optimal solution to state estimation, see details in [2, 13, 20].

ExponenTial Smoothing (ETS)

The ETS models implemented in function modelETS are taken from [11], see Table 2 for the full list of possibilities. Multiplicative forms are possible by using the log transformation.

**Tab. 2. ETS type of components in modelETS.**

In Table 2, l_t, b_t, S_t and e_t stand for the level, slope, seasonal and irregular components respectively; and α, β, γ and ϕ are unknown parameters that should be estimated. The code is composed of two letters and defines the model components, the first letter specifies the trend and the second letter is reserved for the seasonal component. ‘N’ indicates that the component is not present; ‘A’ implies that the component is additive; ‘D’ implies a damped component, applicable to trends only (with 0 < ϕ < 1). When specifying an ETS model with a seasonal component, the two letters ought to be followed by a number indicating the fundamental period in samples per cycle.

Transfer function (TF)

Multiple Input Single Output TF models are implemented in ECOTOOL by means of the modelTF function. The model may be written as in Eq (6).

In this equation, ω n i ( B ) = ( ω 0 + ω 1 B + ω 2 B 2 + ⋯ + ω n i B n i ) and δ m i ( B ) = ( 1 + δ 1 B + δ 2 B 2 + ⋯ + δ m i B m i ) are polynomials in the backshift operator of order n_i and m_i, respectively; and N(B)a_t is a general representation for any of the univariate alternatives in ECOTOOL, namely ARIMA, UC, and ETS. Mind that linear regression is a particular case of model (6) if all numerator and denominator polynomials are of order zero.

When the noise model in Eq (6) is specified as an AR, ARMA or ARIMA model, it is possible to transform the TF model into an ARX, ARMAX or ARIMAX, by setting appropriate constraints on the polynomials. This operation is straightforward in ECOTOOL due to the way the models are specified (see section Case studies).

While linear regression or TF models with ARIMA noise have been used abundantly for a long time since the first edition of [12] in 1970, ETS or UC combined with TF models is a unique feature of ECOTOOL toolbox, as far as the author is concerned. Taking advantage of this feature, function modelTF also allows for the automatic detection of four types of outliers in ARIMA, UC and ETS models (see Case studies section). Once more, automatic identification of outliers in UC or ETS models is a unique feature of ECOTOOL.

VARX

VARX models in Eq (7) are the multivariate option implemented in ECOTOOL by means of the function modelVARX, where boldface letters indicate either matrices or vectors.

In this equation, z_t stands for a vector of p stationary outputs (generally obtained by appropriate differencing of corresponding vector of non-stationary time series y_t); u_t represent a set of m inputs; Φ_i are a set of squared matrices of dimension p × p; Ω_j are a set of matrices of dimension p × m; and a_t are a vector of p white noises with zero mean and non-diagonal covariance matrix Γ.

Optimal estimation of unconstrained VARX models is easy, since equation by equation estimation by least squares renders consistent and efficient estimates. ECOTOOL allows for imposing constraints on the coefficients for which iterative generalised least squares are used to reach efficient estimation with a number of iterations controlled by the user [16].

Toolbox overview

ECOTOOL is user oriented in the sense that the coding effort demanded from the user is reduced to a minimum at the cost of the programmer elaborating long and comprehensive functions. The result is that it is possible to carry out an exhaustive analysis of time series with the recourse to a few functions. The main ones are listed on Table 3 in separated sections showing the GUI and demo tools, model functions, functions for the generation of calendar effects dummies and general purpose functions. The real number of functions in ECOTOOL is actually much greater because all the available options in the menus of the toolTEST GUI are actually implemented as separate functions that may be run independently, see the documentation. Some explanations about this functions are included in the next paragraphs and the way to use them is illustrated in the worked examples below.

The main features of ECOTOOL are:

As shown in the previous section, a number of time series methods are implemented, ranging from simple naïve univariate models to multivariate methods.
Extensive and detailed documentation is available. All functions are provided with a thorough help accessible in the usual way. Eleven detailed demos with extensive explanations that show all the properties of the toolbox are also provided (accessed by ECOTOOLdemos).
The design of the toolbox is such that it is possible to perform a full time series analysis with just a few MATLAB instructions. In this way, the memory effort demanded from the user is reduced to a minimum. In addition, function names are selected following mnemonic rules such that they are easy to remember and easy to look for.
The toolbox is composed of four types of functions, see Table 3: i) GUI tools to perform several tasks that are named as tool* (where ‘*’ stands for a name, at the moment two are available, toolTEST and toolFORECAST, see below); ii) Modeling methods that are named as model* (like modelNAIVE, modelTF, etc.); iii) functions that facilitates building dummy variables to deal with calendar effects; and iv) general purpose functions to perform a number of important tasks when dealing with time series analysis, like transforming the data (standardising, differencing, Box-Cox variance transforming, etc.) or doing other tasks (filtering and differencing vector time series, calculating convolutions of multivariate polynomials, calculating roots of multivariate polynomials, etc.).
Specification of models is rather simple and flexible. All functions for estimation of models, i.e., model* functions, are written with a common syntax in order to make the model specification task easier to the user. Besides, in the case of modelTF for TF or ARIMA model estimation and forecasting, the way the models are specified is in fact very similar to its analytical expression according to Eqs (1) and (2). For example, the MATLAB code for specifying an ARIMA(0, 1, 2) is ‘(1+ma1*B+ma2*B2)/(1-B)’, where ma1 and ma2 stand for two arbitrary names chosen to label the moving average coefficients and B is the back-shift operator.

One advantage of this feature is that imposing constraints among parameters are straightforward. For example, if in the previous model for a given dataset the constraint ma1 = ma2 wanted to be imposed, the model code would be ‘(1+ma1*B+ma1*B2)/(1-B)’ instead, and only the ma1 parameter would be estimated.
Either conditional or exact Maximum Likelihood (ML) estimation of ARIMA models are available, see e.g., [12]. Conditional ML is always used as a mean to obtain initial conditions for exact estimation, but it is convenient when the model involves very long time series or it is very complex, as is the case of models with multiple seasonal factors or many parameters.
An algorithm for automatic identification of ARIMA models (function modelAUTO) is included, inspired in [5] and coded in the widespread forecast package in R. The procedure follows this reference except in the way differencing orders are identified. In particular, instead of relying on formal unit root tests, ECOTOOL selects difference orders by minimizing the variance of the resulting time series. This discrepancy is introduced due to many problems detected with formal unit root tests when applied to real time series. In addition, the automatic method is expanded to multi-seasonal models, making ECOTOOL the unique piece of software that implements this procedure, to the author knowledge.
Automatic identification of four types of outliers are coded for ARIMA and TF models, following [21] and [4]. The types are additive, innovative, level shift and transitory change (coded in ECOTOOL as AO, IO, LS and TC). They are modeled as particular TFs applied to impulse dummy variables.
ETS and UC models are estimated from their equivalent ‘reduced’ or ARIMA form, as in [22]. This allows to incorporate to these methods all the power ECOTOOL offers for modeling ARIMA processes. In particular, they may be estimated by exact ML, may include inputs as transfer functions and automatic detection of outliers may be carried out. All these are, once more, unique properties of ECOTOOL, as far as the author is concerned.
The estimation output of any sort of models is rather exhaustive in tabular form. Such tables show parameter values with their standard errors and T tests, information criteria, correlation among parameters and, in the case of TF and ARIMA models, warnings about problems with unit roots in either numerator or denominator polynomials.

There are two functions that produce GUI interfaces that deserve special attention, namely toolTEST and toolFORECAST.

toolTEST is conceived as a friendly and exhaustive environment for descriptive statistics, as well as an identification tool and model validation tool of multivariate time series. When it is invoked, three menus unfold in a standard figure window in addition to the usual figure menus: i) a comprehensive ‘Tests’ menu, briefly explained below; ii) a ‘Series’ menu if the input is multivariate and allows to apply the tests to any individual and/or to all the time series at once; and iii) an ‘Options’ menu to deal with specific options for each item in the ‘Tests’ menu.

The ‘Tests’ menu offers a thorough combination of tabular and graphical information for many tests available. The following is a non-exhaustive list of such tests, classified in different categories:

Descriptive information: time plots, box plots, scatter plots, descriptive statistics, histograms, formal Gaussianity tests [23, 24].
Identification tools: univariate and multivariate sample autocorrelation and partial autocorrelation functions, Ljung-Box Q and Monti tests [25, 26], information criteria, Akaike’s, Schwarz, Hannan and Quin, [27–29], Granger causality tests based on VAR models [30].
Constant mean and heteroscedasticity tests: CUSUM and CUSUMSQ tests [31], mean vs standard deviation scatter plots, formal ratio of variances heteroscedasticity test, Box-Cox transformation estimation [32, 33].
Integration and cointegration tests: Dickey-Fuller and Perron unit root tests, Johansen cointegration tests [34–36].
Non-linearity tests: [26, 37–39], Schwarz criterion on squares.
Frequency domain tools: cumulative periodogram, smoothed or raw periodogram, AR-spectrum [16, 40].

The second useful GUI is the so called toolFORECAST, that is designed to show graphically the forecasting output of the model functions in ECOTOOL and to print out tables with plenty of error metrics (see examples in Case studies section).

Function toolFORECAST allows to plot one or several forecasts and forecasting errors for each time series in turn, selecting the output to be displayed interactively by menus. For long time series several pushbuttons permit to slide side-wise along the time series (as in toolTEST). Tabular results may be displayed in three different forms, i) actual values, forecasts, errors and error metrics for each time series and each forecasting step separately; ii) overall error metrics for one time series but many methods altogether to do fast comparisons; and iii) Diebold-Mariano test, sign test and Wilcoxon signed-rank test for testing statistical significant differences among several forecasting methods [41, 42].

Case studies

The present section considers three case studies chosen to illustrate ECOTOOL working on real data. Not all the capabilities of the toolbox are shown in these examples, due to space restrictions. The documentation shows a wide range of thorough examples, run step-by-step with their respective coding, covering all the models and tools available in ECOTOOL. There, the implementation is shown deploying the code necessary to run the examples, together with the output produced.

The three forecasting cases shown below are designed following some common rules. They are rolling forecasting experiments in which the training in-sample data length, the testing out-of-sample data length, and the forecast origin are fixed initially depending on the properties of each dataset. The first round of forecasts with all the appropriate models is run and forecasted and the corresponding actual values stored. Then, the window is moved several samples ahead and all the forecasts are produced again with the models identified and estimated with the most recent information. This updating step is repeated to the end of the data.

This exhaustive evaluation of forecasting performance of each model is completed with the help of two error metrics that have proven very useful in many applications and are free from some inconveniences, namely the symmetric Mean Absolute Percentage Error (sMAPE) and the Mean Absolute Scaled Error (MASE), see Eqs (8) and (9) and [43, 44].

In Eqs (8) and (9) y_t and y ^ t are the actual and forecasted values at time t, respectively; h is the forecast horizon; s is the seasonal period of the data, if applicable, or just 1 if the data is not seasonal; and n is the number of observations in the fitting sample. The sMAPE metric avoids the distortions of the standard non-symmetric MAPE criterion and problems for values close to zero. The MASE metric compares the out-of-sample performance of the model with the in-sample performance of a simple seasonal RW (see Table 1), i.e. assuming that the model for the data is y_t = y_t−s + a_t, with a_t white noise with zero mean, constant variance and serially independent.

Forecasting analysis of Mauna Loa CO₂ concentration data

The measured CO₂ concentration at Mauna Loa is represented in Fig 1 (ftp://aftp.cmdl.noaa.gov/products/trends/co2/co2_mm_mlo.txt). It consists of sixty years of monthly data collected from March 1958 to May 2018. The data show a growing trend during the full period associated with long run emissions and a seasonal component rather stable related to the global net uptake and release of CO₂ in summer and winter.

The Mauna Loa CO<sub>2</sub> concentration data from March 1958 to May 2018. — **Fig. 1. The Mauna Loa CO₂ concentration data from March 1958 to May 2018.**

The upward trend and the seasonal variations may be also checked out by either the pseudo-periodogram (pseudo because the time series is not mean-stationary) or the AR pseudo-spectrum available in the toolTEST GUI tool (not shown here to save space). Clear peaks appear for zero frequency related to the trend and the seasonal periods 12 and 6 samples per cycle. The rest of harmonics do not show up.

To give the feeling of how the ECOTOOL code looks like, some coding is shown for this example. The next listing shows how to load the data, select the raw CO₂ data (stored in the fifth column of the matrix data downloaded from the official web page), avoid the first two years of monthly observations, select 480 months after that and select two parameters that will be used later on, namely the number of forecasts (nofs) and the seasonal period (s). The last line is useful for those users who want to try the variance stabilizing Box-Cox transformation of the data (applied to the first 480 months). It can also be accessed by the ‘Box-Cox Transformation’ option in the ‘Tests’ menu of the toolTEST GUI. By this command the λ parameter of the transformation is estimated according to [33] and returns the transformed variable (ty) and λ (lambda).

load maunaLoa.data % Load data

y = maunaLoa(25:504, 5); % Select in-sample CO2

nofs = 24; % Number of forecasts

s = 12; % Seasonal period

[ty, lambda] = vboxcox(y); % Box-Cox transform

The models used in this case study are all those available in ECOTOOL that may sensibly applied to this data. In particular:

Airline: ARIMA(0, 1, 1) × (0, 1, 1)₁₂ estimated with modelTF. The architecture of the model is maintained along the whole experiment, but the parameters are updated at each step.
Auto: ARIMA automatically identified by modelAUTO. Both architecture and parameters are updated.
AR: pure AR model automatically identified by a procedure similar to modelAUTO but truncated to avoid MA terms.
UC: Unobserved components model composed of a Local Linear Trend with a trigonommetric seasonal component of period 12 samples per cycle. This is provided by modelUC with model option ‘LLT12’.
ETS: additive Exponential Smoothing with level, slope and seasonal (modelES with model option ‘AA12’).
Naïve: seasonal naïve, i.e. forecasts are taken as the last observation of the same season available (third column of output from modelNAIVE function).

The next listing shows how the previous models may be run for the first 480 months of the data (see details on how to use these functions in the toolbox documentation). Pure AR models are not included in the listing because they were produced with an ad-hoc particular function not included in ECOTOOL.

model = '(1+ma1*B)(1+ma12*B12)/(1-B)(1-B12)';

m1 = modelTF(ty, [], model, nofs); % Airline model

m2 = modelAUTO(y, [], [1 s]); % Auto ARIMA model

m3 = modelUC(ty, [], 'LLT12', nofs); % UC model

m4 = modelETS(ty, [], 'AA12', nofs); % ETS model

m5 = modelNAIVE(ty, nofs, s); % Naive models

Mind that the syntax for all models are very similar and that the first input to modelAUTO is the time series without Box-Cox transformation because such change is tested inside this particular function. The output variables m1 to m5 are MATLAB structures with all the relevant information about the model output, like residuals, forecasts, etc.

The automatic ARIMA identification procedure implemented in ECOTOOL (by means of modelAUTO function) produces an ‘airline’ model, i.e. ARIMA(0, 1, 1) × (0, 1, 1)₁₂. This model is perfectly supported by the Simple and Partial Autocorrelation functions for the differenced data shown in Fig 2. It certainly shows that the MA terms are essential to avoid over parameterisation if only AR terms are used, since finite order MA models are theoretically equivalent to infinite order AR models, as in [14].

**Fig. 2. Simple and Partial Autocorrelation functions for the Mauna Loa differenced data.**

Fig 2 is the result of the following code, though it can also be produced by toolTEST.

z = vdif(ty, [1 1], [1 s]); % Difference series

vident(z); % Identification

Calculating error metrics and statistical tests about forecast errors, and performing some graphical checks of forecasts are very simple with the help of the toolFORECAST function (actually a GUI). Next listing shows how to produce such outputs in the original scale of the variable. This requires to undo the Box-Cox transformation on the model forecasts, stored in field .py of all model structures (m1 to m5).

ACTUAL = maunaLoa(505:528, 5);

FORECASTS = vboxcoxinv([m1.py m2.py m3.py m4.py m5.py], lambda);

toolFORECAST(ACTUAL, FORECASTS)

The forecast exercise consists of a rolling out experiment in which the initial forecast origin is chosen at 480 observations from the beginning (March 1998) and the forecast horizon is 24 months ahead. Then, one month is added to the sample and the whole process is repeated to the end of the sample. Therefore, 220 total rounds of 24 months-ahead forecasts from all models are produced. The average forecasting performance of all models used are shown in Table 4. The table is divided in two parts reporting the average SMAPE and MASE metrics for each model for selected horizons ranging from 1 to 24 months.

**Tab. 4. Average sMAPE and MASE metrics for different models on the Mauna Loa experiment.**

Table 4 offers some interesting insights into the forecasting issue of the Mauna Loa data. Firstly, forecasts deteriorate with the horizon for all models, as expected. Secondly, all models show significant performance improvements over the Naïve, implying that they are really capturing the structure of the data beyond a naïve seasonal pattern. Thirdly, all errors are very small implying that the series can be forecast with great accuracy (take, for example, the Airline model that produces an average sMAPE of only 0.139% for 24 months ahead). Fourthly, no big differences in performance may be reported among models up to one year ahead with the exception of Naïve, though strictly speaking, Airline outperforms the rest at every single step. However, that is not the case for 24 steps ahead, where the best model is Auto, instead. Finally, AR model lies in a middle range, better than UC and ETS, while ETS deteriorates importantly for 24 months ahead.

The overall conclusion is that ECOTOOL provides a set of models rather reasonable for modelling and forecasting the Mauna Loa data. Moreover, the Unobserved Components may be used to extract the trend of the data (stored in field comp of structure m3) and compared to the ‘official’ one reported in the original web page. Fig 3 shows a detail of the data with the ‘official’ trend and the one estimated by modelUC. One single trend is visible because, though they are not exactly the same, both are consistent.

**Fig. 3. Detail of Mauna Loa data with the ‘official’ and modelUC trends.**

Hourly electricity demand in Spain

The data for the Spanish electricity demand used in this case study is publicly available at the Iberian Energy Market Operator web page (OMIE: http://www.omie.es). The data is continuously updated from 29 June 2001. Fig 4 shows a portion of such data, from 1 January 2017 to 12 June 2018.

**Fig. 4. Spanish hourly electricity load demand from January 2017 to June 2018.**

These data are characterised by a number of periodic components superimposed that have to be dealt with, if a comprehensive model wants to be fitted. Firstly, the data exhibits a clear annual cycle with two peaks in winter and summer, respectively, closely related to temperatures. Secondly, a strong diurnal cycle, with different profiles depending on the season of the year. Thirdly, a weekly cycle is present with lower demand during weekends, mainly due to the absence of industrial activity. Finally, the data is affected by a number of special days, special events, moving festivals and holidays, etc.

In general, it is common to avoid modelling the year cycle for short term forecasting with hourly data, for several reasons: i) the most important drivers of the data in the short run are the daily and weekly cycle, while the annual cycle would become of paramount importance for longer horizons (from one week onwards); ii) it is parametrically unfeasible and much research should be conducted if trivial extensions of existing models want to be avoided. The problem is that the annual cycle holds 8,760 hours and a trivial extension of models to take into account the periodic behaviour would involve 4,380 harmonics. One way to tackle with these problems is with the aid of time aggregation techniques, e.g., [17], but they are rather specific methods that require specialised software, far beyond the scope of this paper, that focuses on presenting a toolbox for general use.

In this context is where ECOTOOL offers an important innovation, since automatic identification of ARIMA models is extended for these complex type of databases, namely the multi-seasonal ARIMA model, i.e., models that include as multiplicative seasonal factors as necessary. As far as the author is concerned, this is the first time that an automatic algorithm is developed for such complex cases. Certainly, in this case the model is composed of the multiplication of three ARMA factors, namely regular, daily and weekly seasonals. The general specification is in Eq (10), with the same nomenclature of Eq (1).

θ_q(B)/ϕ_p(B) is a ratio of polynomials in the back-shift operator of appropriate orders, respectively; Θ Q 1 ( B 24 ) / Φ P 1 ( B 24 ) is similarly a ratio with orders Q₁ and P₁ in multiples of 24 hours per day; and Θ Q 2 ( B 168 ) / Φ P 2 ( B 168 ) is similarly defined, though for 168 hours per week. The multi-seasonal extension of the ‘airline’ model used later is shown in Eq (11).

y_t in Eq (11) is the undifferenced time series because the differences are included explicitly in the model denominators.

The automatic identification applied to these data suggests that daily differencing is not necessary, implying that the ‘airline’ model in Eq (11) is strictly wrongly specified because of over-differentiation. However, manual identification of demand series imposing the differences implied by the ‘airline’ model (i.e., ( 1 - B ) ( 1 - B 24 ) ( 1 - B 168 ) demand t) produces a clear evidence in favour of the airline model except for the non-seasonal part of the correlogram (lags 1 to 11, see Fig 5). Therefore, the airline model will be kept as a benchmark to compare with.

**Fig. 5. Simple and Partial Autocorrelation functions for the Spanish load demand data.**

The models trained are: i) a weekly Naïve of period 168 hours per cycle as a bottom benchmark (Naïve, daily naïve with period 24 was also tried, but was discarded because the results were systematically worse); ii) multi-seasonal airline in Eq (11) as a more sophisticated benchmark (Airline); iii) multi-seasonal model in Eq (10) automatically identified with modelAUTO (Auto); and iv) multi-seasonal AR model identified in a similar manner (AR).

All models were estimated and used to forecast a week ahead along a full year (from July 2017 to June 2018) with a rolling forecast origin every 6 hours and samples of 8 weeks length. Thence, 1,460 rounds of 168 hours-ahead forecasts were calculated with each model. The window size (8 weeks) allow the models to adapt for the changing profile of the seasonal components over the year. A full year of data was reserved as the test set to give a better overall idea of forecasting performance, since such performance varies with the season of the year.

Results for sMAPE and MASE metrics are shown in Table 5 for a selection of forecast horizons. Some relevant observations follow. Firstly, the forecast performance deteriorates with the forecast horizon for all methods. Secondly, the Auto method is the best for all horizons when compared with Airline, meaning that the automatic identification implemented in ECOTOOL makes sense in terms of forecasting performance. Thirdly, Auto is better than AR as well, implying that including moving average terms in the models pays back in terms of forecasting performance. Finally, a striking result is that the deterioration of the Naïve model is much slower than in the rest of cases (though its performance for shorter horizons is rather poor), but it is the best options for the long horizons, for 6 and above days.

**Tab. 5. Average sMAPE and MASE metrics for different models on electricity demand data.**

This latter observation means that standard time series models focus on short-term horizons and more sophisticated extensions should be provided for longer horizons. One clear extension would be to add the annual cycle into the models, since it could be the case that for horizons long enough the lack of a annual cycle starts to be felt.

One final point worth considering is that the optimal forecasts of the Mauna Loa data in the previous case study are systematically much more accurate than the electricity demand forecasts, because of a much greater level of uncertainty in the latter case. This point may be checked by comparing Table 4 with the appropriate rows in Table 5, bearing in mind that 12 and 24 hours ahead in Table 5 corresponds to 1 and 2 years in Table 4, respectively. The sMAPE for electricity demand 24 steps ahead is about 30 times the sMAPE for the CO₂ concentration data.

Global Horizontal Irradiation forecasting at a photovoltaic plant in Ciudad Real, Spain

This case study evaluates the forecasting performance of the models implemented in ECOTOOL when applied to the Global Horizontal Irradiation (GHI) data provided by the Spanish Meteorological Estate Agency (AEMET) weather station located at Ciudad Real, Spain. The original dataset consisted of 18 years of GHI hourly observations. Fig 6 shows an overview of the last year of data that shows clearly the annual cycle and the variability among different days, sometimes weeks, depending mainly on the cloud cover.

**Fig. 6. A full year of GHI data at a photovoltaic plant in Ciudad Real, Spain.**

One typical feature of the data is that GHI drops down to zero every night at different times within the day depending on the sunrise and sunset times. Consequently, the time series contains numerous zeros deterministically located along the year. In winter there are just 10 sun hours, while in summer the Sun shines for up to 16 hours. An efficient way to deal with this singularity of the data is removing such zeros before the modelling stage and inserting them back to build the final forecast for full days. In this way, at each forecast origin the periodicity of the data is different, depending on the time of the year.

Due to this peculiarity, the data is rather heteroscedastic along the year. This problem may be alleviated by the Box-Cox transformation, that in ECOTOOL is implemented in the function vboxcox (that may be run directly or by a menu option within toolTEST), in which the optimal lambda is estimated following the model-independent approach by [33]. Lambda turned up to be close to 0 in most cases, meaning that the optimal transformation is the natural logarithm. Fig 7 illustrates the convenience of these two transformations. Top panel shows two months of the original data, while the bottom panel shows the data in logs and after zero-removal. The sample length is drastically reduced (only 10 samples per day out of 24 remained in the case shown) and the natural logarithm transformation renders a time series with proper statistical properties, at least regarding homoskedasticity.

**Fig. 7. Two months of original GHI data (top panel) and the data after transformation with Box-Cox and removal of night zero values.**

This case, unlike electricity demand, is not multi-seasonal, since only a diurnal period is observed on top of an annual cycle. The annual cycle is ignored because models used are sensible strictly for short run forecasting (see discussion on this issue in the previous case study). Then, the methods used in this case are the ones already used in the Mauna Loa case, but with time varying periods due to the zero values removal. More specifically the models are ARIMA Airline; ARIMA Auto identified by means of modelAUTO; automatic AR models; UC model; ETS and seasonal Naïve of period 24 hours.

To illustrate ECOTOOL working on this data, the rolling experiment is conducted by selecting samples of 2 months of data previous to each forecasting origin. One week ahead of data is forecast at each step and the forecast origin is moved 6 hours forward. The evaluation is repeated along a full year of data, i.e., 1,460 total runs of each model.

sMAPE metrics cannot be computed in this case due to the presence of simultaneous zeros in data and forecasts. Indeed, inspection of Eq (8) shows this is one of few particular cases where sMAPE is not defined because both forecasts and actual values are zero. Cases like this highlights the utility of other metrics, like MASE that may be still be computed.

Results are reported in Table 6, showing clearly that the GHI data is less forecastable than the previous cases. The main reason for this is that all the MASE measurements are much bigger now. As an example, the Auto model renders MASE values that are about twice the electricity case and almost four times the Mauna Loa case for 12 steps ahead forecasts. But the main reason is that the Naïve model, i.e., projecting as forecasts into the future just the last seasonal cycle available, is rather good. Indeed, for horizons longer than 9 hours ahead, the Naïve model is the absolute winner, and it is the best than most of them for horizons longer than 6 hours ahead.

**Tab. 6. MASE metrics for different models on a GHI data from a photovoltaic plan in Spain.**

But still, this confusing evidence should not distract from other type of evidence. Firstly, for horizons shorter than 9 hours the best method is Auto. Secondly, the performance of ETS is rather poor, since is better than Naïve only for 1 and 2 hours ahead. Thirdly, AR is more accurate than UC and ETS for short horizons, but it deteriorates rather badly for longer horizons. Finally, Airline and Auto outperforms AR for any forecast horizon.

Putting together all this evidence, several conclusions follow:

Forecasting GHI data is rather difficult because of its inherent volatility. Improving simple models for horizons longer than 9 hours ahead may require a lot of modelling effort. The improvements may be more related to the use of inputs (like cloud cover) than to other methods closer to Machine Learning that usually concentrate on very short forecast horizons. However, predicting cloud cover to use them as an input to GHI may be more complex than predicting GHI itself.
Pure AR models are sub-optimal, especially bad in this case study. If ARIMA models want to be used, there is no doubt that MA terms are very helpful in relation to forecasting accuracy, in contrast to the view of [14].
The automatic identification of ARIMA models implemented in ECOTOOL produces models that outperform the rest.

Conclusions

This paper has introduced ECOTOOL, a toolbox intended mainly for professional practitioners, academic researchers, students, and anyone involved in the analysis of time series, forecasting or signal processing. ECOTOOL is composed of a number of powerful functions to estimate a wide range of models in a rather user friendly manner; with abundant tools for identification, validation and graphical representation of results.

The main methods implemented are ARIMA, Exponential Smoothing, Unobserved Components, ARX, ARMAX, Transfer Function, Distributed Lag models and VARX. Several properties are the salient features of the toolbox, e.g. it is user-oriented; just a few MATLAB functions are enough to carry out a complete analysis of time series; model specification is rather simple and flexible; several estimation methods are implemented; automatic detection and estimation of four types of outliers is implemented; the toolbox is very robust making it useful in long automatic experiments in programming environments.

The toolbox also provides a wide range of descriptive information of the data, both graphically and in tabular format; standard and not so standard identification tools; formal and visual tests for gaussianity, independence, causality, heteroscedasticity, non-linearity, unit root and cointegration; spectral tools; tests on forecasting performance; etc.

Though many of the procedures implemented may be found in other software packages, some of them are exclusive to ECOTOOL, to the author knowledge. It is the case of the automatic identification of multi seasonal ARIMA models, automatic detection of outliers in Unobserved Components and Exponential Smoothing models, and the possibility of estimating Unobserved Components and Exponential Smoothing models by Exact Maximum Likelihood adding inputs specified as dynamic transfer functions.

The toolbox is shown working on three case studies, in which several methods are tested on forecasting time series with different sampling interval and degrees of complexity, namely the monthly CO₂ concentration data at Mauna Loa, hourly electricity demand in Spain, and hourly global horizontal irradiation at a photovoltaic plant in Ciudad Real, Spain.

Results show clearly that forecastability depends strongly on the case, being the global irradiation the worst case to forecast. In all cases, the automatic procedure of identification of ARIMA models shows great potentiallity as a general tool in forecasting tasks and including moving average terms in ARIMA models increases forecast accuracy.

Zdroje

1. The MathWorks Inc. MATLAB—The Language of Technical Computing, Version R2018a. Natick, Massachusetts. URL http://www.mathworks.com/products/matlab/; 2018.

2. Taylor CJ, Pedregal DJ, Young PC, Tych W. Environmental Time Series Analysis and Forecasting with the Captain Toolbox. Environmental Modelling & Software. 2007;22(6):797–814. doi: 10.1016/j.envsoft.2006.03.002

3. Liu LM. Time Series Analysis and forecasting. Scientific Computing Associates Corp.; 2009.

4. Gómez V, Maravall A. Automatic Modeling Methods for Univariate Series. In: A Course in Time Series. John Wiley & Sons, Inc.; 2001. p. 171–201.

5. Hyndman RJ, Khandakar Y. Automatic Time Series Forecasting: The Forecast Package for R. Journal of Statistical Software. 2008;3(27):1–22.

6. Cottrell A, R L. Gretl; 2018. http://gretl.sourceforge.net/.

7. Koopman S, Harvey A, Doornik J, Shephard N. STAMP 8.2: Structural Time Series Analyser and Modeller and Predictor. Timberlake Consultants Limited; 2009.

8. Bossche FV. Fitting State Space Models with EViews. Journal of Statistical Software. 2011;41(8):1–16. doi: 10.18637/jss.v041.i08

9. Selukar R. State Space Modeling Using SAS. Journal of Statistical Software. 2011;41(12):1–13. doi: 10.18637/jss.v041.i12

10. Drukker R, Gates R. State Space Methods in Stata. Journal of Statistical Software. 2011;41(10):1–25. doi: 10.18637/jss.v041.i10

11. Hyndman R, Koehler AB, Ord JK, Snyder RD. Forecasting with Exponential Smoothing: the State Space Approach. Springer Science & Business Media; 2008.

12. Box GEP, Jenkins GM, Reinsel GC, Ljung GM. Time Series Analysis: Forecasting and Control. 5th ed. John Wiley & Sons; 2015.

13. Harvey AC. Forecasting, Structural Time Series Models and the Kalman Filter. Cambridge university press; 1989.

14. Chen J, Boccelli DL. Real-time forecasting and visualization toolkit for multi-seasonal time series. Environmental Modelling & Software. 2018;105:244–256. https://doi.org/10.1016/j.envsoft.2018.03.034

15. Zeng Q, Wen H, Huang H, Pei X, Wong SC. Incorporating temporal correlation into a multivariate random parameters Tobit model for modeling crash rate by injury severity. Transportmetrica A: Transport Science. 2018;14(3):177–191. doi: 10.1080/23249935.2017.1353556

16. Wei WS. Time Series Analysis–Univariate and Multivariate Methods. Temple University. USA; 2006.

17. Pedregal DJ, Trapero JR. Mid-term hourly electricity forecasting based on a multi-rate approach. Energy Conversion and Management. 2010;51(1):105–111. https://doi.org/10.1016/j.enconman.2009.08.028

18. Chen F, Chen S, Ma X. Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. Journal of Safety Research. 2018;65:153–159. doi: 10.1016/j.jsr.2018.02.010 29776524

19. Ma X, Chen S, Chen F. Multivariate space-time modeling of crash frequencies by injury severity levels. Analytic Methods in Accident Research. 2017;15:29–40. doi: 10.1016/j.amar.2017.06.001

20. Durbin J, Koopman SJ. Time Series Analysis by State Space Methods. 2nd ed. Oxford University Press; 2012.

21. Tsay RS. Time Series Model Specification in the Presence of Outliers. Journal of the American Statistical Association. 1986;81(393):132–141. doi: 10.1080/01621459.1986.10478250

22. Broze L, Mélard G. Exponential smoothing: Estimation by maximum likelihood. Journal of Forecasting. 1990;9(5):445–455. doi: 10.1002/for.3980090504

23. Lilliefors HW. On the Kolmogorov-Smirnov Test for Normality with Mean and Variance Unknown. Journal of the American Statistical Association. 1967;62(318):399–402. doi: 10.1080/01621459.1967.10482916

24. Bera AK, Jarque CM. Efficient tests for normality, homoscedasticity and serial independence of regression residuals: Monte Carlo Evidence. Economics Letters. 1981;7(4):313–318. https://doi.org/10.1016/0165-1765(81)90035-5

25. Ljung GM, Box GEP. On a measure of lack of fit in time series models. Biometrika. 1978;65(2):297–303. doi: 10.1093/biomet/65.2.297

26. Monti AC. A Proposal for a Residual Autocorrelation Test in Linear Models. Biometrika. 1994;81(4):776–780. doi: 10.1093/biomet/81.4.776

27. Akaike H. A New Look at the Statistical Model Identification. IEEE Transactions on Automatic Control. 1974;19:716–723. doi: 10.1109/TAC.1974.1100705

28. Schwarz GE. Estimating the Dimension of a Model. Annals of Statistics. 1978;6:461–464. doi: 10.1214/aos/1176344136

29. Hannan EJ, Quinn BG. The Determination of the order of an autoregression. Journal of the Royal Statistical Society, Series B. 1979;41:190–195.

30. Granger CWJ. Investigating Causal Relations by Econometric Models and Cross-spectral Methods. Econometrica. 1969;37(3):424–438. doi: 10.2307/1912791

31. Brown RL, J D, M EJ. Techniques for testing the constancy of regression relationships over time (with discussion). Journal of the Royal Statistical Society B. 1975;37:149–192.

32. Box GEP, Cox DR. An Analysis of Transformations. Journal of the Royal Statistical Society Series B (Methodological). 1964;26(2):211–252. doi: 10.1111/j.2517-6161.1964.tb00553.x

33. Guerrero VM. Time series analysis supported by power transformations. Journal of Forecasting. 1993;12(1):37–48. doi: 10.1002/for.3980120104

34. Dickey DA, Fuller WA. Distribution of the Estimators for Autoregressive Time Series with a Unit Root. Journal of the American Statistical Association. 1979;74(366a):427–431. doi: 10.2307/2286348

35. Phillips PCB, Perron P. Testing for a Unit Root in Time Series Regression. Biometrika. 1988;75(2):335–346. doi: 10.1093/biomet/75.2.335

36. Johansen S. Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Autoregressive Models. Econometrica. 1991;59(6):1551–1580. doi: 10.2307/2938278

37. McLeod AI, K LW. Diagnostic checking ARMA time series models using squared residual autocorrelations. Journal of Time Series Analysis. 1983;4:269–273. doi: 10.1111/j.1467-9892.1983.tb00373.x

38. Peña D, Rodriguez J. Detecting nonlinearity in time series by model selection criteria. International Journal of Forecasting. 2005;21(4):731–748. https://doi.org/10.1016/j.ijforecast.2005.04.014

39. Tsay RS. Nonlinearity tests for time series. Biometrika. 1986;73(2):461–466. doi: 10.1093/biomet/73.2.461

40. Young PC, Pedregal DJ, Tych W. Dynamic Harmonic Regression. Journal of Forecasting. 1999;18(6):369–394. doi: 10.1002/(SICI)1099-131X(199911)18:6%3C369::AID-FOR748%3E3.0.CO;2-K

41. Diebold FX, Mariano RS. Comparing Predictive Accuracy. Journal of Business & Economic Statistics. 1995;13(3):253–263. doi: 10.1080/07350015.1995.10524599

42. Wilcoxon F. Individual Comparisons by Ranking Methods. Biometrics Bulletin. 1945;1(6):80–83. doi: 10.2307/3001968

43. Makridakis S, Hibon M. The M3-competition: results, conclusions and implications. International Journal of Forecasting. 2000;16(4):451–476. doi: 10.1016/S0169-2070(00)00057-1

44. Hyndman RJ, Koehler AB. Another look at measures of forecast accuracy. International Journal of Forecasting. 2006;22(4):679–688. https://doi.org/10.1016/j.ijforecast.2006.03.001