The uncertainty estimation of feature-based forecast combinations

Xiaoqian Wang

Beihang University

41st International Symposium on Forecasting

June 17, 2021

1 / 29

Joint work with


Yanfei Kang	Fotios Petropoulos	Feng Li
Beihang University	University of Bath	Central University of Finance and Economics

2 / 29

Outline

Introduction
Feature-based interval forecasting framework
Weight determination
Application to the M4 competition data
Conclusions

3 / 29

Introduction4 / 29

Motivation

Forecasting

Time series

Point forecasts
Probabilistic forecasts

5 / 29

Motivation

Forecasting

Time series

Point forecasts
Probabilistic forecasts

Forecasting method

Individual models

Naïve
Snaïve
ARIMA
ETS...

6 / 29

Motivation

Forecasting

Time series

Point forecasts
Probabilistic forecasts

Description

Forecasting method

Features

Trend
Linearity
Nonlinearity
Seasonality...

Individual models

Naïve
Snaïve
ARIMA
ETS...

7 / 29

Motivation

Forecasting

Time series

Point forecasts
Probabilistic forecasts

Description

Forecasting method

Features

Trend
Linearity Feature-based
Nonlinearity forecasting
Seasonality...

Individual models

Naïve
Snaïve
ARIMA
ETS...

8 / 29

IntroductionPoint forecasting mainly forecasts the mean or the median of the distributions for future observations.
Probabilistic forecasting can provide a comprehensive outlook of the expected future value and the future uncertainty.
Time series features provide valuable information for decision makers.
The superiority of forecast combinations over a single model.No-free-lunch theorem (Wolpert & Macready, 1997).
Horses for courses (Petropoulos et al., 2014).
Merely tackling model uncertainty is sufficient to help (Petropoulos et al., 2018).

9 / 29

Challenges

Previous literature mainly focuses on
- point forecasting + forecast combinations.
How do features affect the uncertainty estimation of forecasts?
How to guarantee the effectiveness of the relationship in forecasting a newly given dataset?
How to translate the relationship into an attempt to improve the forecasting performance?

Feature-based probabilistic forecast combinations.

10 / 29

Feature-based interval forecasting framework11 / 29

General framework

12 / 29

GRATIS (Kang et al., 2020)

13 / 29

Dataset

Reference (GRATIS)

Test (M4)

14 / 29

Other components

$42$ times series features (R package tsfeatures)
Individual model pool
Interval forecast evaluation $\begin{aligned} M S I S = \frac{1}{h} \frac{\sum_{t = n + 1}^{n + h} (U_{t} - L_{t}) + \frac{2}{α} (L_{t} - Y_{t}) 1 {Y_{t} < L_{t}} + \frac{2}{α} (Y_{t} - U_{t}) 1 {Y_{t} > U_{t}}}{\frac{1}{n - m} \sum_{t = m + 1}^{n} | Y_{t} - Y_{t - m} |} \end{aligned}$

15 / 29

Linking features with performance

Why GAM?

Interpretability
Regularization
Flexibility

GAM model for each individual model

$\log ({M S I S}_{N}) ⟺ F_{N \times P}$

16 / 29

Partial effect analysis

Feature	Description	Range
seasonal_strength	Strength of seasonality	$[0, 1)$
nonlinearity	Nonlinearity coefficient	$[0, \infty)$
x_acf1	The first autocorrelation coefficient	$(- 1, 1)$

17 / 29

Partial effect analysis

The partial effect of one feature on the interval forecasting performance is distinct from the other features.
A feature has its unique way of affecting the interval forecasting performance of individual models.
Some features are biased towards up-weighting some forecasting models over others.

18 / 29

Weight determination19 / 29

Weight assignment

Adjusted softmax function

$\begin{aligned} P_{i j} = \frac{\exp {\frac{μ_{i} - \hat{\log ({M S I S}_{i j})}}{σ_{i}}}}{\sum_{k = 1}^{M} \exp {\frac{μ_{i} - \hat{\log ({M S I S}_{i k})}}{σ_{i}}}}, i = 1, \dots, N; j = 1, \dots, M \end{aligned}$

Negative values can be down-weighted to near-zero.
$\log (M S I S) ↑ ⟹ Accuracy ↓ ⟹ P ↓$

Optimal threshold ratio search

For $i$ th time series,

calculate the ratio of weight $R_{k} = P_{i j} / max (P_{i k})$ .
select individual models that satisfy $R_{k} > T r$ $(0 < T r \leq 1)$ .

20 / 29

Combined forecasts

Combined prediction intervals

$\begin{aligned} f_{w i}^{l} & = \frac{1}{\sum_{k = 1}^{S} P_{i k}} \sum_{k = 1}^{S} P_{i k} f_{i k}^{l} \\ f_{w i}^{u} & = \frac{1}{\sum_{k = 1}^{S} P_{i k}} \sum_{k = 1}^{S} P_{i k} f_{i k}^{u} \end{aligned}$

Combined point forecasts

$\begin{aligned} f_{w i} = \frac{1}{2} (f_{w i}^{l} + f_{w i}^{u}) \end{aligned}$

21 / 29

we assume the intervals to be symmetric around the point forecast

Optimal threshold ratio search

Model combination $⟶$ Model selection.
$T r = 0$ indicates that all the methods from the pool are selected.
$T r = 1$ indicates that only the method with the minimal fitted $\log (M S I S)$ is selected.

22 / 29

A larger threshold value means that fewer methods are selected for model combining, while a smaller threshold value means that many more methods are used for model combining.

This indicates that con- trolling the number of methods using the threshold searching algorithm is beneficial for improving the forecasting performance.

Application to the M4 competition data23 / 29

Selection rates of each model

24 / 29

Performance for different confidence levels

25 / 29

Forecasting results

26 / 29

Conclusions27 / 29

Conclusions

Features are taken into account to estimate the uncertainty of forecasts (cross-learning).
We propose an optimal threshold ratio searching algorithm to select an appropriate subset of models per time series for model combination.
Our approach outperforms a variety of individual models with distinctions for both point forecasts and prediction intervals.

28 / 29

Thanks for your attention!

Paper: Wang et al., (2021, JORS)
R package: https://github.com/xqnwang/fuma
Slides: https://xqnwang.rbind.io/talk/fuma/Slides.html
Web: https://xqnwang.rbind.io

29 / 29

Yanfei Kang

Fotios Petropoulos

Feng Li

Beihang University

University of Bath

Central University of
Finance and Economics

Help

Keyboard shortcuts

↑, ←, Pg Up, k

Go to previous slide

↓, →, Pg Dn, Space, j

Go to next slide

Home

Go to first slide

End

Go to last slide

Number + Return

Go to specific slide

b / m / f

Toggle blackout / mirrored / fullscreen mode

Clone slideshow

Toggle presenter mode

Restart the presentation timer

?, h

Toggle this help