+ - 0:00:00
Notes for current slide
Notes for next slide




The uncertainty estimation of feature-based forecast combinations


Xiaoqian Wang

Beihang University

41st International Symposium on Forecasting

June 17, 2021

1 / 29

Joint work with

Yanfei Kang Fotios Petropoulos Feng Li
Beihang University University of Bath Central University of
Finance and Economics
2 / 29

Outline

  • Introduction

  • Feature-based interval forecasting framework

  • Weight determination

  • Application to the M4 competition data

  • Conclusions

3 / 29

Introduction

4 / 29

Motivation

                  Forecasting

    Time series    

  • Point forecasts
  • Probabilistic forecasts

5 / 29

Motivation

                  Forecasting

    Time series    

  • Point forecasts
  • Probabilistic forecasts

       


Forecasting method

    Individual models

  • Naïve
  • Snaïve
  • ARIMA
  • ETS...
6 / 29

Motivation

                  Forecasting

    Time series    

  • Point forecasts
  • Probabilistic forecasts


Description

       


Forecasting method

    Features

  • Trend
  • Linearity
  • Nonlinearity
  • Seasonality...

    Individual models

  • Naïve
  • Snaïve
  • ARIMA
  • ETS...
7 / 29

Motivation

                  Forecasting

    Time series    

  • Point forecasts
  • Probabilistic forecasts


Description

       


Forecasting method

    Features

  • Trend     
  • Linearity    Feature-based
  • Nonlinearity    forecasting
  • Seasonality...

    Individual models

  • Naïve
  • Snaïve
  • ARIMA
  • ETS...
8 / 29

Introduction

  • Point forecasting mainly forecasts the mean or the median of the distributions for future observations.
  • Probabilistic forecasting can provide a comprehensive outlook of the expected future value and the future uncertainty.
  • Time series features provide valuable information for decision makers.
  • The superiority of forecast combinations over a single model.
    • No-free-lunch theorem (Wolpert & Macready, 1997).
    • Horses for courses (Petropoulos et al., 2014).
    • Merely tackling model uncertainty is sufficient to help (Petropoulos et al., 2018).
9 / 29

Challenges

  • Previous literature mainly focuses on
    • point forecasting + forecast combinations.
  • How do features affect the uncertainty estimation of forecasts?
  • How to guarantee the effectiveness of the relationship in forecasting a newly given dataset?
  • How to translate the relationship into an attempt to improve the forecasting performance?


Feature-based probabilistic forecast combinations.

10 / 29

Feature-based interval forecasting framework

11 / 29

General framework

12 / 29

GRATIS (Kang et al., 2020)

13 / 29

Dataset

Reference (GRATIS)

Test (M4)

14 / 29

Other components

  • 42 times series features (R package tsfeatures)

  • Individual model pool

  • Interval forecast evaluation MSIS=1ht=n+1n+h(UtLt)+2α(LtYt)1{Yt<Lt}+2α(YtUt)1{Yt>Ut}1nmt=m+1n|YtYtm|

15 / 29

Linking features with performance

Why GAM?

  • Interpretability
  • Regularization
  • Flexibility


GAM model for each individual model

  • log(MSISN)FN×P
16 / 29

Partial effect analysis

Feature Description Range
seasonal_strength Strength of seasonality [0,1)
nonlinearity Nonlinearity coefficient [0,)
x_acf1 The first autocorrelation coefficient (1,1)


17 / 29

Partial effect analysis

  • The partial effect of one feature on the interval forecasting performance is distinct from the other features.

  • A feature has its unique way of affecting the interval forecasting performance of individual models.

  • Some features are biased towards up-weighting some forecasting models over others.

18 / 29

Weight determination

19 / 29

Weight assignment

Adjusted softmax function

Pij=exp{μilog(MSISij)^σi}k=1Mexp{μilog(MSISik)^σi},i=1,,N;j=1,,M

  • Negative values can be down-weighted to near-zero.
  • log(MSIS)AccuracyP

Optimal threshold ratio search

For ith time series,

  • calculate the ratio of weight Rk=Pij/max(Pik).
  • select individual models that satisfy Rk>Tr (0<Tr1).
20 / 29

Combined forecasts

Combined prediction intervals

fwil=1k=1SPikk=1SPikfiklfwiu=1k=1SPikk=1SPikfiku

Combined point forecasts

fwi=12(fwil+fwiu)

21 / 29

we assume the intervals to be symmetric around the point forecast

Optimal threshold ratio search

  • Model combination Model selection.
  • Tr=0 indicates that all the methods from the pool are selected.
  • Tr=1 indicates that only the method with the minimal fitted log(MSIS) is selected.
22 / 29

A larger threshold value means that fewer methods are selected for model combining, while a smaller threshold value means that many more methods are used for model combining.

This indicates that con- trolling the number of methods using the threshold searching algorithm is beneficial for improving the forecasting performance.

Application to the M4 competition data

23 / 29

Selection rates of each model

24 / 29

Performance for different confidence levels

25 / 29

Forecasting results

26 / 29

Conclusions

27 / 29

Conclusions

  • Features are taken into account to estimate the uncertainty of forecasts (cross-learning).

  • We propose an optimal threshold ratio searching algorithm to select an appropriate subset of models per time series for model combination.

  • Our approach outperforms a variety of individual models with distinctions for both point forecasts and prediction intervals.

28 / 29

Joint work with

Yanfei Kang Fotios Petropoulos Feng Li
Beihang University University of Bath Central University of
Finance and Economics
2 / 29
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow