What works for forecasting macro economic series with deep learning?

Macro data is awkward for deep learning: short history, lots of missing values, mixed frequencies. I ran experiments on Korean output and investment indices with several model families. Below: what the data looks like, what actually worked (and what didn’t), and a few caveats.

What the data is like

Macro data often has very few time steps—you don’t get decades of daily points. That makes it easy to overfit. The setup I used was a real-time nowcasting pipeline for Korea: dozens of series (employment, prices, surveys, etc.) and two targets: all-industry output and equipment investment.

EDA of macro series: levels, distributions, correlations. — Exploratory analysis: high level, small volatility, strong correlation between series.

The series sit at high levels but don’t move much from one period to the next—especially the output index. So there’s limited “signal” for a model to learn from. The variables are highly correlated (a lot of redundancy). Some series are weekly, others monthly, and many values are missing (e.g. data not yet released). Models that can handle missing data “inside” the model (so-called state-space or factor models) have an advantage; others need you to fill in gaps first.

What worked and what didn’t

Short-term forecasts vs actual for output and investment. — Example short-term forecasts vs actual for output and investment indices.

Bottom line: no single best model. Results depended a lot on which series I was forecasting and how I set things up.

On the output index (flat, low variance, high correlation), a model that mixes weekly and monthly scales (TimeMixer) did poorly—there wasn’t enough variation for that kind of decomposition to help. On investment, which moves more and has clearer weekly–monthly structure, that same model did well. A transformer that looks at relationships between variables (iTransformer) did best on the output index; another patch-based transformer (PatchTST) was strong on some targets but inconsistent. Classic factor models (a few hidden “factors” driving all series) were numerically unstable in my setup—I had to add heavy constraints to get them to converge, which made the “interpretable” factors hard to trust. A deep version of the factor model (neural net instead of linear maps) did better, and switching to spline interpolation helped.

So in practice: pick the model to match the series and the horizon. Models with an internal “state” (state-space / factor style) handle missing data and multi-step forecasting in a consistent way, but for short-term accuracy they didn’t always beat the attention-based models when the data suited the latter. Overfitting was a real issue. For real-time nowcasting, updating the forecast one step at a time beat trying to predict many steps ahead in one go.

Caveats and takeaway

This was one country and one data pipeline. The data are short, so conclusions are limited. Takeaway: performance is conditional—match the model to the target series, how far ahead you forecast, and whether you need interpretability. Prefer one-step-ahead updates for nowcasting.

References

FRBNY Nowcast — Bok, B., Caratelli, D., Giannone, D., Sbordone, A., & Tambalotti, A. (2019). The FRBNY Staff Nowcast. Federal Reserve Bank of New York Staff Reports, 897.
TFT — Lim, B., Arik, S. Ö., Loeff, N., & Pfister, T. (2021). Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting. International Journal of Forecasting, 37(4), 1748–1764.
PatchTST — Nie, Y., Nguyen, N. H., Sinthong, P., & Kalagnanam, J. (2022). A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. arXiv:2211.14730.
iTransformer — Liu, Y., Hu, T., Zhang, H., et al. (2023). iTransformer: Inverted Transformers are Effective for Time Series Forecasting. arXiv:2310.06625.
Deep Dynamic Factor Models — Andreini, P., Izzo, C., & Ricco, G. (2020). Deep Dynamic Factor Models. Working Paper (v. May 2023).
Dynamic factor models — Stock, J. H., & Watson, M. W. (2016). Dynamic Factor Models, Factor-Augmented VARs, and Structural VARs in Macroeconomics. Handbook of Macroeconomics, 2, 415–525. Elsevier.

What works for forecasting macro economic series with deep learning?

What the data is like

What worked and what didn’t

Caveats and takeaway

References

Other posts