Master data

High-dimensional statistical learning: New methods to advance economic and sustainability policies
Description:

Recent years have seen a tremendous surge in the availability of socioeconomic data characterized by vast complexity and high dimensionality. However, prevalent methods employed to inform practitioners and policy makers are still focused on small to medium-scale datasets. Consequently, crucial transmission channels are easily overlooked and the corresponding inference often suffers from omitted variable bias. This calls for novel methods which enable researchers to fully exploit the ever increasing amount of data.

In this project, we aim to investigate how the largely separate research streams of Bayesian econometrics, statistical model checking, and machine learning can be combined and integrated to create innovative and powerful tools for the analysis of big data in economics and other social sciences. Thereby, we pay special attention to properly incorporating relevant sources of uncertainty. Albeit crucial for thorough empirical analyses, this aspect is often overlooked in traditional machine learning techniques which have mainly been centered on producing point forecasts for key quantities of interest only. In contrast, Bayesian statistics and econometrics are based on designing algorithms to carry out exact posterior inference which in turn allows for density forecasts.

Our contributions are twofold: From a methodological perspective, we develop cutting-edge methods that enable fully probabilistic inference of dynamic models in vast dimensions. In terms of empirical advances, we apply these methods to highly complex datasets that comprise situations where either the number of observations, the number of potential time series and/or the number of variables included is large. More specifically, empirical applications center on four topical issues in the realm of sustainable development and socioeconomic policy to answer questions such as: How do market and economic uncertainty affect income inequality? What are the relationships between greenhouse gas emissions and macroeconomic indicators? Which role do tweets play in the evolution of the prices of crypto-currencies? Which policy measures are most effective to foster sustainable urban mobility patterns?

In these applications, we focus on probabilistic forecasting using real-time data to perform model validation in an efficient way. Moreover, we address structural inference. As policy makers are typically interested in evaluating their policies quantitatively, robust econometric tools are crucial for counterfactual simulations. In light of the increasing complexity of the economy, however, large information sets need to be exploited to appropriately recover the underlying causal structures and provide a rich picture of potential transmission channels of policy interventions.

The team constitutes a genuinely collaborative partnership of five young high-potential researchers composed of statisticians, machine learning experts, macro- and regional economists as well as social and computer scientists.

Keywords: Bayesian statistics, econometrics, (socio-)economic policy, statistical model checking, uncertainty quantification
Short title: FWF: ZK 35
Period: 01.08.2019 - 31.07.2024
Contact e-mail: gregor.kastner@aau.at
Homepage: https://zk35.org

Employees

Employees Role Time period
Gregor Kastner (internal)
  • 01.08.2019 - 31.07.2024
  • 01.08.2019 - 31.07.2024
  • 01.08.2019 - 31.07.2024
Luis Bastian Gruber (internal)
  • Research staff
  • 01.05.2021 - 31.07.2023
Alexander Mozdzen (internal)
  • Research staff
  • 01.05.2021 - 31.07.2023
Florian Schwendinger (internal)
  • Research staff
  • 01.09.2021 - 31.07.2023
Annalisa Cadonna (internal)
  • Research staff
  • 01.07.2022 - 31.07.2023
Laura Vana (external)
  • Research staff
  • Project leader
  • 01.01.2020 - 31.05.2021
  • 01.06.2021 - 31.07.2023
Rainer Hirk (external)
  • Research staff
  • 01.04.2020 - 01.10.2020
Florian Huber (external)
  • Applicant
  • Project leader
  • 01.08.2019 - 31.07.2023
  • 01.08.2019 - 31.07.2023
Michael Pfarrhofer (external)
  • Research staff
  • 01.08.2019 - 31.07.2023
Niko Hauzenberger (external)
  • Research staff
  • 01.08.2019 - 31.07.2023
Philipp Piribauer (external)
  • Applicant
  • Project leader
  • 01.08.2019 - 31.07.2023
  • 01.08.2019 - 31.07.2023
Laura Nenzi (external)
  • Applicant
  • Project leader
  • 01.08.2019 - 31.10.2021
  • 01.08.2019 - 31.10.2021
Roman Kuznets (external)
  • Research staff
  • 01.08.2019 - 31.01.2020
Ennio Visconti (external)
  • Research staff
  • 01.01.2020 - 31.07.2023
Karin Dobernig (external)
  • Applicant
  • Project leader
  • 01.08.2019 - 31.05.2021
  • 01.08.2019 - 31.05.2021
Stefan Schupp (external)
  • Project leader
  • 01.11.2021 - 31.07.2023
Stephan Adelsberger (external)
  • Research staff
  • 01.05.2022 - 31.07.2023
Roman Parzer (external)
  • Research staff
  • 01.12.2021 - 31.07.2023
Camilla Damian (external)
  • Research staff
  • 01.06.2021 - 01.10.2021
Nico Amstätter-Zöchbauer (internal)
  • Research staff
  • 01.01.2024 - 31.07.2024
Stephan Adelsberger (internal)
  • Research staff
  • 01.01.2024 - 31.07.2024

Categorisation

Project type Research funding (on request / by call for proposals)
Funding type §27
Research type
  • Fundamental research
  • Applied research
Subject areas
  • 101018 - Statistics
  • 101026 - Time series analysis
  • 102022 - Software development
  • 502025 - Econometrics
  • 102035 - Data science
Research Cluster No research Research Cluster selected
Gender aspects Genderrelevance not selected
Project focus
  • Science to Science (Quality indicator: I)
Classification raster of the assigned organisational units:
working groups No working group selected

Cooperations

No partner organisations selected