Addressing Misspecification in Simulation-based Inference through Data-driven Calibration

  • Antoine Wehenkel
  • , Juan L. Gamella
  • , Ozan Sener
  • , Jens Behrmann
  • , Guillermo Sapiro
  • , Jörn Henrik Jacobsen
  • , Marco Cuturi

Research output: Contribution to journalConference articlepeer-review

Abstract

Driven by steady progress in deep generative modeling, simulation-based inference (SBI) has emerged as the workhorse for inferring the parameters of stochastic simulators. However, recent work has demonstrated that model misspecification can compromise the reliability of SBI, preventing its adoption in important applications where only misspecified simulators are available. This work introduces robust posterior estimation (RoPE), a framework that overcomes model misspecification with a small real-world calibration set of ground-truth parameter measurements. We formalize the misspecification gap as the solution of an optimal transport (OT) problem between learned representations of real-world and simulated observations, allowing RoPE to learn a model of the misspecification without placing additional assumptions on its nature. RoPE demonstrates how OT and a calibration set provide a controllable balance between calibrated uncertainty and informative inference, even under severely misspecified simulators. Results on four synthetic tasks and two real-world problems with groundtruth labels demonstrate that RoPE outperforms baselines and consistently returns informative and calibrated credible intervals.

Original languageEnglish (US)
Pages (from-to)65949-65980
Number of pages32
JournalProceedings of Machine Learning Research
Volume267
StatePublished - 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver, Canada
Duration: Jul 13 2025Jul 19 2025

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Addressing Misspecification in Simulation-based Inference through Data-driven Calibration'. Together they form a unique fingerprint.

Cite this