Path-minimizing latent ODEs for improved extrapolation and inference

Matt L. Sampson, Peter Melchior

Research output: Contribution to journalArticlepeer-review

Abstract

Latent ordinary differential equation (ODE) models provide flexible descriptions of dynamic systems, but they can struggle with extrapolation and predicting complicated non-linear dynamics. The latent ODE approach implicitly relies on encoders to identify unknown system parameters and initial conditions (ICs), whereas the evaluation times are known and directly provided to the ODE solver. This dichotomy can be exploited by encouraging time-independent latent representations. By replacing the common variational penalty in latent space with an ℓ 2 penalty on the path length of each system, the models learn data representations that can easily be distinguished from those of systems with different configurations. This results in faster training, smaller models, more accurate interpolation and long-time extrapolation compared to the baseline ODE models with a gated recurent unit (GRU), recurrent neural network, and long-short-term-memory encoder/decoders on tests with damped harmonic oscillator, self-gravitating fluid, and predator-prey systems. We also demonstrate superior results for simulation-based inference of the Lotka-Volterra parameters and ICs by using the latents as data summaries for a conditional normalizing flow. Our change to the training loss is agnostic to the specific recognition network used by the decoder and can therefore easily be adopted by other latent ODE models.

Original languageEnglish (US)
Article number025047
JournalMachine Learning: Science and Technology
Volume6
Issue number2
DOIs
StatePublished - Jun 30 2025

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Artificial Intelligence

Keywords

  • latent
  • ODEs
  • regularization

Fingerprint

Dive into the research topics of 'Path-minimizing latent ODEs for improved extrapolation and inference'. Together they form a unique fingerprint.

Cite this