Lies We Can Trust: Quantifying Action Uncertainty with Inaccurate Stochastic Dynamics Through Conformalized Nonholonomic Lie Groups

  ArXiv   PDF   Code

Under Review

We propose Conformal Lie-group Action Prediction Sets (CLAPS), a symmetry-aware conformal prediction-based algorithm that constructs, for a given action, a set guaranteed to contain the resulting system configuration at a user-defined probability. Our assurance holds under both aleatoric and epistemic uncertainty, non-asymptotically, and does not require strong assumptions about the true system dynamics, the uncertainty sources, or the quality of the approximate dynamics model. Typically, uncertainty quantification is tackled by making strong assumptions about the error distribution or magnitude, or by relying on uncalibrated uncertainty estimates — i.e., with no link to frequentist probabilities — which are insufficient for safe control. Recently, conformal prediction has emerged as a statistical framework capable of providing distribution-free probabilistic guarantees on test-time prediction accuracy. While current conformal methods treat robots as Euclidean points, many systems have non-Euclidean configurations, e.g., some mobile robots have SE(2)SE(2). In this work, we rigorously analyze configuration errors using Lie groups, extending previous Euclidean Space theoretical guarantees to SE(2)SE(2). Our experiments on a simulated Jetbot, and on a real MBot, suggest that by considering the configuration space’s structure, our symmetry-informed nonconformity score leads to more volume-efficient prediction regions that represent the underlying uncertainty better than existing approaches.

CLAPS Overview
Title Figure. Our proposed algorithm (CLAPS) constructs prediction regions Cq\mathcal{C}^q (in C-Space) that are marginally guaranteed to contain the next unknown system configuration at a user-set probability (1α)(1-\alpha). By considering the robot’s symmetry, we can construct more efficient prediction regions.

Problem Setting

Let qQq\in \mathcal Q be the robot configuration, q˙TqQ\dot q \in T_q \mathcal Q the generalized velocity, and s:=(q,q˙)TQs := (q,\dot q) \in T\mathcal Q the state. We consider holonomic and nonholonomic systems whose Q\mathcal Q is the Lie group SE(2)SE(2) (unicycles, car-like robots, quadrotors, surface/underwater vehicles, satellites, quadrupeds’ COM, …). The unknown dynamics evolve as

sk+1=f(sk,uk,wk),wkPnoise, s_{k+1} = f(s_k, u_k, w_k), \qquad w_k \sim P_{noise},

where ff is unknown, wkw_k is an iid disturbance drawn from an unknown distribution, and ukRmu_k \in \mathbb R^m is the control input. Inaccuracies in modeling ff may arise e.g., from domain shifts between fitting and deployment, and result in epistemic uncertainty. Additionally, wkw_k introduces aleatoric uncertainty, and may represent external disturbances such as wind gusts, wheel slippage, or terrain bumps.

Objective

For a given admissible action udesu_{des}, provide a C-Space prediction region CqQC^q \subseteq \mathcal Q that contains the resulting (unknown) configuration q1q_1 with probability at least (1α)(1-\alpha):

P(q1Cq)1α,α(0,1). \mathbb{P}(q_1 \in \mathcal C^q ) \ge 1 - \alpha, \quad \alpha \in (0,1).

where α\alpha is the user-set acceptable failure probability. While purely achieving this goal is trivial, e.g., by predicting the entire space (Cq=Q)(C^q = \mathcal Q), we additionally want CqC^q to be as tight/volume-efficient as possible, to make it practical for downstream robotic tasks such as safe control. We do not make strong assumptions about the fidelity of f~\tilde{f}, or the nature of the stochastic disturbances.

CLAPS

CLAPS uses a dataset of state transitions (Dcal)(D_{cal}) to calibrate the uncertainty estimates provided by approximate dynamics models. CLAPS can be applied as a post-hoc calibration layer on top of existing Lie-algebraic Gaussian uncertainty estimators (e.g., Invariant EKF), turning their approximate covariances into provably calibrated ones. By using a symmetry-respective score metric, our approach produces prediction regions that are more volume-efficient than existing conformal prediction baselines that treat the robot’s configuration as Euclidean.

CLAPS Method Diagram
Method Figure. Conformal Lie-Group Action Prediction Sets | Offline: a dataset of state transitions is used jointly with an approximate dynamical model to derive a rigorous symmetry-aware probabilistic error bound on the configuration predictions. Online: our algorithm takes in a desired action udesu_{des} and computes a calibrated C-Space prediction region Cq\mathcal{C}^q that is marginally guaranteed to contain the true configuration resulting from executing udesu_{des}.

The prediction region constructed by CLAPS (CqQ)(C^q \subseteq Q) can be used for probably-safe control in three main ways (for more details refer to Section §\SV-C):

  1. Configuration Check: a (sample) configuration gg belongs in CqC^q if log(g~1g)Σ~1log(g~1g)χα2(dimg)\sqrt{\log(\tilde{g}^{-1}g)^\top \tilde{\Sigma}^{-1}\log(\tilde{g}^{-1}g)} \le \chi^2_{\alpha}(\dim \mathfrak g) — quick to evaluate in batch
  2. C-space set: The CqC^q can be reconstructed by Alg. 2, for example to check if CqQsafeC^q \subseteq \mathcal Q_{safe}, for a known safe set QsafeQ\mathcal Q_{safe} \subseteq \mathcal Q.
  3. Workspace set: CqC^q can be inflated by the robot’s radius and mapped to the workspace (R2)(\mathbb R^2) to perform collision checks with known obstacles.

Experiments

We compare CLAPS against seven baselines in both simulation (JetBot) and hardware (MBot) to demonstrate its improved efficiency and representation quality. We model both systems as a second-order unicycles, and perform standard system identification to estimate the inertial properties. In all the experiments below we use α=0.1\alpha=0.1.

A) JetBot Experiments (Simulation)
 In Isaac Sim, we independently sampled additive perturbations to udesu_{des}, introducing aleatoric uncertainty. This leads to the well-known banana-shaped distributions seen below. Epistemic uncertainty arose from unmodeled effects (e.g., friction), and imperfections in the mass/inertia estimation. The Figure below demonstrates CLAPS’ ability to represent the underlying dynamics uncertainty of the unknown system (MC particles).

Workspace method comparison plot
Workspace (R2\mathbb{R}^2) footprint. Workspace marginalization of the C-Space regions generated by the methods, over two of the 625 JetBot validation trials. Left: lower linear and angular velocity. Right: higher velocity case. InEKF+MLE has expected pose g~1\tilde{g}_1 shown as the gray dot. All other methods have the same expected pose, which is represented by the blue dot. Both InEKF+2M and InEKF+MLE produce the same uncertainty covariance for all initial states and control inputs. The Point Prediction (PP) methods generate large regions with boundaries lying outside the plots’ margins. SS EKF, InEKF, InEKF+2M, and InEKF+MLE are not guaranteed to contain the resulting configuration at the user-set likelihood. Qualitatively, CLAPS appears to more accurately represent the underlying uncertainty distribution than the symmetry-unaware baselines.

Quantitatively, CLAPS achieves the highest average Intersection over Union (IoU) with the MC particles, validating its alignment with the systems’ uncertainty propagation, and CLAPS has a smaller C-space volume than all calibrated baselines in each of the 625 validation trials we tested.

JetBot Performance Table

Below we visualize the C-space regions CqC^q constructed by the different methods in three of the 625 validation trials. The State Space (SS) baselines produce hyperellipsoids in configuration space, due to treating it as Euclidean. Instead, both the Invariant Kalman Filter (InEKF) and CLAPS produce symmetry-respective prediction regions, better capturing the underlying uncertainty. While the uncertainty estimates provided by the InEKF are approximate, CLAPS provides provably calibrated prediction regions suitable for safe-control.

B) MBot Experiments (Hardware)
 We also validated our method on an MBot, a differential-drive vehicle shown below. Despite a relatively-small calibration dataset corresponding to \approx2 min of driving data (Dcal=237)(\lvert D_{cal}\rvert = 237), our method provably satisfied the user-specified safety specifications, thanks to its non-asymptotic guarantees. CLAPS uses DcalD_{cal} to derive data-driven provable (probabilistic) bounds on the uncertainty arising from both model mismatch, and inherent stochasticity.

The system configuration and velocity were estimated using a motion capture system. Uncertainty in the resulting configuration arose due to inaccuracies in inertial property estimation, actuation delays, center-of-mass deviation from the body-fixed origin, ground-surface imperfections, friction, network jitter, etc. The collection procedure of system transitions that make up DcalD_{cal} and the validation set is shown below.

Our Python-implementation of CLAPS can run at 25 Hz, the sampling frequency of the MBot’s sensors, making it serviceable for online use.

BibTeX (cite this!)

@misc{marques2025liestrustquantifyingaction,
      title={Lies We Can Trust: Quantifying Action Uncertainty with Inaccurate Stochastic Dynamics through Conformalized Nonholonomic Lie Groups}, 
      author={Luís Marques and Maani Ghaffari and Dmitry Berenson},
      year={2025},
      eprint={2512.10294},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2512.10294}, 
}