ACTIVE Research

EasyWorldModel

Uncertainty-calibrated residual world models with 18% lower prediction error and deep ensemble uncertainty decomposition.

status ACTIVE
type Research
stack Python PyTorch Hydra

// DESCRIPTION

Uncertainty-Calibrated Residual World Models

EasyWorldModel introduces residual learning into model-based reinforcement learning world models, where the model predicts the delta between consecutive states rather than absolute next states. This residual formulation dramatically simplifies the learning problem for environments with smooth dynamics, reducing prediction error by 18% compared to standard direct prediction approaches.

A key innovation is the integration of deep ensemble uncertainty decomposition into the residual prediction framework. The system trains an ensemble of residual predictors and decomposes prediction uncertainty into epistemic (model disagreement) and aleatoric (inherent stochasticity) components. This decomposition enables intelligent exploration: the agent preferentially explores states with high epistemic uncertainty while being appropriately cautious in high-aleatoric regions.

The uncertainty calibration is validated through reliability diagrams showing that predicted confidence intervals match empirical coverage to within 3% across tested environments. This calibration is critical for safe model-based planning, where overconfident predictions can lead to catastrophic policy failures.

The implementation uses PyTorch for the neural residual predictors and Hydra for experiment configuration management, enabling systematic hyperparameter sweeps and reproducible comparisons across environments (MuJoCo, DeepMind Control Suite).

// HIGHLIGHTS

  • 18% lower prediction error through residual state delta learning
  • Deep ensemble uncertainty decomposition into epistemic and aleatoric components
  • Uncertainty calibration within 3% of empirical coverage across environments
  • Intelligent exploration driven by epistemic uncertainty targeting
  • Hydra-based experiment management for reproducible RL research