Skip to content

Unified Training Comparison

This demo shows that DreamerV3 and TD-MPC2 can run through the same high-level training flow.

Run

uv sync --extra dev
uv run python examples/compare_unified_training.py --quick

What It Demonstrates

  • same ReplayBuffer source for both families
  • same TrainingConfig contract
  • same artifact generation flow
  • same visualization helper (write_reward_heatmap_ppm)

Artifacts

  • outputs/comparison/summary.json
  • outputs/comparison/dreamer.ppm
  • outputs/comparison/tdmpc2.ppm