Skip to main content

Reproduce DreamerV3 and TD-MPC2 Locally

This guide explains the currently supported local reproducibility path for the two reference families in WorldFlux.

It stays inside the evidence-backed onboarding contract:

  • CPU smoke first
  • local scaffold training second
  • quick compatibility verification third

It does not claim proof-grade paper reproduction.

1. Prove the local environment first

Run the official CPU smoke before comparing model families:

uv sync --extra dev
uv run python examples/quickstart_cpu_success.py --quick

This confirms that your local install can create, train, and roll out a small Dreamer CI model on CPU with machine-checkable success criteria.

2. Compare the shared contract smoke

Run the repository comparison script:

uv sync --extra dev --extra training
uv run python examples/compare_unified_training.py --quick

This is the supported repository-level reproduction step for comparing DreamerV3 and TD-MPC2 under the same synthetic smoke conditions and the same quick verification flow.

The emitted quick verify results separate workflow completion from statistical quality. A warning-level quick verify result still means the comparison demo produced interpretable artifacts and completed the contract-smoke lane.

3. Reproduce the newcomer project flow

For a scaffolded project, generate a local project and run the same commands a new user would run:

worldflux init my-world-model
cd my-world-model
worldflux train --steps 5 --device cpu
worldflux verify --target ./outputs --mode quick

Repeat that flow once with a Dreamer-oriented setup and once with a TD-MPC2 oriented setup if you want to compare both families under your own local conditions.

For the guaranteed real-environment newcomer lane in this MVP, prioritize the Dreamer scaffold and install Atari extras first:

uv sync --extra training --extra atari

4. Interpret the result correctly

This workflow gives you:

  • a reproducible local smoke path
  • generated outputs and run manifests
  • a structured quick-verify result

This workflow does not give you:

  • upstream non-inferiority evidence
  • publishable proof artifacts
  • benchmark claims

5. When to use parity workflows

Move to worldflux parity only after the local path above is stable:

worldflux parity --help
worldflux parity proof-run --help
worldflux parity proof-report --help

Proof-oriented parity remains an advanced workflow. Public proof claims require published evidence bundles as described in Parity Harness.