Benchmarks¶

WorldFlux provides three reproducible benchmark entrypoints with aligned CLI contracts.

Common CLI Contract¶

All benchmark scripts support:

All runs emit:

uv run python benchmarks/benchmark_dreamerv3_atari.py --quick --seed 42

Full-mode example:

uv run python benchmarks/benchmark_dreamerv3_atari.py --full --data atari_data.npz --seed 42

Expected minimum result:

uv run python benchmarks/benchmark_tdmpc2_mujoco.py --quick --seed 42

Full-mode example:

uv run python benchmarks/benchmark_tdmpc2_mujoco.py --full --data mujoco_data.npz --seed 42

Expected minimum result:

uv run python benchmarks/benchmark_diffusion_imagination.py --quick --seed 42

Full-mode example:

uv run python benchmarks/benchmark_diffusion_imagination.py --full --seed 42

Expected minimum result: