ForgeWM Checkpoints
Training checkpoints for ForgeWM.
Available Checkpoints
| File | Stage | Description |
|---|---|---|
stage0/model.pt |
Stage 0 | Bidirectional SFT (domain adaptation, 4000 steps) |
stage1/model.pt |
Stage 1 | Teacher-Forcing Causal AR (10000 steps) |
stage2/model.pt |
Stage 2 | Consistency Distillation (6000 steps) |
stage3/model.pt |
Stage 3 | DMD final model (4-step real-time inference) |
stage3_new/model.pt |
Stage 3 (improved) | DMD with cache-refresh fix in training pipeline + sliding-window (las=6, sink=0) inference alignment (4000 steps). Better long-video stability and reduced HUD shrinkage / OOD drift on 22s rollouts. |
Notes on stage3_new
stage3_new was retrained from the same Stage 2 starting point as the original stage3, but with two pipeline-level fixes:
- Train-time KV cache refresh β the per-chunk t=0 cache rerun is now applied unconditionally. Previously it was skipped for grad-enabled chunks, creating a train/inference distribution shift on the KV cache (training cache was noisy at the exit_flags step, while inference cache reaches near-clean tβ0). See https://github.com/asdfo123/ForgeWM/blob/main/pipeline/self_forcing_training.py for the fix.
- Inference-time sliding-window alignment β the inference pipeline now correctly forwards the top-level
local_attn_size/sink_sizefrom the training config intoWanDiffusionWrapper. Previously these defaulted to-1/0and inference silently ran with full causal attention while the model was trained with a 6-frame window.
These changes are most visible on long-horizon (22s+) rollouts.
Usage
# Download
huggingface-cli download asdfo123/ForgeWM --local-dir ./ckpts --repo-type model
# Inference (default 4-step recipe)
python inference.py --checkpoint_path ckpts/stage3_new/model.pt --image_path demo_images/forest.png --action_type forward
π§ Coming Soon
- More intermediate checkpoints
- Few-step variants (2-step / 1-step with first-chunk FFE schedule)
Citation
@misc{forgewm2026,
title={ForgeWM: A Reproducible Training Recipe for Action-Controllable World Models},
author=ForgeWM Team,
year={2026},
url={https://github.com/asdfo123/ForgeWM}
}
Contact
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support