This repository is an unofficial implementation of
PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference. It is a non-strict reproduction of PackForcing's core ideas on top of the Causal Forcing chunkwise `3f21l` codebase, rather than a paper-faithful reimplementation.
The repo was built out of curiosity about how PackForcing can train and infer with such different RoPE ranges without immediately collapsing. I am also a beginner, so this was a casual paper-reproduction exercise to learn by doing.
- Ported training-free Deep Forcing as a baseline.
- Implemented PackForcing KV cache semantics, including three-stage memory, top-k mid selection, and RoPE correction (see the sketch after this list).
- Implemented a trainable HR Compressor.
- Implemented a training-free LR Compressor.
- Built on Causal Forcing rather than Self Forcing. I do not expect this to be the main blocker, and it may even behave better.
- Uses chunkwise `3f21l` rather than `4f20l`, mainly to avoid redoing the full ODE-initialization pipeline.
- The HR Compressor removes temporal compression because the base setup is `3f21l`. My current intuition is that this mostly lowers the compression ratio.
- Reuses the Causal Forcing training dataset. The PackForcing paper seems to use a somewhat different dataset setup.
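Since the three-stage cache is the heart of the method, here is a minimal sketch of the selection logic as I understand it. The function and the mean query-key scoring rule are my own simplifications for illustration (the `queryscore` runs below use something query-based, but the exact rule may differ); only the sink / mid top-k / recent split and the contiguous RoPE re-indexing follow the description above.

```python
import torch

def select_cache_chunks(chunk_keys, query, sink=3, recent=2, k=16):
    """Three-stage KV-cache selection in chunk units (illustrative sketch).

    chunk_keys: list of per-chunk key tensors, each of shape [chunk_len, dim]
    query:      current query tensor of shape [q_len, dim]
    Returns the kept chunk indices and their corrected RoPE chunk slots.
    """
    n = len(chunk_keys)
    sink_ids = list(range(min(sink, n)))
    recent_ids = list(range(max(n - recent, sink), n))
    mid_ids = [i for i in range(n) if i not in sink_ids and i not in recent_ids]

    # Score each mid chunk against the current query and keep the top-k.
    # Mean-pooled dot product is an assumed stand-in for the real scoring rule.
    if mid_ids and k > 0:
        q = query.mean(dim=0)                                   # [dim]
        scores = torch.stack([chunk_keys[i].mean(dim=0) @ q for i in mid_ids])
        top = torch.topk(scores, min(k, len(mid_ids))).indices.tolist()
        mid_ids = sorted(mid_ids[i] for i in top)
    else:
        mid_ids = []

    kept = sink_ids + mid_ids + recent_ids
    # RoPE correction: re-index the kept chunks contiguously so the position
    # range stays close to what short-video training exercised, instead of
    # keeping the (potentially huge) original absolute positions.
    corrected_slots = {chunk: slot for slot, chunk in enumerate(kept)}
    return kept, corrected_slots
```

With the paper-like setting used below (sink=3, recent=2, top-k=16), the cache holds at most 3 + 16 + 2 = 21 chunks regardless of how long the video grows.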
Baseline: `causal_forcing.pt` + Deep Forcing. In the comparison videos below, the left side uses sink + recent as the KV cache, while the right side uses sink + mid top-k + recent.
- deepforcing_showcase_sample1.mp4
- deepforcing_showcase_sample2.mp4
- deepforcing_showcase_sample3.mp4
- deepforcing_showcase_sample4.mp4
I also tried a more paper-like setup: sink=3, recent=2, mid top-k=16
(chunk units), trained for 3k steps on 8 GPUs with global batch size 8 and no
gradient accumulation.
There is indeed no large-scale color drift or full collapse as generation goes on. However, the aerial-view sample still breaks, there is a visible jump around 5-10 seconds, the frames remain noisy throughout, and gray blocks sometimes appear in the top-left corner.
Next I will probably run a few more experiments to narrow down where these issues come from. If they can all be resolved, the result should be fairly close to the PackForcing paper.
- packforcing_queryscore_showcase_sample1.mp4
- packforcing_queryscore_showcase_sample2.mp4
- packforcing_queryscore_showcase_sample3.mp4
- packforcing_queryscore_showcase_sample4.mp4
I also ran a few extra checks. For example, when I reduce inference-time
top-k selection from 16 to 1, the train/infer RoPE range stays aligned,
and the visible jumps disappear, which makes me suspect the jump issue is
related to RoPE. I also tracked how the frames change across training steps:
the videos tend to become more static as training goes on, while the gray
blocks get gradually suppressed. My guess is that the gray-block artifact shows
up because the HR Compressor is still not fully trained.
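To make that suspicion concrete, here is the back-of-the-envelope arithmetic I have in mind. It assumes the contiguous re-indexing from the sketch above and that training clips are short enough that the 16 mid slots rarely fill; both are assumptions on my part, not something I have verified against the paper.

```python
# Effective RoPE span of the compressed cache, in chunk units, assuming the
# kept chunks are re-indexed contiguously (see the selection sketch above).
def cache_span_chunks(sink=3, recent=2, k=16):
    return sink + k + recent + 1  # cached chunks + the chunk being denoised

print(cache_span_chunks(k=16))  # 22: only reached once a video is long enough
                                # to fill all 16 mid slots, which short
                                # training clips may never do
print(cache_span_chunks(k=1))   # 7: comfortably inside the range that
                                # training already exercises
```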
In the comparison below, the left video uses top-k=16, while the right video
uses top-k=1. Both are 20-second inference results.
- packforcing_snowfield_s3r2_default_steps_2x2.mp4
- packforcing_snowfield_s3r2_topk1_steps_2x2.mp4
Use the same environment and data setup as
Causal Forcing. The default
training initialization uses `ckpt/causal_forcing_ckpt/causal_ode.pt`, so you
need to download that checkpoint into the expected local path first.
```bash
python inference_packforcing.py \
    --config_path configs/packforcing_dmd_chunkwise_hr_8gpu_sink3_recent2.yaml \
    --checkpoint_path <your_packforcing_checkpoint.pt> \
    --data_path prompts/<your_prompt_file.txt> \
    --output_folder output/<your_run_name> \
    --use_ema
```

The default 8-GPU training recipe is:

```bash
./scripts/train_packforcing_dmd_hr_8gpu.sh
```

It defaults to `configs/packforcing_dmd_chunkwise_hr_8gpu_sink3_recent2.yaml`, and the training behavior is broadly consistent with the paper (maybe).
- As this reproduction moved forward, I found that training-free methods such as Deep Forcing and Infinite-Forcing can already produce reasonably watchable infinite continuation. As long as the first few frames are kept and the RoPE behavior is corrected, the video can stay visually coherent without the colors collapsing over time.
- I also found that Self Forcing itself tends to produce relatively low-motion videos. More generally, these trained infinite-continuation methods seem to prefer lowering the motion further, because once the motion gets too large, the generation can more easily drift outside the memory patterns seen during training and collapse.
This codebase builds on ideas and components from Causal Forcing, Deep Forcing, and PackForcing.
If you use this fork, please cite the upstream Causal Forcing paper:
```bibtex
@article{zhu2026causal,
  title={Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation},
  author={Zhu, Hongzhou and Zhao, Min and He, Guande and Su, Hang and Li, Chongxuan and Zhu, Jun},
  journal={arXiv preprint arXiv:2602.02214},
  year={2026}
}
```