-
Notifications
You must be signed in to change notification settings - Fork 6.4k
Cosmos Predict2 #11695
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cosmos Predict2 #11695
Conversation
…pecific scheduler
@yiyixuxu This PR contains the version that works with We will probably be sticking with this PR IIUC from our discussion, so I'll update the scheduler configs once you confirm (currently only the text2image pipeline has been updated, so will update video2world soon too) PRs: |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thank you, Aryan and the HuggingFace team!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good!
We have new weight PRs now since the original weights were updated at the time of release: |
python3.10 code
ERROR:
|
Since there hasn't been a diffusers release yet, you need to install from the main branch to use Cosmos. A release will be happening soon, but for the time being, please try: |
why num_channels_latents = self.transformer.config.in_channels - 1 ? |
Video2World models have an additional channel for concatenated conditioning mask, which indicates what frames to use for video extending condition signal. The actual latent channels is one less than the transformer in_channels |
The cosmos is within us. We are made of star-stuff. We are a way for the universe to know itself.
cc @pjannaty @chenhsuanlin @fitsumreda @asfiyab-nvidia @amolfasale