Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about the released i2v model. #86

Open
Aria-Zhangjl opened this issue Feb 17, 2024 · 3 comments
Open

Questions about the released i2v model. #86

Aria-Zhangjl opened this issue Feb 17, 2024 · 3 comments

Comments

@Aria-Zhangjl
Copy link

Hi, thanks for your great work!
I have some questions about your released i2v models. Based on unet_i2vgen.py and #49 , I understand that i2vgen-xl is a single-stage model that takes both the image and text as conditions during video generation. However, as in the technical report, i2vgen is a two-stage diffusion model that takes the image as condition in the base stage and text in the refinement stage. Therefore I am curious about the role played by the input text in this single-stage generation process. What's the difference between the reported two-stage model and the released one-stage model? Can the one-stage model be considered as an image animator guided by the input text, similar to PIAhttps://arxiv.org/pdf/2312.13964.pdf? Additionally, I would like to know which dataset was used to train this open-source model.
Thank you!

@lmxyy
Copy link

lmxyy commented Feb 17, 2024

I also found the released model is a single-stage one and got confused.

@Steven-SWZhang
Copy link
Collaborator

Hello, thank you for your interest in our work. We have open-sourced the single-stage I2VGen-XL model here, which is capable of fully retaining the content of the input images. The training data used for this model is the same as that of the two-stage model. Our primary intention for open-sourcing this model is to provide some assistance to the community for research purposes. Currently, there are no plans to open-source the two-stage version of the I2VGen-XL model. However, our HiGen method is about to be open-sourced soon, which includes the two-stage process and can serve as an alternative. Thank you for your attention.

@ChengHSUHSU
Copy link

ChengHSUHSU commented Apr 17, 2024

There are related experiment result about the single stage model?
Instead, It look like different model with proposed I2VGen-XL.

thanks,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants