Inquiry on training data and setup for T2I training #54

Open
youngwanLEE opened this issue Jun 11, 2024 · 2 comments

@youngwanLEE

youngwanLEE commented Jun 11, 2024

Firstly, I would like to express my gratitude and respect for the remarkable work you’ve done by open-sourcing the T2I model, which is a significant contribution to the community.

I have two questions:

  1. I have gone through the associated paper but was unable to find specific details on the datasets used for training the T2I model. Could you please confirm if this information is available elsewhere or if I may have overlooked it in the paper? Any details you could share would be greatly appreciated.

  2. While I have read about the training section you’ve shared for the T2I model, there seems to be a lack of information regarding the training data setup. I am particularly interested in the data structure and how to properly organize it for training. Additionally, it would be extremely helpful if you could provide an example of a toy dataset, similar to the one shown in Pixart-Sigma, and instructions to verify if the training CLI is functioning as intended.

I understand that providing this detailed information might be demanding, but I believe that such transparency would greatly benefit the wider adoption of the Lumina project within the open-source community.

Thank you for considering my request. I look forward to your response and any guidance you can provide.

@PommesPeter
Contributor

> Additionally, it would be extremely helpful if you could provide an example of a toy dataset, similar to the one shown in Pixart-Sigma, and instructions to verify if the training CLI is functioning as intended.

Thank you for your suggestion! We will provide a detailed description of the training in the next few days.

@PommesPeter
Contributor

Hi @youngwanLEE,

We have updated the training instructions. Please check them out here: https://github.com/Alpha-VLLM/Lumina-T2X/tree/main/lumina_t2i#training
