Model use #10

Open
KongFulu opened this issue Apr 13, 2024 · 3 comments
Comments

@KongFulu

Hello, can this large model be used directly? If it needs training first, how long does the training take?

@KongFulu
Author

I mean, how long would training take on my own computer (a 4060 GPU)?

@ch3cook-fdu
Contributor

We have uploaded the pre-trained weights of the 3D generalist at https://huggingface.co/CH3COOK/LL3DA-weight-release/tree/main. Training our model on a single 4060 GPU may be challenging because large language models use a lot of CUDA memory. Instead, you could try training your own model with a smaller LLM backbone, e.g. GPT-2 or OPT-125m.
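
To give a rough feel for the memory argument above, here is a back-of-the-envelope sketch, not a measurement of this repository. It assumes full fine-tuning in fp32 with an Adam-style optimizer (about 16 bytes per parameter for weights, gradients, and two optimizer moments), ignores activations entirely, and assumes the 4060 has 8 GiB of memory. The parameter counts are approximate and the 1.3B entry is a hypothetical larger backbone used only for comparison.

```python
# Rough lower bound on GPU memory for full fine-tuning with Adam in fp32.
# Assumption: 4 bytes weights + 4 bytes gradients + 8 bytes Adam moments
# per parameter. Activations and framework overhead are NOT included.
BYTES_PER_PARAM = 4 + 4 + 8

# Assumption: memory available on the GPU, in GiB.
GPU_MEMORY_GIB = 8


def training_memory_gib(num_params: int) -> float:
    """Return the estimated training-state memory in GiB."""
    return num_params * BYTES_PER_PARAM / 2**30


# Approximate parameter counts, for illustration only.
BACKBONES = {
    "GPT-2 (small)": 124_000_000,
    "OPT-125m": 125_000_000,
    "hypothetical 1.3B backbone": 1_300_000_000,
}

for name, num_params in BACKBONES.items():
    need = training_memory_gib(num_params)
    verdict = "may fit" if need < GPU_MEMORY_GIB else "unlikely to fit"
    print(f"{name}: ~{need:.1f} GiB of training state, {verdict} in {GPU_MEMORY_GIB} GiB")
```

Under these assumptions the small backbones leave headroom for activations, while a billion-parameter backbone exceeds the card before any activations are counted. Freezing most weights or using lower precision would change the numbers.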

@yitianlian

Nice work! Will the fine-tuned model be released?


3 participants