Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can it be launched on usual VPS? For example, 6 CPU 16 RAM (usual chips) #19

Open
CombainerA19 opened this issue Jul 9, 2022 · 2 comments

Comments

@CombainerA19
Copy link

CombainerA19 commented Jul 9, 2022

Sorry for maybe a stupid question, fortunately, I find your product and want to integrate it into social media accounts; I don't release how to use it from the box (via Docker). As it said in the instruction I need a strong PC with GPU chips (which are pretty expensive for me) and I wonder if there is a way to utilize " input text prompt/variables - get a response in console / API"?
Can you please, @artnitolog, comment on this?

For reference, a way how https://porfirevich.ru/ works.

@blokhin
Copy link

blokhin commented Jul 10, 2022

As you can see, you have to use very advanced GPU-hardware in order to run the inference.

On the other hand, it should be possible to run it using also much limited GPU memory via huggingface-accelerate. Generally, you have to use the hook for the pytorch forward method. This hook should load the weights into GPU memory iteratively and save the intermediate representation to disk until a working inference is obtained. There is a discussion here pointing out it is in principle possible.

@CombainerA19
Copy link
Author

As you can see, you have to use very advanced GPU-hardware in order to run the inference.

On the other hand, it should be possible to run it using also much limited GPU memory via huggingface-accelerate. Generally, you have to use the hook for the pytorch forward method. This hook should load the weights into GPU memory iteratively and save the intermediate representation to disk until a working inference is obtained. There is a discussion here pointing out it is in principle possible.

Thanks, Evgeny for the help and links, I will listen to discussions and check product updates.
AI is cool 🤘🏻

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants