Fast captioning #75
Comments
Same problem.
Quantization can be beneficial. However, that portion of the code was not written by me. As far as I know, there are no plans to open-source the code.
We now support 4-bit quantization! See README for more details.
@rahimentezari Have you tested captioning speed on the A100 GPU with 4-bit quantization yet?
Hey, can anyone say how to caption a folder of images with the sat demo model? I will use a fixed query such as "describe the image". Any suggestions would be greatly appreciated.
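For the folder question, one possible shape is a plain loop that sends every image through the same query. This is only a sketch: `caption_fn` is a hypothetical placeholder for whatever single-image call the sat demo exposes (it is not a real CogVLM function), and the suffix list and default query are assumptions.

```python
from pathlib import Path
from typing import Callable, Dict, Union

# Assumed set of image extensions; adjust to match the dataset.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".bmp", ".webp"}


def caption_folder(
    folder: Union[str, Path],
    caption_fn: Callable[[Path, str], str],
    query: str = "Describe the image.",
) -> Dict[str, str]:
    """Apply one fixed query to every image file in `folder`.

    `caption_fn` is a hypothetical wrapper around the model's
    single-image call: it takes (image_path, query) and returns text.
    """
    captions: Dict[str, str] = {}
    for path in sorted(Path(folder).iterdir()):
        # Skip subdirectories and non-image files.
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            captions[path.name] = caption_fn(path, query)
    return captions
```

Keeping the model call behind a callable means the loop can be checked with a stub before loading the model, and the same loop works whichever inference backend ends up being used.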
I would like to use CogVLM to caption images at large scale. Currently, the Chat() function takes about 2.5 seconds to caption one image on an A100 GPU, which makes it almost impossible to use at million-image scale. What do you recommend to speed it up? I see that quantization is not yet supported (see, for example, #9).
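To put the 2.5 seconds per image in perspective, here is a rough back-of-the-envelope estimate. It assumes strictly sequential captioning and, for the multi-GPU case, perfectly linear scaling, which is an idealization; `total_days` is an illustrative helper, not part of CogVLM.

```python
SECONDS_PER_DAY = 86_400


def total_days(num_images: int, seconds_per_image: float, num_gpus: int = 1) -> float:
    """Wall-clock days to caption `num_images`, assuming ideal linear scaling."""
    return num_images * seconds_per_image / num_gpus / SECONDS_PER_DAY


# One million images at 2.5 s each on a single GPU.
print(f"{total_days(1_000_000, 2.5):.1f} days on one GPU")               # 28.9 days
# The same workload split evenly over 8 GPUs (assumed, idealized).
print(f"{total_days(1_000_000, 2.5, num_gpus=8):.1f} days on 8 GPUs")   # 3.6 days
```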