Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fast captioning #75

Closed
rahimentezari opened this issue Nov 2, 2023 · 5 comments
Closed

Fast captioning #75

rahimentezari opened this issue Nov 2, 2023 · 5 comments

Comments

@rahimentezari
Copy link

rahimentezari commented Nov 2, 2023

I would like to use CogVLM for captioning images in large scale. Currently using Chat() function takes about 2.5 sec for captioning one image on an A100 GPU which makes it almost impossible to use for million-scale. What do you recommend to speed up? I see that quantization is not yet supported for example #9.

@eezywu
Copy link

eezywu commented Nov 9, 2023

same problem

@1049451037
Copy link
Member

Quantization can be beneficial. However, that portion of the code was not written by me. As far as I know, there are no plans to open-source the code.

@1049451037
Copy link
Member

1049451037 commented Dec 7, 2023

We now support 4-bit quantization! See README for more details.

@ProGamerGov
Copy link

@rahimentezari Have you tested captioning speed on the A100 GPU for 4-bit quantization yet?

@chakrabortyrajatsubhra
Copy link

Hey can anyone say how to caption a folder of images with the sat demo model? I will have a fixed query like describe the image. Any suggestions would be greatly helpful.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants