Implementation of Image-to-Text (Captioning) #6

Closed
bryanwong17 opened this issue May 2, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@bryanwong17

bryanwong17 commented May 2, 2024

Hi, I was wondering whether CONCH can directly convert an image to text. From the code, it seems CONCH only supports image-to-text retrieval: given an image and several candidate texts, it checks which text is most similar to the image. However, the paper also shows an example of CONCH doing captioning, with a comparison between predicted and corrected captions. If captioning is supported, could you please provide the code for it? Thanks!
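To make the retrieval behaviour described above concrete, here is a minimal sketch of ranking candidate texts against one image by cosine similarity. It is not taken from the CONCH code: the function names `cosine` and `rank_texts` are hypothetical, and the three-dimensional embeddings are hand-made toy values standing in for whatever the image and text encoders would return.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank_texts(image_emb, text_embs, texts):
    """Return (text, score) pairs sorted from most to least similar to the image."""
    scored = [(t, cosine(image_emb, e)) for t, e in zip(texts, text_embs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Toy embeddings (assumption: a real model would produce these vectors).
image_emb = [0.9, 0.1, 0.0]
texts = ["invasive ductal carcinoma", "normal adipose tissue"]
text_embs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]

ranking = rank_texts(image_emb, text_embs, texts)
best_text = ranking[0][0]  # the candidate text closest to the image
```

Retrieval of this kind can only pick among texts that are supplied up front, which is why it does not by itself produce a new caption.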

@bryanwong17 bryanwong17 changed the title Implementation of Image-to-Text Implementation of Image-to-Text (Captioning) May 2, 2024
@Weiqin-Zhao

Weiqin-Zhao commented Jul 3, 2024

I am also looking forward to this amazing feature of this excellent work. I hope the authors can release the corresponding code and weights in the future.

@fedshyvana fedshyvana added the enhancement New feature or request label Sep 10, 2024
@fedshyvana
Collaborator

We don't have code for image captioning at this time, since the repo is meant to provide only inference capabilities and keep the code base clean. Feel free to use open_clip (https://github.com/mlfoundations/open_clip) as a reference if you want to implement this functionality!
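For anyone attempting this, captioning generally comes down to an autoregressive decoding loop over a text decoder conditioned on the image. The sketch below shows plain greedy decoding only; it is not CONCH or open_clip code. `greedy_caption` and `toy_scores` are hypothetical names, and `toy_scores` is a scripted stand-in for a real decoder that would score next tokens given the image and the tokens generated so far.

```python
def greedy_caption(next_token_scores, bos, eos, max_len=20):
    """Greedy decoding: repeatedly append the highest-scoring next token."""
    tokens = [bos]
    for _ in range(max_len):
        scores = next_token_scores(tokens)  # dict mapping token -> score
        best = max(scores, key=scores.get)
        if best == eos:  # stop once the decoder emits end-of-sequence
            break
        tokens.append(best)
    return tokens[1:]  # drop the begin-of-sequence token

# Hypothetical stand-in for an image-conditioned decoder (assumption):
# it always prefers the next token of a fixed script.
def toy_scores(prefix):
    script = ["<bos>", "tumor", "cells", "<eos>"]
    target = script[min(len(prefix), len(script) - 1)]
    return {tok: (1.0 if tok == target else 0.0) for tok in script}

caption = greedy_caption(toy_scores, "<bos>", "<eos>")
```

In practice the scorer would be replaced by the model's decoder forward pass, and beam search or sampling could be swapped in for the `max` step.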


3 participants