New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

https://github.com/showlab/Image2Paragraph #29

Open

flrngel opened this issue Apr 18, 2023 · 0 comments

Owner

flrngel commented Apr 18, 2023

Summary

uses blip/blip2 to generate a simple caption
uses grit/detectron2 to generate a dense caption
uses segment anything to generate a region_semantic information
unify all above and prompt to GPT
canny the input image (which is the bullshit part) and generate the new image using StableDiffusionControlNetPipeline

Conclusion

The output prompt from this project cannot generate a similar image to the input without the canny image of the input.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment