Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

https://github.com/showlab/Image2Paragraph #29

Open
flrngel opened this issue Apr 18, 2023 · 0 comments
Open

https://github.com/showlab/Image2Paragraph #29

flrngel opened this issue Apr 18, 2023 · 0 comments

Comments

@flrngel
Copy link
Owner

flrngel commented Apr 18, 2023

Summary

  • uses blip/blip2 to generate a simple caption
  • uses grit/detectron2 to generate a dense caption
  • uses segment anything to generate a region_semantic information
  • unify all above and prompt to GPT
  • canny the input image (which is the bullshit part) and generate the new image using StableDiffusionControlNetPipeline

Conclusion

  • The output prompt from this project cannot generate a similar image to the input without the canny image of the input.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant