Unit 4, Introduction: Fusion of Text and Vision #126
Suggestions by @SuryaKrishna02 and @charchit7, and some minor corrections.
Add introduction chapter
Add new image links (after merging)
One last small edit: add the LAION dataset: https://laion.ai/blog/laion-5b/
This is amazing! I really enjoyed reading your intro. I especially liked the parts where you introduce what a modality is and also the part showing example combinations of modalities in real life.
Sure @charchit7, I will add this. I also came across some more models, like Kosmos-2.
Co-authored-by: Mohammed Hamdy <62081584+mmhamdy@users.noreply.github.com>
Yup, @snehilsanyal, nice!
Looking good, @snehilsanyal.
Super cool and comprehensive, I only left formatting suggestions!
Looks good. Once @merveenoyan's suggestions are addressed, this is good to go.
Word, style changes, and removed newlines. Restructured some portions.
Done @merveenoyan 🤗
LAION-5B Dataset in Vision + Text
I think this is almost ready to merge!
Add a new example for multimodality
Done @merveenoyan 🤗 Removed the previous example and added a simpler one 😄
Looks good, can be merged
LGTM!
Remove comment Co-authored-by: Merve Noyan <merveenoyan@gmail.com>
Looking good 👍
Hey everyone 🤗
This PR adds the Introduction chapter on Fusion of Text and Vision for Unit 4: Multimodal Models.
Related to Issue: #54
Already reviewed by: @SuryaKrishna02, @charchit7
Best,
Fusion of Text and Vision Team.