NOTE: A curated list of awesome image-captioning studies, aimed at annotating and reporting CT / MRI scans
Since the introduction of the self-attention mechanism, the Transformer architecture has become a game-changing approach in machine translation. These advances quickly migrated to other fields of natural language processing (NLP), giving rise to the encoder-based and generative language models we know today. This repository is a collection of systems and findings that may help you advance in developing Multimodal Large Language Models (MLLMs) that support the following modalities:
- 🖼️ Image (photos, scans, even footage / video frames gathered into a single image)
- 📝 Text (caption / report / question)
- Ferret-V2 (11 April, 2024) [report]
- OmniFusion (09 April, 2024) [report] [code]
- MM1 (22 March, 2024) [report]
- CLIP
- DINO-v2
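Vision-language encoders such as CLIP (listed above) underpin many of these MLLMs by aligning image and text embeddings in a shared space and scoring them by contrastive similarity. Below is a minimal sketch of that scoring step only, using hypothetical toy embeddings and plain NumPy; it is not an implementation of any specific model above.

```python
import numpy as np

def clip_style_similarity(image_emb, text_embs, temperature=0.07):
    """CLIP-style scoring sketch: L2-normalize both sides, take dot
    products, then a temperature-scaled softmax over candidate captions."""
    img = image_emb / np.linalg.norm(image_emb)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = txt @ img / temperature          # cosine similarities, scaled
    exp = np.exp(logits - logits.max())       # stable softmax
    return exp / exp.sum()

# Hypothetical 4-dim embeddings, for illustration only.
image = np.array([1.0, 0.1, 0.0, 0.0])
captions = np.array([
    [0.9, 0.2, 0.0, 0.0],   # caption close to the image embedding
    [0.0, 1.0, 0.0, 0.0],   # unrelated caption
])
probs = clip_style_similarity(image, captions)
```

In a real pipeline the embeddings would come from pretrained image and text encoders; the softmax output can then rank candidate reports or captions for a given scan.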