Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]
-
Updated
Mar 21, 2025 - Python
Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]
Caption images across your datasets with state of the art models from Hugging Face and Replicate!
The Fuyu programming language
Fuyu multi-modal language model for use with Autodistill.
Hands on some MultiModal Models
Testing Nvidia Machine Learning api models
Add a description, image, and links to the fuyu topic page so that developers can more easily learn about it.
To associate your repository with the fuyu topic, visit your repo's landing page and select "manage topics."