Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
Demo Python script to interact with a llama.cpp server using the Whisper API, a microphone, and a webcam.
Kani extension for vision-language models (VLMs), with model-agnostic support for GPT-Vision and LLaVA.
Chain of Images for Intuitively Reasoning
Python-based WebSocket interface for LLaVA inference from the CLI.
Computer vision research on multimedia understanding during a 2023 internship at DSO National Laboratories, under the DSTA JC Scholarship.
A multimodal Discord bot with machine learning functions, including LLM chat, image generation, and speech generation.
⚗️ LLaVA 7B model repository, trained by liuhaotian and managed with DVC.
Image Classification Testing with LLMs
LLaVA base model for use with Autodistill.
Joint work from a bachelor's thesis on combining NLP and CV methods in multimodal approaches to combating hate speech in memes.
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Fine-tune LLaVA 1.5, based on an article by wandb.
A Discord chatbot built on the Mistral and LLaVA models.
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Tiny-scale experiment showing that CLIP models trained using detailed captions generated by multimodal models (CogVLM and LLaVA 1.5) outperform models trained using the original alt-texts on a range of classification and retrieval tasks.