Harnessing Image Captioning to improve VQA (Python, updated Dec 9, 2017)
The task of Visual Question Answering (VQA) involves generating a natural language answer to a question about an image
Part of our final-year project on complex NLP tasks, with experiments across various datasets and different LLMs
Visual Question Answering Using CLIP + LSTM
VQA Challenge - hosted on Hasura using Flask
Reproducibility Challenge - The Neuro-Symbolic Concept Learner
Contains the codes and reports done as part of the AI Club project - AI on Edge
An AI Pathologist for answering all your medical queries.
Baselines and neural network models for the Visual Question Answering task
A deep learning model, with a web application, that answers image-based questions non-generatively: the answer vocabulary is carefully curated, and a linear layer is added on top of OpenAI's CLIP image and text encoders.
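The non-generative approach above frames VQA as classification over a fixed answer set. A minimal sketch of that idea, assuming frozen CLIP encoders produce 512-dimensional embeddings (stand-in random vectors are used here) and the curated vocabulary has 1000 answers; the embedding size, vocabulary size, and function names are illustrative, not taken from the repository:

```python
import numpy as np

EMB_DIM = 512        # CLIP ViT-B/32 embedding size (assumed)
NUM_ANSWERS = 1000   # size of the curated answer vocabulary (assumed)

rng = np.random.default_rng(0)

# Stand-ins for real CLIP outputs: image embedding and question embedding.
image_emb = rng.standard_normal(EMB_DIM)
question_emb = rng.standard_normal(EMB_DIM)

# Trainable linear head mapping the fused feature to answer logits.
W = rng.standard_normal((NUM_ANSWERS, 2 * EMB_DIM)) * 0.01
b = np.zeros(NUM_ANSWERS)

def predict_answer_index(img_emb, q_emb, W, b):
    """Fuse the two embeddings by concatenation, then pick the
    highest-scoring class in the curated answer vocabulary."""
    fused = np.concatenate([img_emb, q_emb])  # shape (2 * EMB_DIM,)
    logits = W @ fused + b                    # shape (NUM_ANSWERS,)
    return int(np.argmax(logits))

idx = predict_answer_index(image_emb, question_emb, W, b)
```

In practice the linear head would be trained with cross-entropy against ground-truth answers while the CLIP encoders stay frozen; only the fusion-plus-classification step is sketched here.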
Deep learning coursework at UoS
Final project for IA376, working with the WikiTableQuestions dataset
Egunean Behin Visual Question Answering Dataset
Deep Learning-Powered Visual & Textual Answering System
[NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
Stacked attention network for open-ended visual Q&A