vision-language-pretraining

Here are 25 public repositories matching this topic...

ahmdtaha / distributed_sigmoid_loss

Unofficial implementation of Sigmoid Loss for Language Image Pre-Training

python3 pytorch unsupervised-learning vision-and-language multimodal-deep-learning self-supervised-learning vision-language contrastive-learning distributed-data-parallel vision-transformer vision-language-pretraining

Updated Sep 26, 2023
Python

unitaryai / VTC-dataset

dataset video-understanding video-text-retrieval vision-language-pretraining vision-language-dataset

Updated May 1, 2024
Python

alinlab / b2t

Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation

explainable-ai vision-language-pretraining bias-and-fairness

Updated May 21, 2023
Python

unitaryai / VTC

VTC: Improving Video-Text Retrieval with User Comments

comments video-understanding multimodal-deep-learning video-text-retrieval vision-language-transformer vision-language-pretraining

Updated Jun 18, 2024
Python

megvii-research / protoclip

📍 Official PyTorch implementation of the paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)

self-supervised-learning contrastive-learning vision-language-pretraining

Updated Nov 8, 2023
Python

LooperXX / ManagerTower

Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning

vision-language multi-modal-learning vision-language-pretraining vision-language-learning

Updated Dec 12, 2023
Python

yiren-jian / BLIText

[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

multimodal-deep-learning vision-language-transformer vision-language-pretraining

Updated Dec 5, 2023
Python

TXH-mercury / COSA

Code and models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

video-captioning video-qa video-retrieval vision-language-pretraining video-language-pretrainng

Updated Aug 1, 2023
Python

HieuPhan33 / CVPR2024_MAVL

Multi-Aspect Vision Language Pretraining - CVPR2024

zero-shot-classification vision-language-pretraining vision-language-model zero-shot-segmentation medical-vision-and-language-pretraining

Updated Jul 1, 2024
Python

ChenDelong1999 / ITRA

A codebase for flexible and efficient Image Text Representation Alignment

computer-vision deep-learning pytorch multimodal-learning vision-language-pretraining

Updated Jun 20, 2023
Python

TencentARC / FLM

Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

language-modeling vision-language-pretraining

Updated May 15, 2023
Python

vgthengane / Continual-CLIP

Official repository for "CLIP model is an Efficient Continual Learner".

baseline clip continual-learning vision-language-pretraining foundational-models

Updated Dec 13, 2022
Python

Zoky-2020 / SGA

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]

adversarial-attack vision-language-pretraining

Updated Sep 6, 2023
Python

YyzHarry / vlm-fairness

Demographic Bias of Vision-Language Foundation Models in Medical Imaging

medical-imaging fairness subpopulation algorithmic-fairness bias-mitigation ood-generalization foundation-models vision-language-pretraining vision-language-model

Updated Feb 23, 2024
Python

omipan / svl_adapter

SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

self-supervised-learning vision-language-pretraining

Updated Jan 11, 2024
Python

Surrey-UP-Lab / RegionSpot

Recognize Any Regions

open-world object-detection zero-shot instance-segmentation auto-labeling vision-language-pretraining open-vocabulary vision-language-model multimodal-representation-learning vision-foundation-model vision-language-foundation-model

Updated Nov 22, 2023
Python

sail-sg / ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

cross-modality vlp vision-language-pretraining

Updated Jun 7, 2023
Python

mbzuai-oryx / VideoGPT-plus

Official repository for the paper "VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding"

chatbot clip image-encoder video-encoder multimodal dual-encoder vision-language vicuna gpt4 vision-language-pretraining llava video-conversation video-chatbot llama3 gpt4o phi-3-mini

Updated Jun 20, 2024
Python

jusiro / FLAIR

FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.

medical-imaging fundus-image-analysis foundation-models vision-language-pretraining

Updated May 15, 2024
Python

ArrowLuo / SegCLIP

PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

transfer-learning semantic-segmentation contrastive-learning zero-shot-semantic-segmentation vision-language-pretraining open-vocabulary open-vocabulary-semantic-segmentation

Updated Jun 28, 2023
Python
