A PyTorch implementation of Learning Transferable Visual Models From Natural Language Supervision
git clone https://github.com/andregaio/clip.git
cd clip
conda create -n clip python=3.8
conda activate clip
pip install -e .
Name | Image encoder | Text encoder |
---|---|---|
clip | resnet50 | distilbert-base-uncased |
python clip/train.py
python clip/infer.py
- The code has been heavily borrowed from https://github.com/moein-shariatnia/OpenAI-CLIP