Thing/Not Thing

This is a zero-shot binary image classifier. Type in the name of an object and AI predicts whether your uploaded photo matches it.

How it Works

This project uses OpenCLIP, an open-source implementation of OpenAI's CLIP.

Examples

Try It Yourself!

This project is available as a web demo here. But it will be slower than when the project is run locally on a GPU.

You can expand "Additional Inputs" to allow adjusting the cosine similarity threshold below which your photo is deemed Not <object>.

Running Locally on a GPU

Tested on Debian.

Requirements

NVIDIA Container Toolkit
Docker (and Docker Compose)
An NVIDIA GPU with sufficient VRAM for your chosen ViT model (model size can be changed in app.py)

Startup

Create a .env file which points to the path where you downloaded the OpenCLIP model. Then run:

docker compose build
docker compose run torch

Finally, go to http://localhost:7860 in your browser.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
readme-images		readme-images
thirdparty/open_clip		thirdparty/open_clip
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Thing/Not Thing

How it Works

Examples

Try It Yourself!

Running Locally on a GPU

Requirements

Startup

About

Releases

Packages

Languages

License

jbinvnt/thing-not-thing

Folders and files

Latest commit

History

Repository files navigation

Thing/Not Thing

How it Works

Examples

Try It Yourself!

Running Locally on a GPU

Requirements

Startup

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages