Hand inference This repository lets you filter a dataset with open clip. Inference structure Calculate CLIP similarity scores for image embeddings and ("hand", "hands", "finger", "fingers") tokens Crop hands from image Run CLIP similarity scores again Perform finger detection