Skip to content

AI-powered image deduplication tool using CLIP embeddings and FAISS similarity search. Smart content-based duplicate detection, not just file hashes.

Notifications You must be signed in to change notification settings

marooncoder09/NearPix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

neariPix 🖼️

A smart image deduplication tool that uses AI to detect similar images based on content, not just file hashes.

✨ Features

  • AI-Powered Detection: Uses OpenAI's CLIP model to understand image content
  • Smart Similarity: Finds duplicates based on visual similarity, not exact matches
  • Fast Search: FAISS-based similarity search with HNSW indexing
  • Easy to Use: Simple Makefile commands for common operations
  • Local Storage: All data stored locally for privacy

🚀 Quick Start

# Install dependencies
make install

# Upload a single image
make upload IMAGE=path/to/image.jpg

# Process all images in a folder
make bulk-custom FOLDER=path/to/images

# Check status
make status

📋 Requirements

  • Python 3.7+
  • ~150MB for CLIP model (downloaded automatically)

🎯 Use Cases

  • Clean up photo libraries
  • Remove duplicate screenshots
  • Organize image collections
  • Find similar images across folders

📁 Project Structure

├── main.py              # Core deduplication logic
├── Makefile             # Easy commands
├── models/              # CLIP model wrapper
├── store/               # Images and metadata
└── utils/               # Helper functions

🔧 Configuration

  • Similarity Threshold: 0.9 (adjustable)
  • Supported Formats: PNG, JPG, JPEG, WEBP, BMP, TIFF
  • Storage: Local store/ directory

📝 License

MIT License

About

AI-powered image deduplication tool using CLIP embeddings and FAISS similarity search. Smart content-based duplicate detection, not just file hashes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published