ThreadShift is a project that leverages state-of-the-art models like CLIP and Stable Diffusion to perform segmentation, image editing, and generation tasks. This project is part of the TYITA B1 G2 course and aims to utilize advanced deep learning techniques to create and manipulate images with precision and creativity.
- ftfy: Fixes broken Unicode text.
- regex: Advanced regular expressions library for searching and manipulating text.
- tqdm: Fast, extensible progress bar for loops and iterable objects.
- diffusers by Hugging Face: Provides pretrained vision and audio diffusion models.
- transformers by Hugging Face: APIs and tools for downloading and training state-of-the-art pretrained models.
- scipy: Scientific computing library for optimization, integration, and other mathematical operations.
- accelerate by Hugging Face: Training and inference at scale made simple, efficient, and adaptable.
- Xformers: Optimizes transformers architecture.
- opencv: Library of programming functions for real-time computer vision.