- Paris
- http://rom1504.fr/
-
wechat-dump Public
Forked from ppwwyyxx/wechat-dumpDump wechat messages from android
Python GNU General Public License v3.0 UpdatedFeb 3, 2025 -
img2dataset Public
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
-
CLIP Public
Forked from openai/CLIPContrastive Language-Image Pretraining
-
laion-prepro Public
Get hundred of million of image+url from the crawling at home dataset and preprocess them
-
embedding-reader Public
Efficiently read embedding in streaming from any filesystem
-
-
clip-retrieval Public
Easily compute clip embeddings and build a clip retrieval system with them
-
word_knn Public
Quickly find closest words using an efficient knn and word embeddings
-
-
cc2dataset Public
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
-
aria2 Public
Forked from aria2/aria2aria2 is a lightweight multi-protocol & multi-source, cross platform download utility operated in command-line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.
C++ GNU General Public License v2.0 UpdatedOct 10, 2023 -
EnMicroMsg.db-Password-Cracker Public
Forked from chg-hou/EnMicroMsg.db-Password-CrackerCrack the password of EnMicroMsg.db with brute-force attack.
Python GNU General Public License v3.0 UpdatedOct 1, 2023 -
-
distributed-shuffle Public
A simple implementation of distributed shuffle, intended for learning
-
-
-
slurm-tracking-bot Public
Simple slurm tracking bot to check usage
-
image_embeddings Public
Using efficientnet to provide embeddings for retrieval
-
embedbase Public
Forked from different-ai/embedbaseThe native Software 3.0 stack
TypeScript MIT License UpdatedMay 25, 2023 -
prismarine-web-client Public
Forked from extremeheat/prismarine-web-clientmineflayer, running in your browser
-
any2dataset Public
Turn any collection of files into a dataset
-
gpu-tester Public
gpu tester detects broken and slow gpus in a cluster
-
whisper Public
Forked from openai/whisperRobust Speech Recognition via Large-Scale Weak Supervision
-
audio2dataset Public
Easily turn large sets of audio urls to an audio dataset.
-
task_adaptation Public
Forked from google-research/task_adaptation -
v-diffusion-pytorch Public
Forked from crowsonkb/v-diffusion-pytorchv objective diffusion inference code for PyTorch.
Python MIT License UpdatedNov 10, 2022 -
k-diffusion Public
Forked from crowsonkb/k-diffusionKarras et al. (2022) diffusion models for PyTorch
-
open_clip Public
Forked from mlfoundations/open_clipAn open source implementation of CLIP.
-
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
-
video2numpy Public
Forked from iejMac/video2numpyOptimized library for large-scale extraction of frames and audio from video.