Stars
Improved sampling via learned diffusions (ICLR 2024) and an optimal control perspective on diffusion-based generative modeling (TMLR 2024)
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Official inference repo for FLUX.1 models
Utilities intended for use with Llama models.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
A llama3 implementation, one matrix multiplication at a time
Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.
A PyTorch-native library for large model training
Easily convert Common Crawl into a dataset of captions and documents: image/text, audio/text, video/text, ...
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
ICLR 2024 Spotlight: curation/training code, metadata, distribution, and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization (a toy BPE training sketch follows this list).
Fast bare-bones BPE for modern tokenizer training
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Freeing data processing from scripting madness by providing a set of platform-agnostic, customizable pipeline processing blocks.
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
Building blocks for foundation models.
A batched, offline-inference-oriented version of segment-anything
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
Fast Implementation of Generalised Geodesic Distance Transform for CPU (OpenMP) and GPU (CUDA)
Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023
Fast and memory-efficient exact attention
Learnable latent embeddings for joint behavioral and neural analysis - Official implementation of CEBRA
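
Several of the tokenizer entries above (the minimal BPE code, the bare-bones BPE trainer, and tiktoken) center on byte pair encoding: repeatedly merging the most frequent adjacent token pair into a new token. The sketch below is only an illustration of that core loop, not the API of any of the listed repositories; the `train_bpe` helper and its parameters are hypothetical.

```python
from collections import Counter

def train_bpe(text: str, num_merges: int) -> list[tuple[int, int]]:
    """Toy BPE trainer: repeatedly merge the most frequent adjacent pair of tokens.
    Illustrative only; not the API of minbpe, tiktoken, or the other repos above."""
    ids = list(text.encode("utf-8"))        # start from raw UTF-8 bytes (0..255)
    merges = []                             # learned merge rules, in order
    next_id = 256                           # new token ids start after the byte range
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))  # count every adjacent token pair
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)    # most frequent pair becomes a new token
        merges.append(pair)
        merged, i = [], 0
        while i < len(ids):                 # replace every occurrence of the pair
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
                merged.append(next_id)
                i += 2
            else:
                merged.append(ids[i])
                i += 1
        ids = merged
        next_id += 1
    return merges

if __name__ == "__main__":
    print(train_bpe("low lower lowest", num_merges=5))
```

For tokenizing text against OpenAI models with an already-trained vocabulary, tiktoken exposes calls such as `tiktoken.get_encoding("cl100k_base").encode("hello world")`.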