Skip to content
View kdexd's full-sized avatar
💎
🙌
💎
🙌

Organizations

@batra-mlp-lab @mdgspace @Cloud-CV @redcaps-dataset

Block or report kdexd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)

Python 59 9 Updated Aug 30, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,204 300 Updated Oct 5, 2024

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 4,243 263 Updated Mar 9, 2025

Official inference repo for FLUX.1 models

Python 20,664 1,459 Updated Feb 6, 2025

Utilities intended for use with Llama models.

Python 5,893 1,002 Updated Mar 1, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,864 128 Updated Oct 30, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,935 348 Updated Aug 7, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,246 1,166 Updated May 23, 2024

The official Meta Llama 3 GitHub site

Python 28,470 3,307 Updated Jan 26, 2025

Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.

Python 280 8 Updated Aug 6, 2024

A PyTorch native library for large model training

Python 3,421 308 Updated Mar 7, 2025

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Python 317 26 Updated Dec 9, 2023

Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.

Python 95 2 Updated Mar 24, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,360 62 Updated Dec 10, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,462 898 Updated Jul 1, 2024

Fast bare-bones BPE for modern tokenizer training

Python 148 3 Updated Oct 21, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,708 973 Updated Mar 9, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,280 169 Updated Mar 4, 2025

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

1,036 42 Updated Sep 27, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,227 59 Updated Nov 22, 2024

Building blocks for foundation models.

459 19 Updated Jan 3, 2024

MLX: An array framework for Apple silicon

C++ 19,481 1,108 Updated Mar 8, 2025

A batched offline inference oriented version of segment-anything

Python 1,218 72 Updated Sep 13, 2024

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 903 87 Updated Jun 22, 2024

The MAX Platform (includes Mojo)

Mojo 23,761 2,585 Updated Mar 9, 2025

Fast Implementation of Generalised Geodesic Distance Transform for CPU (OpenMP) and GPU (CUDA)

C++ 96 14 Updated Feb 9, 2024

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

Python 152 15 Updated Aug 23, 2023

Fast and memory-efficient exact attention

Python 16,171 1,531 Updated Mar 9, 2025

Learnable latent embeddings for joint behavioral and neural analysis - Official implementation of CEBRA

Python 952 82 Updated Mar 6, 2025

Tutorial on Deep Declarative Networks

HTML 1 1 Updated May 30, 2023
Next
Showing results