Stars
Efficient few-shot learning with Sentence Transformers
A flexible, adaptive classification system for dynamic text classification
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Efficiently find the best-suited language model (LM) for your NLP task
Open source no-code system for annotating text and building text classifiers
Seamlessly integrate LLMs into scikit-learn.
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) o…
Store data created during your `pytest` test execution, and retrieve it at the end of the session, e.g. for application benchmarking purposes.
Embeddable property graph database management system built for query speed and scalability. Implements Cypher.
Tesseract Open Source OCR Engine (main repository)
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Scalable toolkit for efficient model alignment
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Scalable data preprocessing and curation toolkit for LLMs
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A Datacenter Scale Distributed Inference Serving Framework
Open source AI coding agent. Designed for large projects and real world tasks.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Low-code framework for building custom LLMs, neural networks, and other AI models
Train transformer language models with reinforcement learning.
Ongoing research training transformer models at scale