Skip to content
View aburkov's full-sized avatar

Block or report aburkov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python 3,080 263 Updated Mar 26, 2025

A Simplified Pytorch Version of the Dreamer Algorithm

Python 123 17 Updated Jul 24, 2023

A completely customizable framework for building rich text editors. (Currently in beta.)

TypeScript 30,531 3,282 Updated Mar 22, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,434 903 Updated Mar 20, 2025

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Jupyter Notebook 4,089 526 Updated Mar 14, 2025
Python 62 12 Updated Nov 30, 2024

🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥

TypeScript 9,685 768 Updated Mar 28, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 37,341 3,876 Updated Mar 28, 2025

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Python 6,698 700 Updated Oct 12, 2024

Blazingly fast LLM inference.

Rust 5,365 388 Updated Mar 28, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 34,348 2,997 Updated Mar 28, 2025

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 33,309 2,871 Updated Mar 28, 2025

real time face swap and one-click video deepfake with only a single image

Python 48,631 7,140 Updated Mar 28, 2025

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,666 172 Updated Feb 24, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 10,011 1,266 Updated Mar 28, 2025

An implementation of Shazam's song recognition algorithm.

Go 4,192 462 Updated Mar 28, 2025

A vector search SQLite extension that runs anywhere!

C 5,358 199 Updated Jan 24, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,464 115 Updated Jan 24, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,869 207 Updated Mar 7, 2025

Apps Script samples for Google Workspace products.

JavaScript 4,745 1,881 Updated Mar 26, 2025

Use Large Language Models (LLM) in Google Sheets

JavaScript 43 7 Updated Jul 20, 2024

🔥Highlighting the top ML papers every week.

11,030 673 Updated Mar 13, 2025

Data validation using Python type hints

Python 23,049 2,062 Updated Mar 28, 2025

AI's query engine - Platform for building AI that can learn and answer questions over large scale federated data.

Python 27,509 4,937 Updated Mar 28, 2025

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.

HTML 690 70 Updated Mar 10, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 2,052 181 Updated Mar 26, 2025

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 3,035 275 Updated Dec 24, 2024

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,423 237 Updated Jan 13, 2025

Curated list of datasets and tools for post-training.

2,884 252 Updated Jan 29, 2025

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,077 305 Updated Mar 15, 2025
Next
Showing results