d01 the01

Stars

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python · 25,657 stars · 2,464 forks · Updated Mar 20, 2025

soul-catcher / mypy-gitlab-code-quality

Python · 17 stars · 7 forks · Updated Feb 22, 2025

PyCQA / flake8-json

JSON formatter for Flake8 output

Python · 12 stars · 5 forks · Updated Feb 4, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python · 10,272 stars · 690 forks · Updated Mar 21, 2025

Megvii-BaseDetection / YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python · 9,721 stars · 2,283 forks · Updated Nov 20, 2024

pypdfium2-team / pypdfium2

Python bindings to PDFium

Python · 549 stars · 23 forks · Updated Mar 17, 2025

NVIDIA / nv-ingest

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python · 2,613 stars · 226 forks · Updated Mar 21, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript · 84,808 stars · 10,328 forks · Updated Mar 21, 2025

rhasspy / wyoming-faster-whisper

Wyoming protocol server for faster whisper speech to text system

Python · 164 stars · 48 forks · Updated Dec 10, 2024

eBUS / ebus.github.io

eBUS configuration files

C++ · 9 stars · 2 forks · Updated Feb 16, 2025

chrishubert / whatsapp-api

This project is a REST API wrapper for the whatsapp-web.js library, providing an easy-to-use interface to interact with the WhatsApp Web platform.

JavaScript · 1,103 stars · 471 forks · Updated Dec 31, 2024

42wim / matterbridge

bridge between mattermost, IRC, gitter, xmpp, slack, discord, telegram, rocketchat, twitch, ssh-chat, zulip, whatsapp, keybase, matrix, microsoft teams, nextcloud, mumble, vk and more with REST API…

Go · 6,890 stars · 641 forks · Updated Dec 12, 2024

Goldziher / kreuzberg

A text extraction library supporting PDFs, images, office documents and more

Python · 1,629 stars · 54 forks · Updated Mar 21, 2025

DocumindHQ / documind

Open-source platform for extracting structured data from documents using AI.

JavaScript · 1,273 stars · 44 forks · Updated Feb 21, 2025

AnswerDotAI / RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python · 3,325 stars · 223 forks · Updated Feb 11, 2025

AnswerDotAI / byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python · 754 stars · 78 forks · Updated Jan 28, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript · 14,818 stars · 1,517 forks · Updated Mar 14, 2025