Skip to content
View the01's full-sized avatar

Block or report the01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-Sora: Democratizing Efficient Video Production for All

Python 25,657 2,464 Updated Mar 20, 2025

JSON formatter for Flake8 output

Python 12 5 Updated Feb 4, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,272 690 Updated Mar 21, 2025

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 9,721 2,283 Updated Nov 20, 2024

Python bindings to PDFium

Python 549 23 Updated Mar 17, 2025

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,613 226 Updated Mar 21, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 84,808 10,328 Updated Mar 21, 2025

Wyoming protocol server for faster whisper speech to text system

Python 164 48 Updated Dec 10, 2024

eBUS configuration files

C++ 9 2 Updated Feb 16, 2025

This project is a REST API wrapper for the whatsapp-web.js library, providing an easy-to-use interface to interact with the WhatsApp Web platform.

JavaScript 1,103 471 Updated Dec 31, 2024

bridge between mattermost, IRC, gitter, xmpp, slack, discord, telegram, rocketchat, twitch, ssh-chat, zulip, whatsapp, keybase, matrix, microsoft teams, nextcloud, mumble, vk and more with REST API…

Go 6,890 641 Updated Dec 12, 2024

A text extraction library supporting PDFs, images, office documents and more

Python 1,629 54 Updated Mar 21, 2025

Open-source platform for extracting structured data from documents using AI.

JavaScript 1,273 44 Updated Feb 21, 2025

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,325 223 Updated Feb 11, 2025

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 754 78 Updated Jan 28, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 14,818 1,517 Updated Mar 14, 2025

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 32,048 2,742 Updated Mar 21, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,262 639 Updated Feb 10, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,937 629 Updated Mar 7, 2025

The Free Software Media System - Server Backend & API

C# 38,025 3,393 Updated Mar 20, 2025

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,245 308 Updated Oct 5, 2024

Interface for OuteTTS models.

Python 955 84 Updated Feb 14, 2025

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Python 183 21 Updated Mar 18, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,916 2,403 Updated Aug 12, 2024

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Python 1,405 165 Updated Jan 22, 2025

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 20,413 2,633 Updated Mar 21, 2025

Convenience Docker images for Apache Tika Server

Shell 168 74 Updated Feb 5, 2025

A powerful vCard parser

Python 1 Updated Jun 23, 2024
Next
Showing results