

Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Agno is a lightweight library for building Multimodal Agents. Use it to give LLMs superpowers like memory, knowledge, tools and reasoning.
A Claude MCP tool to interact with the ChatGPT desktop app on macOS
Repo for finetuning Llama 2 with Vietnamese dataset
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Get your documents ready for gen AI
A high-throughput and memory-efficient inference and serving engine for LLMs
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Composable building blocks to build Llama Apps
Tools for merging pretrained large language models.
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Running large language models on a single GPU for throughput-oriented scenarios.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
A collection of learning resources for curious software engineers
OCR, layout analysis, reading order, table recognition in 90+ languages
This code is an implementation of a chatbot using LLM chat model API and Langchain.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Open source Python library for converting PDF to DOCX.
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)
PhoGPT: Generative Pre-training for Vietnamese (2023)
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Typed interactions with the GitHub API v3
Automatically extract the main text content (and more) from an HTML document