👻
Improving inefficiencies
  • Escavador

Highlights

  • Pro

Organizations

@FORMAS


Fast Semantic Text Deduplication

Python 555 23 Updated Feb 28, 2025

Lightweight Nearest Neighbors with Flexible Backends

Python 258 8 Updated Mar 2, 2025

Production-ready Inference, Ingestion and Indexing built in Rust 🦀

Rust 471 40 Updated Mar 6, 2025

A parallel computing interface to the L-BFGS-B optimizer

Python 65 13 Updated Mar 31, 2024

Official repository of the xLSTM.

Python 1,736 129 Updated Jan 14, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,541 827 Updated Mar 7, 2025

A Python module to repair invalid JSON from LLMs

Python 1,556 79 Updated Feb 23, 2025

🔧 Repair JSON! Solution for JSON anomalies from LLMs.

Go 224 10 Updated Jul 17, 2024

Simple GRPO scripts and configurations.

Python 56 4 Updated Feb 6, 2025

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Python 1,838 125 Updated Mar 7, 2025

Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"

Python 16 1 Updated Feb 14, 2024

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…

Python 762 55 Updated Nov 29, 2024

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

Python 2,723 119 Updated Mar 10, 2025

Fast State-of-the-Art Static Embeddings

Python 1,088 48 Updated Mar 2, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 13,203 1,361 Updated Mar 5, 2025

Build fast and accurate GenAI apps with GraphRAG SDK at scale.

Python 270 31 Updated Mar 5, 2025

Automated Code Reviewer for GitLab merge requests

TypeScript 7 3 Updated Dec 27, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 80,028 11,700 Updated Mar 10, 2025

A playground to make it easy to try crazy things

Python 33 1 Updated Mar 9, 2025

Code for our paper accepted at EMNLP 2023 (Findings)

Python 13 Updated Jan 5, 2024

A p2p reverse proxy with NAT traversal. Inspired by frp, rathole and ngrok

Go 333 10 Updated Feb 19, 2025

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 14,182 1,248 Updated Mar 8, 2025

Dead Simple LLM Abliteration

Python 206 7 Updated Feb 18, 2025

Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and WordPiece tokenization in JavaScript, Python and Rust.

Rust 15 Updated Feb 20, 2025

Python 3 Updated Jan 5, 2025

Automatic code review using OpenAI API triggered by GitHub/GitLab webhooks

Python 4 Updated Dec 20, 2024

Python tool for converting files and office documents to Markdown.

Python 39,739 1,851 Updated Mar 9, 2025

A tiny HTML5 parser

Python 7 Updated Oct 29, 2024

Get your documents ready for gen AI

Python 23,615 1,372 Updated Mar 7, 2025