Organizations

@firebase @googlemaps @fluttercommunity @FlutterVikings @Thmanyah-LLC

Starred repositories

NPM library to splice HLS VOD

JavaScript 18 4 Updated Dec 18, 2024

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Python 58,251 4,124 Updated Mar 28, 2025

methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories

Python 146 39 Updated Dec 4, 2023

Analysis scripts for log data sets used in anomaly detection.

Python 60 7 Updated Jul 30, 2024

Firebase SDK for Cloud Functions

TypeScript 1,036 206 Updated Mar 25, 2025

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Python 556 59 Updated May 9, 2024

Mastering Diverse Domains through World Models

Python 1,571 264 Updated Feb 22, 2025

Train transformer language models with reinforcement learning.

Python 12,909 1,742 Updated Mar 30, 2025

Schedule-Free Optimization in PyTorch

Python 2,125 72 Updated Mar 24, 2025

Fine-tune LLM agents with online reinforcement learning

Python 1,097 51 Updated Mar 19, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 822 51 Updated Mar 24, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,484 201 Updated Aug 11, 2024

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 82,669 7,151 Updated Mar 26, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,235 1,809 Updated Mar 24, 2025

Powerful menu bar manager for macOS

Swift 17,663 314 Updated Jan 26, 2025
Python 1 Updated Mar 17, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,771 678 Updated Mar 24, 2025

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 94 6 Updated Feb 9, 2024

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 146 12 Updated Mar 18, 2024

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

348 17 Updated Sep 12, 2024
Python 141 14 Updated May 2, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

2,509 172 Updated Mar 21, 2025

Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"

Python 32 6 Updated May 3, 2023
Python 63 16 Updated Oct 25, 2024

✨ A static blog template built with Astro.

Astro 2,064 505 Updated Mar 29, 2025

Sample code illustrating the VS Code extension API.

TypeScript 9,201 3,552 Updated Mar 14, 2025
Python 149 95 Updated Mar 25, 2025

🦌 Soothing pastel theme for VSCode

TypeScript 1,628 58 Updated Mar 29, 2025

An innovative superfamily of fonts for code

TypeScript 15,724 270 Updated Mar 7, 2025

LLM101n: Let's build a Storyteller

32,992 1,804 Updated Aug 1, 2024