Mais Alheraki pr-Mais

👾

Software engineer

596 followers · 45 following

@Thmanyah-LLC
Dammam, Saudi Arabia
g.dev/mais
@pr_Mais
https://mais.codes

Lists (1)

Flutter

1 repository

Starred repositories

Eyevinn / hls-splice

NPM library to splice HLS VOD

JavaScript 18 4 Updated Dec 18, 2024

localstack / localstack

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Python 58,251 4,124 Updated Mar 28, 2025

microsoft / methods2test

methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories

Python 146 39 Updated Dec 4, 2023

ait-aecid / anomaly-detection-log-datasets

Analysis scripts for log data sets used in anomaly detection.

Python 60 7 Updated Jul 30, 2024

firebase / firebase-functions

Firebase SDK for Cloud Functions

TypeScript 1,036 206 Updated Mar 25, 2025

voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Python 556 59 Updated May 9, 2024

danijar / dreamerv3

Mastering Diverse Domains through World Models

Python 1,571 264 Updated Feb 22, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 12,909 1,742 Updated Mar 30, 2025

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,125 72 Updated Mar 24, 2025

KhoomeiK / LlamaGym

Fine-tune LLM agents with online reinforcement learning

Python 1,097 51 Updated Mar 19, 2024

ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 822 51 Updated Mar 24, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,484 201 Updated Aug 11, 2024

fastapi / fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 82,669 7,151 Updated Mar 26, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,235 1,809 Updated Mar 24, 2025

jordanbaird / Ice

Powerful menu bar manager for macOS

Swift 17,663 314 Updated Jan 26, 2025

ksaa-nlp / balsam-eval

Python 1 Updated Mar 17, 2025

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,771 678 Updated Mar 24, 2025

WooooDyy / LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 94 6 Updated Feb 9, 2024

raghavc / LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 146 12 Updated Mar 18, 2024