Pinned

  1. RWKV-LM-LISA RWKV-LM-LISA Public

    Layerwise Importance Sampled AdamW for RWKV, targeting RWKV-5 and 6. Supports SFT and alignment (DPO, ORPO). CUDA and ROCm 6.0. Can train a 7B model on a 24GB GPU!

    Python 11

  2. RWKV5-LM-LoRA RWKV5-LM-LoRA Public

    RWKV v5/v6 LoRA trainer for the CUDA and ROCm platforms. RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNN and transformer.

    Python 9 1
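The core idea behind the LoRA trainers above can be sketched in a few lines. This is an illustrative, pure-Python sketch of the general LoRA technique, not code from the repository: the frozen weight matrix W is left untouched, and only a low-rank update B·A, scaled by alpha/r, is trained.

```python
def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def lora_forward(W, A, B, x, alpha=16, r=2):
    """y = W x + (alpha / r) * B (A x); only A and B are trainable."""
    base = matvec(W, x)
    low_rank = matvec(B, matvec(A, x))  # passes through an r-dimensional bottleneck
    scale = alpha / r
    return [b + scale * l for b, l in zip(base, low_rank)]

# Tiny example: 2x2 frozen identity W, rank-1 adapter (r=1).
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 1.0]]           # r x d_in  = 1 x 2
B = [[0.5], [0.5]]         # d_out x r = 2 x 1
y = lora_forward(W, A, B, [1.0, 2.0], alpha=1, r=1)
```

Because A and B together hold far fewer parameters than W, this is what makes fine-tuning multi-billion-parameter RWKV models feasible on a single consumer GPU.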

  3. RWKV-infctx-trainer-LoRA RWKV-infctx-trainer-LoRA Public

    RWKV v5/v6 infctx LoRA trainer with 4-bit quantization. CUDA and ROCm supported, for training arbitrary context sizes, to 10k and beyond!

    Python 8 2

  4. RWKV-LM-RLHF-DPO-LoRA RWKV-LM-RLHF-DPO-LoRA Public

    Forked from Triang-jyed-driung/RWKV-LM-RLHF-DPO

    Direct Preference Optimization (DPO) with LoRA for RWKV, targeting RWKV-5 and 6.

    Python 1
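For context, the DPO objective this trainer implements can be written down compactly. The sketch below is a generic, pure-Python rendering of the standard per-pair DPO loss (not code from the repository): it rewards the policy for raising the log-probability of the chosen response relative to the reference model, and lowering it for the rejected one.

```python
import math

def dpo_loss(logp_w_policy, logp_l_policy, logp_w_ref, logp_l_ref, beta=0.1):
    """Per-pair DPO loss:
    -log sigmoid(beta * ((logp_w_pi - logp_w_ref) - (logp_l_pi - logp_l_ref)))
    where *_w is the chosen (winning) response and *_l the rejected one."""
    margin = (logp_w_policy - logp_w_ref) - (logp_l_policy - logp_l_ref)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))
```

When the policy matches the reference, the margin is zero and the loss sits at log 2; it falls as the policy learns to prefer the chosen response.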

  5. RWKV-LM-State-4bit-Orpo RWKV-LM-State-4bit-Orpo Public

    State tuning of RWKV v6 with ORPO can be performed with 4-bit quantization. Every model size can be trained with ORPO on a single 24GB GPU!

    Python 4 1
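ORPO differs from DPO in that it needs no reference model: it adds an odds-ratio penalty on top of the ordinary SFT loss. The snippet below is a generic sketch of that odds-ratio term (not the repository's implementation), taking mean per-token log-probabilities of the chosen and rejected responses.

```python
import math

def orpo_or_loss(logp_w, logp_l):
    """Odds-ratio term of ORPO: -log sigmoid(log(odds_w / odds_l)),
    where odds(p) = p / (1 - p) and p = exp(mean token log-prob).
    The full ORPO objective adds this, weighted, to the SFT (NLL) loss."""
    odds = lambda logp: math.exp(logp) / (1.0 - math.exp(logp))
    log_odds_ratio = math.log(odds(logp_w)) - math.log(odds(logp_l))
    return -math.log(1.0 / (1.0 + math.exp(-log_odds_ratio)))
```

With no reference model to hold in memory, combining this with 4-bit quantization is what lets the training fit on a single 24GB GPU.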

  6. RWKV-Infer RWKV-Infer Public

    A large-scale RWKV v6 inference wrapper using the CUDA backend. Easy to deploy with Docker. Supports multi-batch generation and dynamic state switching. Let's spread RWKV, which combines RNN technology…

    Python 4 1