LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
An open-source, knowledgeable large language model framework.
Collaborative Training of Large Language Models in an Efficient Way
Code and analysis for optimizing dynamic neural networks, investigating and implementing various optimization techniques to enhance them.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as a UI, a RESTful API, auto-scaling, compute resource management, monitoring, and more.
A toy large model for recommender systems based on LLaMA2, SASRec, and Meta's generative recommenders, plus notes and experiments on the official implementation of Meta's generative recommenders.
Shaping Language Models with Cognitive Insights
Minimal yet high-performance code for pretraining LLMs. Attempts to implement some SOTA features; supports training through DeepSpeed, Megatron-LM, and FSDP. WIP.
Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline mode; faster than ZeRO/ZeRO++/FSDP (a minimal DeepSpeed training sketch follows this list).
The official implementation of paper "Demystifying Instruction Mixing for Fine-tuning Large Language Models"
Best practice for training LLaMA models in Megatron-LM
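Several of the repositories above train through DeepSpeed. For readers new to the library, here is a minimal sketch of its standard `deepspeed.initialize` entry point with a ZeRO stage 2 config. The toy linear model, dimensions, and hyperparameters are placeholder assumptions for illustration, not taken from any project listed here.

```python
import torch
import deepspeed

# Toy model; all sizes and hyperparameters below are illustrative assumptions.
model = torch.nn.Linear(1024, 1024)

ds_config = {
    "train_batch_size": 4,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},  # partition optimizer states and gradients
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
}

# deepspeed.initialize wraps the model in an engine that handles ZeRO
# partitioning, mixed precision, and gradient reduction across ranks.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

x = torch.randn(4, 1024, device=engine.device, dtype=torch.half)
loss = engine(x).float().pow(2).mean()  # dummy loss for the sketch
engine.backward(loss)  # handles loss scaling and gradient all-reduce
engine.step()          # optimizer step plus ZeRO bookkeeping
```

Scripts like this are typically launched with the `deepspeed` CLI (e.g. `deepspeed train_sketch.py`), which sets up the distributed environment for each rank.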