# llm-training

Here are 353 public repositories matching this topic...

gitleaks / gitleaks

Find secrets with Gitleaks 🔑

git go cli golang open-source security secret ci-cd cicd hacktoberfest dlp security-tools devsecops data-loss-prevention gitleaks ai-powered llm llm-training llm-inference

Updated Mar 7, 2025
Go

liguodongiot / llm-action

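This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in production).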

llm llmops llm-serving llm-training llm-inference

Updated Mar 2, 2025
HTML

ludwig-ai / ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Updated Mar 3, 2025
Python

skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Updated Mar 7, 2025
Python

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

triton llama mistral finetuning llms llm-training llama3 phi3 gemma2 triton-kernels

Updated Mar 7, 2025
Python

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent chatbot conversational-ai peft baichuan msagent large-language-models llm supervised-finetuning llava llm-training chatglm2 internlm llama2 qwen chatglm3 mixtral llama3 phi3

Updated Feb 21, 2025
Python

h2oai / h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

ai chatbot llama gpt generative fine-tuning finetuning llm generative-ai chatgpt llm-training llama2

Updated Mar 7, 2025
Python

databricks / dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

databricks llm generative-ai gen-ai llm-training llm-inference mosaic-ai

Updated May 1, 2024
Python

dstackai / dstack

dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel accelerators.

python training kubernetes aws machine-learning cloud azure gpu gcp orchestration k8s fine-tuning llms llmops llm-training llm-inference

Updated Mar 7, 2025
Python

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

pytorch transformer moe llm llm-serving llm-training flash-attention

Updated Mar 7, 2025
Python

intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

k8s hacktoberfest distributed-training llm-training

Updated Mar 7, 2025
Python

utkuozdemir / nvidia_gpu_exporter

Nvidia GPU exporter for prometheus using nvidia-smi binary

ai monitoring gaming prometheus nvidia cryptocurrency prometheus-exporter nvidia-smi nvidia-gpu llm llm-training

Updated Mar 4, 2025
Go

sail-sg / Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Updated Jul 2, 2024
Python

volcengine / veScale

A PyTorch Native LLM Training Framework

pytorch llm-training

Updated Dec 27, 2024
Python

ghimiresunil / LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

open-source transformers bert huggingface large-language-models llm-training llm-inference open-source-llm llm-tutorials

Updated Jan 19, 2025
Jupyter Notebook

rohan-paul / LLM-FineTuning-Large-Language-Models

LLM (Large Language Model) FineTuning

pytorch gpt-3 large-language-models llm llm-serving gpt3-turbo llm-training llm-inference open-source-llm llama2 llm-finetuning mistral-7b

Updated May 19, 2024
Jupyter Notebook

anarchy-ai / LLM-VM

irresponsible innovation. Try now at https://chat.dev/

machine-learning deep-learning artificial-intelligence distillation distillation-model llm llm-agent llm-training llm-inference llm-local

Updated May 14, 2024
Python

mallorbc / Finetune_LLMs

Repo for fine-tuning Causal LLMs

docker falcon mpt llama gpt gpt-3 gpt-4 gpt-j-6b llm gpt-35-turbo llm-training llama2

Updated Mar 27, 2024
Python

feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

pytorch attention-is-all-you-need llm-training llm-inference ring-attention deepspeed-ulysses

Updated Feb 19, 2025
Python

FlagAI-Open / Aquila2

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

llm llm-training llm-inference

Updated Oct 11, 2024
Python
