High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Updated May 20, 2024 · C++
C++ implementation of ChatGLM-6B, ChatGLM2-6B, ChatGLM3, and more LLMs
TinyChatEngine: On-Device LLM Inference Library
Fast Multimodal LLM on Mobile Devices
Tiny C++11 GPT-2 inference implementation from scratch
PyTorch library for cost-effective, fast, and easy serving of MoE models
CUDA implementation of Extended Long Short-Term Memory (xLSTM), with C++ and PyTorch ports
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
A special PyTorch library for those who can't afford a big GPU