# qwen
Here are 6 public repositories matching this topic...
Updated Aug 2, 2024 · C++
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Topics: bloom · falcon · moe · gemma · mistral · mixture-of-experts · model-quantization · multi-gpu-inference · m2m100 · llamacpp · llm-inference · internlm · llama2 · qwen · baichuan2 · mixtral · phi-2 · deepseek · minicpm
Updated Mar 15, 2024 · C++
Explore LLM model deployment on AXera's AI chips.
Updated Aug 7, 2024 · C++