Repositories
- alpaca_eval (public, forked from tatsu-lab/alpaca_eval)
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
- exllama (public, forked from turboderp/exllama)
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
- vllm (public, forked from vllm-project/vllm)
A high-throughput and memory-efficient inference and serving engine for LLMs.
- text-generation-inference (public, forked from huggingface/text-generation-inference)
Large Language Model Text Generation Inference.
- Evaluation (public)