Running large language models on a single GPU for throughput-oriented scenarios.
-
Updated
Apr 19, 2024 - Python
Running large language models on a single GPU for throughput-oriented scenarios.
A crowdsourced distributed cluster for AI art and text generation
New OTP Bot, working with any company or service name to fetch otp code.
Train very large language models in Jax.
MinT: Minimal Transformer Library and Tutorials
This is the official PyTorch implementation of "LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models", and also an efficient LLM compression tool with various advanced compression methods, supporting multiple inference backends.
4D reconstruction of developmental trajectories using spherical harmonics
This bot attends the online classes held on Microsoft teams, according to the given timetable.Informs if bot is successfully joined the meeting through discord.
Training and inference scripts for Meta's OPT LLM models using the Alpaca Instruct format.
This bot attends the online classes held on Microsoft teams, according to the given timetable.Informs if bot is successfully joined the meeting through discord.
Add a description, image, and links to the opt topic page so that developers can more easily learn about it.
To associate your repository with the opt topic, visit your repo's landing page and select "manage topics."