Running large language models on a single GPU for throughput-oriented scenarios.
-
Updated
Jul 24, 2024 - Python
Running large language models on a single GPU for throughput-oriented scenarios.
Run Mixtral-8x7B models in Colab or consumer desktops
dpdk infrastructure for software acceleration. Currently working on RX and ACL pre-filter
DPU-Powered File System Virtualization over virtio-fs
A collection of tests for the Open vSwitch HW offload.
Backend.AI Client Library for Python
A framework for IoT devices to offload tasks to the cloud, resulting in efficient computation and decreased cloud costs.
LeapIO: Efficient and Portable Virtual NVMe Storage on ARM SoCs (ASPLOS'20)
The container-based cloud platform for mobile code offloading
A Dynamic Programming Offloading Algorithm for Mobile Cloud Computing
Monero hardware wallet protocol implementation for Trezor, agent
Simulator for workflow scheduling and task offloading among mobile phones and edge clouds.
A lightweight framework that enables serverless users to reduce their bills by harvesting non-serverless compute resources such as their VMs, on-premise servers, or personal computers.
👷 Web Worker Offloading Framework to migrate Web Worker from browser to server
Monero wallet Trezor integration documentation
Add a description, image, and links to the offloading topic page so that developers can more easily learn about it.
To associate your repository with the offloading topic, visit your repo's landing page and select "manage topics."