adaptive-inference
Here are 7 public repositories matching this topic...
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
-
Updated
Jan 22, 2025 - Python
Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation, ECCV 2020 Oral
-
Updated
Aug 26, 2020 - Python
-
Updated
Dec 26, 2019 - Python
This repository features RARE-UNet a resolution-aware 3D U-Net for adaptive medical segmentation. It uses multi-scale entry blocks and resolution-based routing to dynamically adjust the inference path to input resolution. Combined with consistency-based training, RARE-UNet delivers accurate, efficient segmentation across resolutions.
-
Updated
Feb 5, 2026 - Python
Scalable framework for adaptive LLM serving: classify prompt complexity → select quantized drafts → verify with FP16 target, no model retraining required.
-
Updated
Nov 9, 2025 - Python
Research prototype: category-aware layer skipping for small LLMs on Apple Silicon. Honest write-up of what worked (1.9x batch throughput on Llama 3.2 3B) and what didn't (MMLU collapse). Outperformed by LayerSkip/MoD/vLLM.
-
Updated
May 12, 2026 - Python
Improve this page
Add a description, image, and links to the adaptive-inference topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the adaptive-inference topic, visit your repo's landing page and select "manage topics."