adaptive-inference

This repository features RARE-UNet, a resolution-aware 3D U-Net for adaptive medical image segmentation. It uses multi-scale entry blocks and resolution-based routing to match the inference path to the input resolution. Combined with consistency-based training, RARE-UNet delivers accurate, efficient segmentation across resolutions.

brain-mri dynamic-routing multi-scale medical-image-segmentation tumor-segmentation 3d-unet adaptive-inference resolution-aware consistency-training hippocampus-segmentation rare-unet

Updated Feb 5, 2026
Python
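
To illustrate what resolution-based routing might look like, here is a minimal sketch that picks an encoder entry level from the input size. It is not taken from the RARE-UNet code: the function name `select_entry_level`, the default full resolution of 64, and the assumption that each level halves the resolution are all hypothetical.

```python
import math


def select_entry_level(input_size: int, full_size: int = 64, num_levels: int = 4) -> int:
    """Pick the encoder depth at which an input of side `input_size` enters.

    Assumption (hypothetical): level k of the encoder operates at
    full_size / 2**k, so a lower-resolution input can skip the first k levels.
    """
    if input_size <= 0:
        raise ValueError("input_size must be positive")
    ratio = full_size / input_size
    # Inputs at or above full resolution always enter at the top level.
    level = round(math.log2(ratio)) if ratio > 1 else 0
    # Clamp to the levels that actually exist.
    return min(max(level, 0), num_levels - 1)


if __name__ == "__main__":
    for size in (64, 32, 16, 8):
        print(size, "->", select_entry_level(size))
```

Under these assumptions, a lower-resolution input enters deeper in the network and skips the earlier, more expensive levels.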

HyperKuvid-Labs / SpecQuant

Scalable framework for adaptive LLM serving: classify prompt complexity → select quantized drafts → verify with FP16 target, no model retraining required.

quantization adaptive-inference llm-inference speculative-decoding

Updated Nov 9, 2025
Python
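
The classify, select, verify pipeline described above can be sketched with toy stand-ins for the models. This is an illustrative sketch only, not SpecQuant's implementation: the word-count heuristic, the `"easy"`/`"hard"` labels, and every function name are hypothetical, and a real system would verify all draft tokens in a single target forward pass rather than one at a time.

```python
from typing import Callable, Dict, List

# A "model" here is any function mapping a token prefix to the next token (greedy).
Model = Callable[[List[int]], int]


def classify_complexity(prompt: str) -> str:
    """Hypothetical heuristic: short prompts count as easy."""
    return "easy" if len(prompt.split()) < 8 else "hard"


def pick_draft(complexity: str, drafts: Dict[str, Model]) -> Model:
    """Select the draft model registered for this complexity class."""
    return drafts[complexity]


def speculative_step(prefix: List[int], draft: Model, target: Model, k: int = 4) -> List[int]:
    """One greedy speculative-decoding step; returns the tokens to append."""
    # 1. The draft proposes k tokens autoregressively.
    ctx, proposed = list(prefix), []
    for _ in range(k):
        tok = draft(ctx)
        proposed.append(tok)
        ctx.append(tok)
    # 2. The target checks each proposal; stop at the first disagreement.
    ctx, accepted = list(prefix), []
    for tok in proposed:
        expected = target(ctx)
        if tok != expected:
            accepted.append(expected)  # replace the rejected token
            return accepted
        accepted.append(tok)
        ctx.append(tok)
    # 3. All proposals accepted: the target contributes one extra token.
    accepted.append(target(ctx))
    return accepted
```

Because the target decides every emitted token, the output matches what greedy decoding with the target alone would produce; the draft only affects how many tokens are accepted per step.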

Vadale / adaptive-inference-system

Research prototype: category-aware layer skipping for small LLMs on Apple Silicon. Honest write-up of what worked (1.9x batch throughput on Llama 3.2 3B) and what didn't (MMLU collapse). Outperformed by LayerSkip/MoD/vLLM.

inference transformer llama mps gemma efficient-inference faiss mac-mini adaptive-inference negative-results apple-silicon llm layer-skipping

Updated May 12, 2026
Python
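
As a rough picture of category-aware layer skipping, the sketch below runs a stack of layers and bypasses a per-category set of them. It is an assumption-laden toy, not the repository's code: the layers are plain functions on a float, and the category names and skip sets in `SKIP_BY_CATEGORY` are invented for illustration.

```python
from typing import Callable, Dict, List, Set

Layer = Callable[[float], float]


def run_with_skips(x: float, layers: List[Layer], skip: Set[int]) -> float:
    """Apply each layer in order, passing the input through unchanged for skipped indices."""
    for i, layer in enumerate(layers):
        if i in skip:
            continue  # skipped layer acts as the identity
        x = layer(x)
    return x


# Toy stack: layer i adds i to the hidden value.
layers: List[Layer] = [lambda h, i=i: h + i for i in range(6)]

# Hypothetical mapping from prompt category to the layer indices to skip.
SKIP_BY_CATEGORY: Dict[str, Set[int]] = {
    "chat": {3, 4},  # skip two middle layers
    "math": set(),   # run the full stack
}

if __name__ == "__main__":
    for category, skip in SKIP_BY_CATEGORY.items():
        print(category, run_with_skips(0.0, layers, skip))
```

Skipping layers saves compute in proportion to the number of layers bypassed, which is the throughput side of the trade-off the write-up reports against accuracy.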

adaptive-inference

Here are 7 public repositories matching this topic...

kalviny / MSDNet-PyTorch

InternScience / AdaptiveDiffusion

zdaxie / SpatiallyAdaptiveInference-Detection

kalviny / IMTA

simonwinther / RARE-UNet

HyperKuvid-Labs / SpecQuant

Vadale / adaptive-inference-system
