Machine Learning Engineer bringing AI to the Cloudflare Global Network.
- Austin, TX
Highlights
- Pro
Pinned Loading
-
vllm-kvcompress
vllm-kvcompress PublicKV cache compression for high-throughput LLM inference
-
Syntactically-Constrained-Sampling
Syntactically-Constrained-Sampling PublicLLM sampling method for enforcing syntax adherence in generated output
-
animation-assist-data
animation-assist-data PublicSimple application for downloading and labeling Flickr images with deployment to GCP
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.