JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (GPU support is planned for the future; PRs welcome).
AI-Hypercomputer/JetStream