Skip to content

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

License

Notifications You must be signed in to change notification settings

AI-Hypercomputer/JetStream

Error
Looks like something went wrong!

About

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 38

Languages