[NeurIPS'23] Speculative Decoding with Big Little Decoder
-
Updated
Feb 6, 2024 - Python
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Extension of the ScheduleFlow Simulator to allow speculative request times at submission and during backfill
Add a description, image, and links to the speculative-execution topic page so that developers can more easily learn about it.
To associate your repository with the speculative-execution topic, visit your repo's landing page and select "manage topics."