#### Motivation * Reduce latency when multiple requests are required * Stream output from the predictor as it's generated