Skip to content

Commit

Permalink
Fix input_data and device order for streaming (deepjavalibrary#809)
Browse files Browse the repository at this point in the history
  • Loading branch information
xyang16 authored and KexinFeng committed Aug 16, 2023
1 parent 2328b6c commit 59541c3
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions engines/python/setup/djl_python/deepspeed.py
Original file line number Diff line number Diff line change
Expand Up @@ -324,8 +324,8 @@ def inference(self, inputs: Input):
"DeepSpeed")
device = torch.cuda.current_device()
outputs.add_stream_content(
stream_generator(self.model, self.tokenizer, device,
input_data, **model_kwargs))
stream_generator(self.model, self.tokenizer, input_data,
device, **model_kwargs))
return outputs
if self.task == "text-generation":
tokenized_inputs = self.tokenizer(
Expand Down

0 comments on commit 59541c3

Please sign in to comment.