Fix a bug in DeepSpeedMLP #4389

sakogan · 2023-09-22T18:36:39Z

The initialization of data_type in DeepSpeedMLP appears to be incorrect when int8 inference is requested, leading to seemingly wrong text generation.

For instance, running the following command (using inference-test.py from the DeepSpeedExamples repo)

deepspeed --num_gpus 1 inference/huggingface/text-generation/inference-test.py --model bigscience/bloom-7b1 --batch_size 1 --use_kernel --use_meta_tensor --dtype int8

produces the following output:

in=DeepSpeed is a machine learning framework
out=DeepSpeed is a machine learning framework 6 10 tràm kú Tá aixướm de de截útil恩opció 15 Beraz Tá P L Tá 7 80pata terbakar aixjai尼 Tá l'Oficina de al 3'hi aix-Els L Tá T Haut-Commissariat Tá 80juntament de de a 20 10 kú

The proposed fix modifies the change to the data_type initialization introduced in #3425

loadams · 2023-09-28T17:30:18Z

@sakogan - when you tested this with the DeepSpeedExamples change, did it produce quality output?

sakogan · 2023-09-28T18:06:19Z

@loadams Not sure what DeepSpeedExamples change you are referring. I tested it with the latest version of the DeepSpeedExamples master branch. With the proposed fix, the output is valid (as well as when using --dtype float16 in that command, with or without the fix)

loadams · 2023-09-28T18:11:40Z

@loadams Not sure what DeepSpeedExamples change you are referring. I tested it with the latest version of the DeepSpeedExamples master branch. With the proposed fix, the output is valid (as well as when using --dtype float16 in that command, with or without the fix)

@sakogan - apologies, I meant if you tested this change with the error repro you had in DeepSpeedExamples. But sounds like you did, thanks!

Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>

Fix a bug in DeepSpeedMLP

426f3f4

sakogan requested review from RezaYazdaniAminabadi, jeffra, mrwyattii, awan-10, cmikeh2 and arashb as code owners September 22, 2023 18:36

loadams added 2 commits September 25, 2023 08:59

Merge branch 'master' into mlp-bugfix

ec569c8

Merge branch 'master' into mlp-bugfix

fa5f13b

loadams requested a review from lekurile September 28, 2023 16:26

loadams approved these changes Sep 28, 2023

View reviewed changes

loadams enabled auto-merge September 28, 2023 18:25

Merge branch 'master' into mlp-bugfix

f687df1

loadams added this pull request to the merge queue Oct 4, 2023

Merged via the queue into microsoft:master with commit 7099f99 Oct 4, 2023
15 checks passed

mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Oct 9, 2023

Fix a bug in DeepSpeedMLP (microsoft#4389)

4e9a045

Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix a bug in DeepSpeedMLP #4389

Fix a bug in DeepSpeedMLP #4389

sakogan commented Sep 22, 2023

loadams commented Sep 28, 2023

sakogan commented Sep 28, 2023

loadams commented Sep 28, 2023

Fix a bug in DeepSpeedMLP #4389

Fix a bug in DeepSpeedMLP #4389

Conversation

sakogan commented Sep 22, 2023

loadams commented Sep 28, 2023

sakogan commented Sep 28, 2023

loadams commented Sep 28, 2023