
Invalid syntax error when unpacking *moe_losses in python-3.7 #24

Closed

adammoody opened this issue Dec 18, 2021 · 3 comments

@adammoody

I am trying to use the new MoE support from DeepSpeed 0.5.8 on a system with Python 3.7.11. However, I get "invalid syntax" errors for statements like:

75: 3:   File "/path/to/megatron/model/language_model.py", line 408
75: 3:     return encoder_output, pooled_output, *moe_losses
75: 3:                                           ^
75: 3: SyntaxError: invalid syntax
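
As a quick sanity check that the construct itself (and not the surrounding code) is the problem, the probe below checks whether the running interpreter accepts bare *-unpacking in a return statement; Python 3.7 rejects it at compile time, and the restriction was only lifted in Python 3.8. (This is just an illustrative standalone check, not Megatron code.)

# Standalone check (not Megatron code): does this interpreter accept bare
# *-unpacking in a return statement? Python 3.7 raises SyntaxError at compile
# time; Python 3.8+ accepts it.
import sys

src = "def f(a, b, rest):\n    return a, b, *rest\n"
try:
    compile(src, "<probe>", "exec")
    print(sys.version_info[:3], "accepts bare *-unpacking in return")
except SyntaxError:
    print(sys.version_info[:3], "rejects it: SyntaxError, as seen above")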

It seems like I can work around those errors by wrapping the return values in a tuple(...) call. I hit this problem in at least the following places.

diff --git a/megatron/model/gpt_model.py b/megatron/model/gpt_model.py
index 4eb983c..1efa1da 100644
--- a/megatron/model/gpt_model.py
+++ b/megatron/model/gpt_model.py
@@ -124,15 +124,23 @@ class GPTModel(MegatronModule):
             get_key_value=get_key_value)
 
         if self.post_process:
-            return post_language_model_processing(
+            #return post_language_model_processing(
+            #    lm_output, labels,
+            #    self.word_embeddings_weight(),
+            #    get_key_value,
+            #    self.parallel_output,
+            #    forward_method_parallel_output,
+            #    self.fp16_lm_cross_entropy), *moe_losses
+            return tuple(post_language_model_processing(
                 lm_output, labels,
                 self.word_embeddings_weight(),
                 get_key_value,
                 self.parallel_output,
                 forward_method_parallel_output,
-                self.fp16_lm_cross_entropy), *moe_losses
+                self.fp16_lm_cross_entropy), *moe_losses)
         else:
-            return lm_output, *moe_losses
+            #return lm_output, *moe_losses
+            return tuple(lm_output, *moe_losses)
 
     def state_dict_for_save_checkpoint(self, destination=None, prefix='',
                                        keep_vars=False):
diff --git a/megatron/model/language_model.py b/megatron/model/language_model.py
index cb27498..e7853e6 100644
--- a/megatron/model/language_model.py
+++ b/megatron/model/language_model.py
@@ -405,9 +405,11 @@ class TransformerLanguageModel(MegatronModule):
         # similarity between two sequences by average pooling
         if not self.add_decoder or output_enc_hidden:
             if self.add_pooler and self.post_process:
-                return encoder_output, pooled_output, *moe_losses
+                #return encoder_output, pooled_output, *moe_losses
+                return tuple(encoder_output, pooled_output, *moe_losses)
             else:
-                return encoder_output, *moe_losses
+                #return encoder_output, *moe_losses
+                return tuple(encoder_output, *moe_losses)
 
         # Decoder Embedding
         dec_embedding_output = self.embedding(dec_input_ids,
@@ -421,9 +423,11 @@ class TransformerLanguageModel(MegatronModule):
                                       enc_dec_attn_mask=enc_dec_attn_mask)
 
         if self.add_pooler and self.post_process:
-            return decoder_output, encoder_output, pooled_output, *moe_losses
+            #return decoder_output, encoder_output, pooled_output, *moe_losses
+            return tuple(decoder_output, encoder_output, pooled_output, *moe_losses)
         else:
-            return decoder_output, encoder_output, *moe_losses
+            #return decoder_output, encoder_output, *moe_losses
+            return tuple(decoder_output, encoder_output, *moe_losses)
 
     def state_dict_for_save_checkpoint(self, destination=None, prefix='',
                                        keep_vars=False):
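
One caveat on the tuple(...) spelling above: tuple() accepts at most one (iterable) argument, so a call like tuple(lm_output, *moe_losses) would raise a TypeError at runtime whenever moe_losses is non-empty. A parenthesized tuple display keeps the original structure and already parses on Python 3.7; a rough sketch with placeholder values (not necessarily what the merged PR ended up doing):

# Placeholder values, just to illustrate the two spellings.
lm_output = "lm_output"
moe_losses = [0.1, 0.2]

# tuple() takes at most one iterable argument, so this fails at runtime:
# tuple(lm_output, *moe_losses)   # TypeError: tuple expected at most 1 argument

# A parenthesized tuple with unpacking (PEP 448) is valid since Python 3.5 and
# preserves the (output, *losses) shape that callers expect:
result = (lm_output, *moe_losses)
print(result)  # ('lm_output', 0.1, 0.2)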
@awan-10

awan-10 commented Feb 24, 2022

@adammoody if the merged PR fixes this issue on your end, please close this issue. And thank you for the PR :)

@conglongli

I believe the merged PR fixes this issue, so closing this now but feel free to reopen if needed.

@adammoody
Author

Yes, I can confirm the merged PR fixed this issue. Thanks!
