-
Notifications
You must be signed in to change notification settings - Fork 538
[CI] AWS batch job tool for GluonNLP (Part I) #1251
Conversation
Codecov Report
@@ Coverage Diff @@
## numpy #1251 +/- ##
=======================================
Coverage 82.44% 82.44%
=======================================
Files 38 38
Lines 5450 5450
=======================================
Hits 4493 4493
Misses 957 957 |
Would you bind the MXNet version to |
I have looked at the CI error log. Now I find that
Maybe the code owner of this file could have a look at this issue and fix it so that CI can pass tests. |
This is due to the recent upgrade of gluon API.
Get Outlook for iOS<https://aka.ms/o0ukef>
________________________________
From: WANG, Chen <notifications@github.com>
Sent: Saturday, June 27, 2020 12:49:59 AM
To: dmlc/gluon-nlp <gluon-nlp@noreply.github.com>
Cc: Xingjian SHI <xshiab@connect.ust.hk>; Team mention <team_mention@noreply.github.com>
Subject: Re: [dmlc/gluon-nlp] [CI] AWS batch job tool for GluonNLP (Part I) (#1251)
I have looked at the CI error log. Now I find that https://github.com/dmlc/gluon-nlp/blob/numpy/tests/test_attention_cell.py#L32 produces the error and it seems that this is a variable type error.
self = <[AttributeError("'MultiHeadAttentionCell' object has no attribute '_query_units'",) raised in repr()] MultiHeadAttentionCell object at 0x7f905e3062e8>
query_units = 16, num_heads = 2, attention_dropout = 0.0, scaled = False
normalized = False, eps = 1e-06, dtype = 'float32', layout = 'NKT'
use_einsum = False, prefix = None, params = None
def __init__(self, query_units=None, num_heads=None, attention_dropout=0.0,
scaled: bool = True, normalized: bool = False, eps: float = 1E-6,
dtype='float32', layout='NTK', use_einsum=False,
prefix=None, params=None):
super().__init__(prefix=prefix, params=params)
E TypeError: __init__() got an unexpected keyword argument 'prefix'
src/gluonnlp/attention_cell.py:606: TypeError
____ test_multi_head_dot_attention_cell[no_share_head-False-False-False-3] _____
Maybe the code owner of this file could have a look at this issue and fix it so that CI can pass tests.
—
You are receiving this because you are on a team that was mentioned.
Reply to this email directly, view it on GitHub<#1251 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABHQH3UUEDVH4VI735CVBMDRYWQCPANCNFSM4OE5OR3A>.
|
rm ~/miniconda.sh && \ | ||
/opt/conda/bin/conda clean -ya | ||
ENV PATH /opt/conda/bin:$PATH | ||
RUN git clone https://github.com/dmlc/gluon-nlp |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we consider swithing this to numpy branch
git fetch origin $SOURCE_REF:working | ||
git checkout working |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@zheyuye branch switch happens when running the job here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks good for me
commit 35a586676036f627bffd0d3c753c6cd0a70d63cf Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 17 10:10:14 2020 +0800 Squashed commit of the following: commit 673344d Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 15 22:43:07 2020 +0800 CharTokenizer commit 8dabfd6 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 15 15:47:24 2020 +0800 lowercase commit f5c94a6 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jul 14 17:45:28 2020 +0800 test commit dc55fc9 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jul 14 05:45:01 2020 +0800 tiny update on run_squad commit 4defc7a Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 23:18:08 2020 +0800 update testings commit 2719e81 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 23:08:32 2020 +0800 re-upload xlmr commit cd0509d Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 22:30:47 2020 +0800 fix get_pretrained commit 8ed8a72 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 22:28:13 2020 +0800 re-upload roberta commit 5811d40 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 18:27:23 2020 +0800 update commit 44a09a3 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sat Jul 11 15:06:33 2020 +0800 fix commit 4074a26 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 10 16:08:49 2020 +0800 inference without horovod commit 31cb953 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 18:41:55 2020 +0800 update commit 838be2a Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 15:14:39 2020 +0800 horovod for squad commit 1d374a2 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 12:09:19 2020 +0800 fix commit e4fba39 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 10:35:08 2020 +0800 remove multiply_grads commit 007f07e Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jul 7 11:26:38 2020 +0800 multiply_grads commit b8c85bb Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 6 12:28:56 2020 +0800 fix ModelForQABasic commit 0e13a58 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sat Jul 4 18:42:12 2020 +0800 clip_grad_global_norm with zeros max_grad_norm commit bd270f2 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 3 20:21:31 2020 +0800 fix roberta commit 4fc564c Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 3 19:36:08 2020 +0800 update hyper-parameters of adamw commit 59cffbf Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 3 16:25:46 2020 +0800 try commit a84f782 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 2 20:39:03 2020 +0800 fix mobilebert commit 4bc3a96 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 2 11:14:39 2020 +0800 layer-wise decay commit 07186d5 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 2 02:14:43 2020 +0800 revise commit a5a6475 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 1 19:50:20 2020 +0800 topk commit 34ee884 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 1 19:25:09 2020 +0800 index_update commit 74178e2 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 1 00:48:32 2020 +0800 rename commit fa011aa Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 23:40:28 2020 +0800 update commit 402d625 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 21:40:30 2020 +0800 multiprocessing for wiki commit ddbde75 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 20:41:35 2020 +0800 fix bookcorpus commit 6cc5ccd Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 16:39:12 2020 +0800 fix wiki commit 9773efd Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 15:52:13 2020 +0800 fix openwebtext commit 1fb8eb8 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 19:51:25 2020 +0800 upload gluon_electra_small_owt commit ca83fac Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 18:09:48 2020 +0800 revise train_transformer commit 1450f5c Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 18:07:04 2020 +0800 revise commit b460bbe Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 17:24:00 2020 +0800 repeat for pretraining commit 8ee381b Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 17:06:43 2020 +0800 repeat commit aea936f Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 16:39:22 2020 +0800 fix mobilebert commit eead164 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 18:44:28 2020 +0800 fix commit 8645115 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 17:27:43 2020 +0800 update commit 2b7f7a3 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 17:18:00 2020 +0800 fix roberta commit 86702fe Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 16:27:43 2020 +0800 use_segmentation commit 6d03d7a Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 15:52:40 2020 +0800 fix commit 5c0ca43 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 15:49:48 2020 +0800 fix token_ids commit ff7aae8 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 13:56:07 2020 +0800 fix xlmr commit 2070b86 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 13:54:26 2020 +0800 fix roberta commit 70a1887 Author: Leonard Lausen <lausen@amazon.com> Date: Fri Jul 17 00:07:08 2020 +0000 Update for Block API (dmlc#1261) - Remove params and prefix arguments for MXNet 2 and update parameter sharing implementation - Remove Block.name_scope() for MXNet 2 - Remove self.params.get() and self.params.get_constant() commit ea9152b Author: Xingjian Shi <xshiab@connect.ust.hk> Date: Thu Jul 16 15:42:04 2020 -0700 Fixes to make the CI more stable (dmlc#1265) * Some fixes to make the CI more stable * add retries * Update tokenizers.py commit a646c34 Author: ht <wawawa@akane.waseda.jp> Date: Sun Jul 12 02:49:53 2020 +0800 [FEATURE] update backtranslation and add multinomial sampler (dmlc#1259) * back translation bash * split "lang-pair" para in clean_tok_para_corpus * added clean_tok_mono_corpus * fix * add num_process para * fix * fix * add yml * rm yml * update cfg name * update evaluate * added max_update / save_interval_update params * fix * fix * multi gpu inference * fix * update * update multi gpu inference * fix * fix * split evaluate and parallel infer * fix * test * fix * update * add comments * fix * remove todo comment * revert remove todo comment * raw lines remove duplicated '\n' * update multinomaial sampler * fix * fix * fix * fix * sampling * update script * fix * add test_case with k > 1 in topk sampling * fix multinomial sampler * update docs * comments situation eos_id = None * fix Co-authored-by: Hu <huta@a483e74650ff.ant.amazon.com> commit 83e1f13 Author: Leonard Lausen <lausen@amazon.com> Date: Thu Jul 9 20:57:55 2020 -0700 Use Amazon S3 Transfer Acceleration (dmlc#1260) commit cd48efd Author: Leonard Lausen <lausen@amazon.com> Date: Tue Jul 7 17:39:42 2020 -0700 Update codecov action to handle different OS and Python versions (dmlc#1254) codecov/codecov-action#80 (comment) commit 689eba9 Author: Sheng Zha <szha@users.noreply.github.com> Date: Tue Jul 7 09:55:34 2020 -0700 [CI] AWS batch job tool for GluonNLP (Part I) (dmlc#1251) * AWS batch job tool for GluonNLP * limit range Co-authored-by: Xingjian Shi <xshiab@connect.ust.hk> commit e06ff01 Author: Leonard Lausen <lausen@amazon.com> Date: Tue Jul 7 08:36:24 2020 -0700 Pin mxnet version range on CI (dmlc#1257)
* fix roberta * fix xlmr * fix token_ids * fix * use_segmentation * fix roberta * update * fix * fix mobilebert * repeat * repeat for pretraining * revise * revise train_transformer * upload gluon_electra_small_owt * fix openwebtext * fix wiki * fix bookcorpus * multiprocessing for wiki * update * rename * index_update * topk * revise * layer-wise decay * fix mobilebert * try * update hyper-parameters of adamw * fix roberta * clip_grad_global_norm with zeros max_grad_norm * fix ModelForQABasic * multiply_grads * remove multiply_grads * fix * horovod for squad * update * inference without horovod * fix * update * re-upload roberta * fix get_pretrained * re-upload xlmr * update testings * tiny update on run_squad * test * lowercase * CharTokenizer * Squashed commit of the following: commit 35a586676036f627bffd0d3c753c6cd0a70d63cf Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 17 10:10:14 2020 +0800 Squashed commit of the following: commit 673344d Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 15 22:43:07 2020 +0800 CharTokenizer commit 8dabfd6 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 15 15:47:24 2020 +0800 lowercase commit f5c94a6 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jul 14 17:45:28 2020 +0800 test commit dc55fc9 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jul 14 05:45:01 2020 +0800 tiny update on run_squad commit 4defc7a Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 23:18:08 2020 +0800 update testings commit 2719e81 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 23:08:32 2020 +0800 re-upload xlmr commit cd0509d Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 22:30:47 2020 +0800 fix get_pretrained commit 8ed8a72 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 22:28:13 2020 +0800 re-upload roberta commit 5811d40 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 13 18:27:23 2020 +0800 update commit 44a09a3 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sat Jul 11 15:06:33 2020 +0800 fix commit 4074a26 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 10 16:08:49 2020 +0800 inference without horovod commit 31cb953 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 18:41:55 2020 +0800 update commit 838be2a Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 15:14:39 2020 +0800 horovod for squad commit 1d374a2 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 12:09:19 2020 +0800 fix commit e4fba39 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 9 10:35:08 2020 +0800 remove multiply_grads commit 007f07e Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jul 7 11:26:38 2020 +0800 multiply_grads commit b8c85bb Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jul 6 12:28:56 2020 +0800 fix ModelForQABasic commit 0e13a58 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sat Jul 4 18:42:12 2020 +0800 clip_grad_global_norm with zeros max_grad_norm commit bd270f2 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 3 20:21:31 2020 +0800 fix roberta commit 4fc564c Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 3 19:36:08 2020 +0800 update hyper-parameters of adamw commit 59cffbf Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Fri Jul 3 16:25:46 2020 +0800 try commit a84f782 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 2 20:39:03 2020 +0800 fix mobilebert commit 4bc3a96 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 2 11:14:39 2020 +0800 layer-wise decay commit 07186d5 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Thu Jul 2 02:14:43 2020 +0800 revise commit a5a6475 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 1 19:50:20 2020 +0800 topk commit 34ee884 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 1 19:25:09 2020 +0800 index_update commit 74178e2 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Wed Jul 1 00:48:32 2020 +0800 rename commit fa011aa Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 23:40:28 2020 +0800 update commit 402d625 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 21:40:30 2020 +0800 multiprocessing for wiki commit ddbde75 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 20:41:35 2020 +0800 fix bookcorpus commit 6cc5ccd Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 16:39:12 2020 +0800 fix wiki commit 9773efd Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Tue Jun 30 15:52:13 2020 +0800 fix openwebtext commit 1fb8eb8 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 19:51:25 2020 +0800 upload gluon_electra_small_owt commit ca83fac Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 18:09:48 2020 +0800 revise train_transformer commit 1450f5c Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 18:07:04 2020 +0800 revise commit b460bbe Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 17:24:00 2020 +0800 repeat for pretraining commit 8ee381b Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 17:06:43 2020 +0800 repeat commit aea936f Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Mon Jun 29 16:39:22 2020 +0800 fix mobilebert commit eead164 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 18:44:28 2020 +0800 fix commit 8645115 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 17:27:43 2020 +0800 update commit 2b7f7a3 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 17:18:00 2020 +0800 fix roberta commit 86702fe Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 16:27:43 2020 +0800 use_segmentation commit 6d03d7a Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 15:52:40 2020 +0800 fix commit 5c0ca43 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 15:49:48 2020 +0800 fix token_ids commit ff7aae8 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 13:56:07 2020 +0800 fix xlmr commit 2070b86 Author: ZheyuYe <zheyu.ye1995@gmail.com> Date: Sun Jun 28 13:54:26 2020 +0800 fix roberta commit 70a1887 Author: Leonard Lausen <lausen@amazon.com> Date: Fri Jul 17 00:07:08 2020 +0000 Update for Block API (#1261) - Remove params and prefix arguments for MXNet 2 and update parameter sharing implementation - Remove Block.name_scope() for MXNet 2 - Remove self.params.get() and self.params.get_constant() commit ea9152b Author: Xingjian Shi <xshiab@connect.ust.hk> Date: Thu Jul 16 15:42:04 2020 -0700 Fixes to make the CI more stable (#1265) * Some fixes to make the CI more stable * add retries * Update tokenizers.py commit a646c34 Author: ht <wawawa@akane.waseda.jp> Date: Sun Jul 12 02:49:53 2020 +0800 [FEATURE] update backtranslation and add multinomial sampler (#1259) * back translation bash * split "lang-pair" para in clean_tok_para_corpus * added clean_tok_mono_corpus * fix * add num_process para * fix * fix * add yml * rm yml * update cfg name * update evaluate * added max_update / save_interval_update params * fix * fix * multi gpu inference * fix * update * update multi gpu inference * fix * fix * split evaluate and parallel infer * fix * test * fix * update * add comments * fix * remove todo comment * revert remove todo comment * raw lines remove duplicated '\n' * update multinomaial sampler * fix * fix * fix * fix * sampling * update script * fix * add test_case with k > 1 in topk sampling * fix multinomial sampler * update docs * comments situation eos_id = None * fix Co-authored-by: Hu <huta@a483e74650ff.ant.amazon.com> commit 83e1f13 Author: Leonard Lausen <lausen@amazon.com> Date: Thu Jul 9 20:57:55 2020 -0700 Use Amazon S3 Transfer Acceleration (#1260) commit cd48efd Author: Leonard Lausen <lausen@amazon.com> Date: Tue Jul 7 17:39:42 2020 -0700 Update codecov action to handle different OS and Python versions (#1254) codecov/codecov-action#80 (comment) commit 689eba9 Author: Sheng Zha <szha@users.noreply.github.com> Date: Tue Jul 7 09:55:34 2020 -0700 [CI] AWS batch job tool for GluonNLP (Part I) (#1251) * AWS batch job tool for GluonNLP * limit range Co-authored-by: Xingjian Shi <xshiab@connect.ust.hk> commit e06ff01 Author: Leonard Lausen <lausen@amazon.com> Date: Tue Jul 7 08:36:24 2020 -0700 Pin mxnet version range on CI (#1257) * frozen_params * remove conversion to a sperate pr * fix * fix * update * test * revise * update performance numbers * update apply_layerwisw_decay * use shuffle * fix mobilebert * fix vocab_file
Description
AWS batch job tool for GluonNLP
Checklist
Essentials
Changes
Comments
cc @dmlc/gluon-nlp-team