-
Notifications
You must be signed in to change notification settings - Fork 241
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enable INC Distributed Tuning (#470)
* draft multinode code * fix wrong cur_cfg_id * add more details for pipeline Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * fix end tag and boundary overflow Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * remove tune cfg list limitation * draft distributed basic strategy Signed-off-by: spycsh <sihan.chen@intel.com> * fix syntax bugs * set q model * add some debug info * fix bug * fix best_tune_cfg_id bug Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * add multi stage detail * remove debug info Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * add UT flow Signed-off-by: Spycsh <sihan.chen@intel.com> * add UT example Signed-off-by: Spycsh <sihan.chen@intel.com> * distributed config * fix use_distributed_tuning config bugs * updated the accuracy comparison Signed-off-by: yiliu30 <yi4.liu@intel.com> * logger info * add UT stage 2 * add more UT without coordination on fake_eval index Signed-off-by: spycsh <sihan.chen@intel.com> * add UTs Signed-off-by: yiliu30 <yi4.liu@intel.com> * Add WA for UTs Signed-off-by: yiliu30 <yi4.liu@intel.com> * revert wrongly commented next_tune_cfg code * fix fake eval in UT Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * fix bug in tuning config gap on single-node and multi-node Signed-off-by: spycsh <sihan.chen@intel.com> * set tune_cfg_lst as a class attribute & delete debug info Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * bypass onnx exporting on slave ndoes * enable example on multi nodes Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * remove UT tmp files Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * update op_fallback_acc_impact after stage 3 Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * remove redundant code for strategy Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * set use distributed tuning in example Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * skip UT if no mpi4py Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * add docstring Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * use LazyImport and code check Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * add a blank line between summary line and description Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * code check Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> * refine document Signed-off-by: spycsh <sihan.chen@intel.com> * remove Lazyimport mpi4py line Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> --------- Signed-off-by: Kaihui-intel <kaihui.tang@intel.com> Signed-off-by: spycsh <sihan.chen@intel.com> Signed-off-by: Spycsh <sihan.chen@intel.com> Signed-off-by: yiliu30 <yi4.liu@intel.com> Co-authored-by: Kaihui-intel <kaihui.tang@intel.com> Co-authored-by: yiliu30 <yi4.liu@intel.com>
- Loading branch information
1 parent
08e2551
commit e1fe50e
Showing
11 changed files
with
785 additions
and
46 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
4 changes: 4 additions & 0 deletions
4
...ggingface_models/text-classification/quantization/ptq_static/fx/run_distributed_tuning.sh
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
source ~/miniconda3/etc/profile.d/conda.sh | ||
conda activate MPIENV | ||
cd /YOURWORKDIR/examples/pytorch/nlp/huggingface_models/text-classification/quantization/ptq_static/fx | ||
python -u ./run_glue.py --model_name_or_path distilbert_mrpc --task_name mrpc --do_eval --max_seq_length 128 --per_device_eval_batch_size 16 --no_cuda --output_dir ./int8_model_dir --tune --overwrite_output_dir |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.