
Enable Transformer LT search space for Dynamic Neural Architecture Search Toolkit #197

Merged
merged 68 commits into from Dec 12, 2022

Conversation

macsz (Contributor) commented Nov 30, 2022

Type of Change

Feature. Extends the Dynamic Neural Architecture Search Toolkit's set of supported search spaces with a Transformer LT super-network for En-De language translation.

Description

DyNAS-T (Dynamic Neural Architecture Search Toolkit) is a SuperNet NAS optimization package (part of Intel Neural Compressor) designed to find the optimal Pareto front during neural architecture search while minimizing the number of search validation measurements. It supports single-, multi-, and many-objective problems across a variety of domains, and currently relies heavily on the pymoo optimization library. Some key DyNAS-T features:

  • Automatic handling of super-network parameters for search and predictor training
  • Genetic-algorithm (e.g., NSGA-II) multi-objective subnetwork search
  • LINAS (Lightweight Iterative Neural Architecture Search): accelerated search using approximate predictors
  • Warm-start (transfer) search
  • Search population statistical analysis
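The LINAS idea above (use a cheap approximate predictor to rank many candidate subnetworks, then spend expensive validation only on the top-ranked few) can be sketched as follows. This is an illustrative toy, not the DyNAS-T implementation: the search space, the `measure` objective, and the nearest-neighbor "predictor" are all invented for the example.

```python
import random

def measure(config):
    """Stand-in for an expensive validation run; peaks when every gene equals 3."""
    return -sum((x - 3) ** 2 for x in config)

def predict(config, history):
    """Toy approximate predictor: score of the nearest already-measured config."""
    nearest = min(history, key=lambda c: sum((a - b) ** 2 for a, b in zip(c, config)))
    return history[nearest]

random.seed(0)

# Warm-start: measure a handful of random configurations up front
history = {}
for _ in range(5):
    cfg = tuple(random.choices(range(6), k=4))
    history[cfg] = measure(cfg)

for _ in range(10):  # LINAS-style iterations
    candidates = [tuple(random.choices(range(6), k=4)) for _ in range(50)]
    # Rank all candidates with the cheap predictor...
    ranked = sorted(candidates, key=lambda c: predict(c, history), reverse=True)
    # ...but run the expensive measurement only on the top few
    for cfg in ranked[:3]:
        history[cfg] = measure(cfg)

best = max(history.values())
print(best)  # best measured score so far (0 is the toy optimum)
```

The point of the pattern is the budget split: 50 candidates are scored per iteration, but only 3 validation measurements are spent, which is how LINAS keeps the count of expensive evaluations low.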

This PR extends the supported search spaces with a Transformer-based language-translation super-network (transformer_lt_wmt_en_de) for English-German. The supernet implementation is based on Hardware Aware Transformers (HAT) by MIT HAN Lab.

How has this PR been tested?

To run an example, trained supernet weights and the preprocessed WMT En-De dataset are needed. Both can be downloaded from the Hardware Aware Transformers (HAT) repository.

  • Script to download preprocessed dataset: link
  • Download trained supernet weights: link

Example code to test new functionality:

# Import paths below follow the Neural Compressor API at the time of this PR
from neural_compressor.conf.config import NASConfig
from neural_compressor.experimental.nas import NAS

config = NASConfig(approach='dynas', search_algorithm='nsga2')
config.dynas.supernet = 'transformer_lt_wmt_en_de'
config.seed = 42
config.dynas.metrics = ['acc', 'macs']

config.dynas.population = 50
config.dynas.num_evals = 500
config.dynas.batch_size = 64
config.dynas.results_csv_path = 'results.csv'
config.dynas.dataset_path = '/datasets/hat_dataset/data/binary/wmt16_en_de'
config.dynas.supernet_ckpt_path = '/datasets/hat_dataset/HAT_wmt14ende_super_space0.pt'
agent = NAS(config)
results = agent.search()
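With metrics ['acc', 'macs'], each evaluated subnetwork contributes one (accuracy, MACs) point to the results CSV, and the search output of interest is the Pareto front over those points. A minimal sketch of extracting that front is below; the CSV layout is not specified here, so synthetic points stand in for real rows.

```python
def pareto_front(points):
    """Keep points not dominated by any other: a point dominates another when it
    has accuracy >= and MACs <= the other, with at least one strict inequality."""
    return [
        (acc, macs)
        for acc, macs in points
        if not any(
            a >= acc and m <= macs and (a > acc or m < macs)
            for a, m in points
        )
    ]

# Synthetic (accuracy, MACs) pairs standing in for measured subnetworks
points = [(0.70, 500), (0.72, 450), (0.68, 400), (0.75, 600)]
print(pareto_front(points))  # [(0.72, 450), (0.68, 400), (0.75, 600)]
```

Here (0.70, 500) is dropped because (0.72, 450) is both more accurate and cheaper; the three remaining points are the accuracy/MACs trade-off curve.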

Dependency Change?

Yes, this PR adds the following dependencies:

  • fairseq
  • sacremoses
  • torchprofile

Signed-off-by: Maciej Szankin <maciej.szankin@intel.com>

macsz (Contributor, Author) commented Dec 10, 2022

The CI is failing on a timeout now; the same test passes locally. I will re-run to see if anything changes...

chensuyue (Contributor) commented:

/Azurepipeline run

ftian1 (Contributor) commented Dec 12, 2022

@macsz we have root-caused the timeout issue and fixed it. We also rebased the branch. We are now waiting for the test report; if it passes, we will merge today.

azure-pipelines commented:

Azure Pipelines successfully started running 4 pipeline(s).

macsz (Contributor, Author) commented Dec 12, 2022

> @macsz we have root caused the timeout issue and fixed it. we also made rebase operation. now we are waiting for test report. if it passes, we will merge it today.

Thanks! I appreciate the help with the rebase to speed things up. I will be monitoring the PR as well.

chensuyue (Contributor) commented:

[screenshot: coverage report]

Failed with a UT coverage regression. I think we can merge first for the code freeze.

macsz (Contributor, Author) commented Dec 12, 2022

Merged master and removed the cleanup in the UTs that I had added before you fixed the model cache problem. This triggered the CI to re-run the tests.

So are we OK with this PR once it passes? To avoid additional delays I won't touch anything until you say so.

By the way, where in the CI can I see the visual test coverage report like the one you posted, @chensuyue? I tried decoding the raw output, but it was a bit too much...

chensuyue (Contributor) commented:

  1. Go to the artifacts. [screenshot]
  2. Download the coverage report package. [screenshot]
  3. Open the coverage compare HTML. [screenshot]

macsz (Contributor, Author) commented Dec 12, 2022

Should we merge now for the code freeze, as @chensuyue suggested? I will work on adding unit tests as a follow-up to this PR.

@chensuyue chensuyue merged commit 40ab5a3 into master Dec 12, 2022
@chensuyue chensuyue deleted the dynas/transformer branch December 12, 2022 11:06
zehao-intel pushed a commit that referenced this pull request Dec 20, 2022
…arch Toolkit (#197)

Signed-off-by: Maciej Szankin <maciej.szankin@intel.com>
Co-authored-by: Nittur Sridhar, Sharath <sharath.nittur.sridhar@intel.com>
Co-authored-by: Xinyu Ye <xinyu.ye@intel.com>
Signed-off-by: zehao-intel <zehao.huang@intel.com>
VincyZhang pushed a commit that referenced this pull request Feb 12, 2023
* add primitive_cache & weight_sharing & dispatcher_tuning ut

* fix pybandit

* add weight sharing with dispatcher ut

* change model load method

* change model load method

* add the ut of dispatcher tuning perf

* remove unuse files

* change moudle,dataset address

* remove unuse file

* fix the ut and add glog level =2

* add the time module

* change format

* remove the ir

* modify

* modify-

* review modify

* test modify

* modify for unuse

Co-authored-by: Wang, Wenqi2 <wenqi2.wang@intel.com>
Co-authored-by: sys-lpot-val <sys_lpot_val@intel.com>
Co-authored-by: Bo Dong <bo1.dong@intel.com>

5 participants