
Merge from awslabs/sockeye #2

Merged: 50 commits into lorisbaz:master on Jun 29, 2018

Conversation

lorisbaz (Owner)

No description provided.

logogin and others added 30 commits May 12, 2018 15:01
* Updating default parameters with values from our arXiv paper.
Adding an end-of-sentence symbol to the source side. (#398)

* Removed separate beam prune system test and added pruning to another system test. Prevents timeouts on CRON job for Travis.

* fix
…ture. (#399)

* Argument specifications for each CLI are now stored in a static structure.
Some notes on tests and PyPI.
* Partially reverted 5a3bf5f

* Added support for config files

Specify a config file with --config. Command-line parameters take
precedence over the values read from the config file, as expected.

The config file is a JSON serialization of the namespace returned by
argparse, cast to a dictionary. The file args.json produced by
training can be used as a config file.

The config file does not need to be complete: missing parameters are
read from the command line or take their default values.

Additionally, the functionality introduced in 5a3bf5f has been
reimplemented by accessing the argument_definitions member of
ConfigArgumentParser.
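
For illustration, a minimal config file might look like the following (a sketch; the keys shown are hypothetical examples of argparse destination names, and later commits in this PR switch the format from JSON to YAML):

```
batch_size: 4096
optimizer: adam
initial_learning_rate: 0.0002
```

Passing `--config config.yaml` together with, e.g., `--optimizer sgd` on the command line would then override the optimizer while keeping the other values from the file.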

* Added unit tests for config files

* Addressed (most of) github comments

* switch from json to yaml

* changelog, minor version

* Changed config file format to YAML

* Typo

* mypy

* Added test overwriting config file argument with command line
* Fix logic with training resumption

* fix
* Merge Sockeye Autopilot.

* Typing cleanup.

* Update version, changelog.

* Update description of Autopilot in changelog.
* Added hard lexical constraints (Post & Vilar, NAACL 2018)

This commit adds hard lexical constraints as described in Post & Vilar, 
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation (NAACL 2018).
Hard lexical constraints are words or phrases that must appear in the translation output.

To use this feature, for a particular input sentence, create a (single-line) JSON object of the following form:
```
    { "text": "Einer soll ein hoch@@ rangi@@ ges Mitglied aus Berlin gewesen sein .",
      "constraints": ["is said to", "powerful"] }
```

You then need to pass the JSON input flag (`--json-input`) to `sockeye.translate`.
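
For example, with such JSON objects stored one per line in a file, invocation might look like this (a sketch; the model directory and file names are placeholders):

```
python -m sockeye.translate --models wmt_model --json-input \
    --input input.json --output output.txt
```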
…gged the memory on the primary GPU device over time when many checkpoints were done. Gradient histograms are now logged to Tensorboard separated by device. (#407)
Included tutorial about adapting NMT models, including LHUC.
This PR updates Sockeye to MXNet 1.2, which was released on May 21, 2018.

Core change to Sockeye is the use of the new LayerNormalization operator, which reduces GPU memory usage. It uses the same set of parameters, so existing models remain compatible, but running Sockeye now requires MXNet 1.2.
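
As a rough illustration of the operator (a minimal sketch with made-up shapes, showing MXNet 1.2's LayerNorm interface rather than code from this PR):

```
import mxnet as mx

x = mx.nd.random.uniform(shape=(8, 512))   # (batch, hidden)
gamma = mx.nd.ones((512,))                 # learned scale
beta = mx.nd.zeros((512,))                 # learned shift
y = mx.nd.LayerNorm(x, gamma=gamma, beta=beta, axis=-1)
```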
Introducing the image captioning module. Supported model types: ConvNet encoders and Sockeye NMT decoders.

Features (a usage sketch follows this list):
-   Image encoder that extracts features using pretrained nets: `image_captioning.encoder`
-   Feature extraction script to dump features to disk: `image_captioning.extract_features`
-   Pass-through embedding, since we do not need it for images
-   Image-text iterator that loads features on the fly during training, with the option of loading everything into memory: `image_captioning.data_io`
-   Training and inference pipelines for image captioning: `image_captioning.train`, `image_captioning.inference` and `image_captioning.captioner`
-   README with instructions on how to use the image captioning module: `image_captioning/README.md`
-   Visualization script that loads images and captions (prediction + ground truth) and displays them: `image_captioning.visualize`
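
As promised above, a usage sketch: the module names come from the feature list, but every flag shown here is hypothetical; see `image_captioning/README.md` for the real interface.

```
python -m sockeye.image_captioning.extract_features --image-root images/ --output features/
python -m sockeye.image_captioning.train --source features/ --target captions.txt --output model/
python -m sockeye.image_captioning.captioner --models model/ --input test_features.txt --output captions.out
```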
Also some cleanup of overly long lines, imports etc.
* Added LHUC for transformer model

Note: Changed the apply() function of LHUC to __call__

* Sized down transformer-lhuc integration test
* Downsized integration tests for faster test runs

* downsized integration test data set sizes
* Add a fork of VizTools that generates D3-based graphs of beam searches output by sockeye's `beam_store` output handler.
* Code added under `contrib/`
This cleans up the pruning logic a little bit and continues work started in #422.

Changes include:
1) The modifications to various data structures as a result of pruning are now more local to where they matter.
2) Vectorized the pruning function and moved it to `utils.py` (similar to `topk()`); it is now bound with a partial. Vectorization may help us move operations to HybridBlocks in the future.
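
As a minimal sketch of the idea (numpy instead of MXNet NDArrays, not the actual `utils.py` code): a vectorized prune computes one boolean mask over the whole beam, and `functools.partial` binds the threshold once so beam search can call it like `topk()`:

```
import functools
import numpy as np

def prune(scores, best_finished_score, threshold):
    """Mask hypotheses scoring more than `threshold` worse (higher,
    since lower is better) than the best finished hypothesis."""
    return scores > best_finished_score + threshold

# Bind the beam-prune threshold once; beam search then calls prune_fn(scores, best).
prune_fn = functools.partial(prune, threshold=3.0)
mask = prune_fn(np.array([2.1, 4.9, 7.3]), best_finished_score=2.0)  # [False, False, True]
```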
mjpost and others added 20 commits June 1, 2018 17:38
* bugfix: added cast; plus fixed stupid mistakes in test cases that let it sneak in
This refactors beam search to group fixed-size operations in beam search into cached ops through Gluon HybridBlocks.
My testing showed ~3% speed improvement. Not much, but consistent.

Another change included here is to not pass columns from `sequences` into the decoder step module, but to use `best_word_indices` from the previous iteration instead. NDArray indexing seems expensive, and ideally we should aim to avoid all indexing ops in an iteration.
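
A minimal sketch of the HybridBlock pattern described above (not Sockeye's actual blocks): a fixed-size step expressed as a Gluon HybridBlock can be hybridized so MXNet caches and optimizes the graph across decoder iterations.

```
import mxnet as mx

class LengthNormTopK(mx.gluon.HybridBlock):
    """Toy fixed-size beam-search op: length-normalize scores, take the k best."""
    def __init__(self, k, **kwargs):
        super().__init__(**kwargs)
        self.k = k

    def hybrid_forward(self, F, scores, lengths):
        normalized = F.broadcast_div(scores, lengths)            # (batch, vocab)
        return F.topk(normalized, axis=-1, k=self.k, ret_typ='both')

block = LengthNormTopK(k=5)
block.hybridize()  # subsequent calls reuse a cached, optimized graph
values, indices = block(mx.nd.random.uniform(shape=(2, 100)), mx.nd.ones((2, 1)) * 4)
```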
* Beam search concat and mypy fix.

* fix
* Fix: word-based batching memory use with longer source sentences.

* comment clarification.

* typos
* added wmt18 en<>de to autopilot
* Fix for transformer-with-conv-embed encoder.

* changelog

* version fix
Surprisingly, allocating memory for sequences/attentions once in beam search and writing to index t seems to be slower than concatenating at every step.
Likewise, getting rid of the pad_dist write to the C.PAD_ID index is faster.

This change gives about +1.5 sent/sec on a laptop CPU and +0.1–0.5 on P2/P3 instances with the latest MXNet.
* updated tutorial parameters to use RNN

* typo

* updated layers
* Update requirements to mxnet-*mkl.

* Update documentation and setup for requirements dir.

* Use isclose to slightly relax equality check for coverage test.

(Test was failing with MKL version of MXNet)

* Add requirements to MANIFEST.in.

* Update changelog.
Occasionally this symlink exists and then the whole training procedure dies. My guess is that stray NFS files prevent the deletion of the temporary directory from the previous round. The other files are safely overwritten because they use direct writes instead of a symlink; this should solve that problem.
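
The usual remedy (a sketch, not necessarily the exact code in this change) is to remove a stale symlink before recreating it, mirroring how the direct writes overwrite safely:

```
import os

def update_symlink(target, link_path):
    # A stale link may survive a failed cleanup of the previous round; remove it first.
    if os.path.islink(link_path):
        os.remove(link_path)
    os.symlink(target, link_path)
```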
ROUGE is now available as the stopping criterion for tasks such as summarization.
* Fix source factor splitting for single factor.

- Use a version of ndarray split that always returns a list, for uniform handling.

* Unit test for factor splitting.

* Update version, changelog.

* Keep original split call for sym_gen.
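
The underlying pitfall, sketched (the wrapper name is hypothetical): `mx.nd.split` returns a bare NDArray rather than a list when `num_outputs` is 1, so uniform handling needs a small wrapper.

```
import mxnet as mx

def split_as_list(data, num_outputs, axis=1):
    """Like mx.nd.split, but always returns a list, even for a single factor."""
    out = mx.nd.split(data, num_outputs=num_outputs, axis=axis)
    return [out] if num_outputs == 1 else list(out)
```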
The new pyyaml version is introducing some issues, which we should fix. For now let's depend on the old version to unblock Travis.
…#361)

* add a chapter on multi-instance translation to README.md and the script mlt-trans to run multiple instances

* add a chapter on multi-instance translation to README.md

* change the method for getting the number of physical cores and add hyperlinks for C4 and C5 in README.md

* add an option for running as a benchmark or not

* rewrite the mlt-trans script in Python and move it to tutorials

* change code according to the comments

* move mlt_cpu_trans_benchmark to tutorials and rename it process_per_core_translation

* fix typo

* add the Python script to the tutorial

* add license (Apache) and authors
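
The tutorial script itself is not reproduced in this conversation; as a rough sketch of the process-per-core idea (the core count, paths, and shard names below are placeholders):

```
import subprocess

NUM_CORES = 4  # placeholder: number of physical cores on the host

# Pin one single-threaded translate process to each physical core via taskset.
procs = [
    subprocess.Popen(['taskset', '-c', str(core),
                      'python', '-m', 'sockeye.translate',
                      '--models', 'model_dir',
                      '--input', 'input.shard%d' % core,
                      '--output', 'output.shard%d' % core,
                      '--use-cpu'])
    for core in range(NUM_CORES)
]
for p in procs:
    p.wait()
```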
@lorisbaz lorisbaz merged commit 11d6066 into lorisbaz:master Jun 29, 2018