Add allennlp example. #949
Conversation
Codecov Report
@@ Coverage Diff @@
## master #949 +/- ##
==========================================
- Coverage 90.22% 90.19% -0.04%
==========================================
Files 112 114 +2
Lines 9293 9506 +213
==========================================
+ Hits 8385 8574 +189
- Misses 908 932 +24
Continue to review full report at Codecov.
Thank you for your great PR!
I have some small comments. In addition, I found that this example requires hours of computation time without GPUs. Is it possible to reduce the runtime to about ten minutes? We execute all examples in daily CI on VMs without GPUs.
examples/allennlp_simple.py
Outdated

import allennlp.modules
import optuna
import torch
import uuid
Could you separate the standard-library imports from the third-party imports?
(c.f., #514)
Sorry, fixed in e72dc3d.
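For reference, the requested grouping (standard-library imports first, then third-party imports after a blank line, as in #514) might look like the sketch below; the third-party lines are commented out so the snippet stands alone:

```python
# Standard-library imports come first, alphabetized:
import os
import uuid

# A blank line then separates the third-party imports used by the
# example (commented out here so this sketch has no dependencies):
# import allennlp.modules
# import optuna
# import torch
```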
examples/allennlp_simple.py
Outdated

if __name__ == '__main__':
    optuna.logging.set_verbosity(optuna.logging.WARNING)
Could you tell me why you suppress Optuna's log messages?
I didn't have any specific intention. 🙇
(I made this PR based on examples/chainer_simple.py, which has this setting.)
I removed it in 037d12a.
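For context, `optuna.logging.set_verbosity(optuna.logging.WARNING)` raises Optuna's log threshold so per-trial INFO messages are suppressed. The same idea can be shown with only the standard library; the logger name `optuna` below is just illustrative:

```python
import logging

# Raising the level threshold drops INFO-level trial messages,
# mirroring optuna.logging.set_verbosity(optuna.logging.WARNING).
logger = logging.getLogger("optuna")
logger.setLevel(logging.WARNING)

print(logger.isEnabledFor(logging.WARNING))  # True
print(logger.isEnabledFor(logging.INFO))     # False
```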
Co-Authored-By: Toshihiko Yanase <toshihiko.yanase@gmail.com>
Thank you for reflecting my comments. The current version is much faster than the previous one.
I have some comments, so please check them.
examples/allennlp_simple.py
Outdated

# Run tuning with small portion of data
# to reduce computational time.
# https://github.com/optuna/optuna/pull/949#pullrequestreview-364110499
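One common way to keep CI runtime down, as the comment above describes, is to subsample the training data before tuning. A stdlib-only sketch of the idea (the function name and fraction are illustrative, not taken from the PR):

```python
import random

def subsample(dataset, fraction, seed=0):
    # Keep a fixed, reproducible fraction of the examples so each
    # trial trains on far less data (sketch; not the PR's exact code).
    rng = random.Random(seed)
    size = max(1, int(len(dataset) * fraction))
    return rng.sample(dataset, size)

subset = subsample(list(range(1000)), 0.1)
print(len(subset))  # 100
```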
IMO, we can remove this link, following past PRs. Maybe it would be helpful to share code conventions like this in a design document.
examples/allennlp_simple.py
Outdated

patience=3,
num_epochs=6,
cuda_device=DEVICE,
serialization_dir=f'/tmp/xx/{uuid.uuid1()}',
It would be difficult for users to find the model corresponding to a specific trial. This is partly because the directory names are generated from uuid, and partly because the directory is not shown in the log messages. What do you think about removing it for simplicity? If you remove it, please delete import uuid too.
serialization_dir=f'/tmp/xx/{uuid.uuid1()}',
Based on pytorch_lightning_simple.py, I revised my PR to use trial.number for creating the model serialization directory. e534b92
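The switch from uuid.uuid1() to trial.number makes each trial's directory predictable and traceable. A minimal sketch of the idea (the helper name and base path are illustrative, not the PR's code):

```python
import os

def serialization_dir(base_dir, trial_number):
    # One directory per trial, named after the trial number, so a
    # saved model can be traced back to its trial.
    return os.path.join(base_dir, "trial_{}".format(trial_number))

print(serialization_dir("/tmp/allennlp_models", 7))
# /tmp/allennlp_models/trial_7
```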
setup.py
Outdated

@@ -73,6 +73,7 @@ def get_extras_require():
        'sphinx_rtd_theme',
    ],
    'example': [
        'allennlp',
allennlp does not support Python 3.5 (c.f., https://pypi.org/project/allennlp/). Please exclude it from the installation targets if the Python version is 3.5. In addition, please exclude examples/allennlp from the CI targets if the job is examples-python35.
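One way to implement the requested exclusion is a runtime check on the interpreter version inside get_extras_require(). A sketch under the assumption that the extras list is built in Python (the helper name is illustrative; the real list lives in setup.py):

```python
import sys

def example_requires():
    # Start from the existing 'example' extras (elided here) and add
    # allennlp only on Python 3.6+, since allennlp dropped 3.5 support.
    requires = []
    if sys.version_info >= (3, 6):
        requires.append("allennlp")
    return requires

print(example_requires())
```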
Co-Authored-By: Toshihiko Yanase <toshihiko.yanase@gmail.com>
LGTM. Thank you for your contribution.
[Note]
allennlp will change the arguments of the trainer. For example, train_dataset and validation_dataset will be replaced with dataloader.
Thanks for the PR! I left some comments; could you take a look?
examples/allennlp_simple.py
Outdated

MODEL_DIR = os.path.join(DIR, 'result')

GLOVE_FILE_PATH = (
    'https://s3-us-west-2.amazonaws.com/allennlp/datasets/glove/glove.6B.50d.txt.gz'
Sorry, I'm not familiar with AllenNLP, but is this AWS URL (and the ones below) officially documented somewhere?
This example is based on allentune's official example, and this AWS URL is used there.
@hvy Thank you for taking the time. I revised my PR!
examples/allennlp_simple.py
Outdated

model = create_model(vocab, trial)

if DEVICE > -1:
    model.cuda(DEVICE)
Nit: model.to(torch.device('cuda:{}'.format(DEVICE))).
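The suggestion builds a torch.device from the integer index instead of calling model.cuda directly. A torch-free sketch of the same device-selection logic (the helper is illustrative, not from the PR):

```python
def device_string(device_index):
    # A negative index means CPU; otherwise select that CUDA device,
    # matching the `if DEVICE > -1` check in the example.
    return "cpu" if device_index < 0 else "cuda:{}".format(device_index)

print(device_string(-1))  # cpu
print(device_string(0))   # cuda:0
```

With torch installed, the resulting string can be passed straight to torch.device and then to model.to.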
Sorry for the delayed review; changes LGTM.
Let me modify the title of this PR to match the format of our release notes.
This PR adds an example that uses allennlp to define and train a model.
(Edited: this example is based on allentune's official example.)