second tutorial (configuration.md) #212
Conversation
> let's take a deeper look at our experiment configuration file,
> [tutorials/getting_started/simple_tagger.json](https://github.com/allenai/allennlp/blob/master/tutorials/getting_started/simple_tagger.json).
>
> The configuration is a JSON (or [HOCON](https://github.com/typesafehub/config/blob/master/HOCON.md)) object
I find this confusing. Is it JSON or HOCON? Perhaps you mean "The configuration is HOCON (but don't worry if you are not familiar with HOCON--JSON is a valid subset of HOCON)"
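To make the "JSON is a valid subset of HOCON" point concrete, here is a minimal sketch: a config written as plain JSON parses fine with a JSON parser, and a HOCON parser would accept the same file unchanged. The specific keys (`model`, `trainer`) are just illustrative.

```python
import json

# A config written as plain JSON. Because JSON is a valid subset of HOCON,
# a HOCON parser would accept this exact text as well; HOCON just adds
# extras (comments, unquoted keys, substitutions) on top.
config_text = """
{
  "model": {"type": "simple_tagger"},
  "trainer": {"num_epochs": 40}
}
"""

config = json.loads(config_text)
print(config["trainer"]["num_epochs"])
```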
> that we need to override the list of non-padded namespaces:
>
> ```js
> "non_padded_namespaces": [],
> ```
Are we doing this to get padding, or simply `@@UNKNOWN@@` tokens for certain postags?
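For context on what's at stake in this question, here is an illustrative sketch (not AllenNLP's actual `Vocabulary` class) of the difference: a padded namespace reserves low indices for padding and unknown tokens, while a non-padded namespace maps real entries straight to 0, 1, 2, …

```python
# Toy illustration of padded vs. non-padded namespaces. Padded namespaces
# reserve index 0 for a padding token and index 1 for an out-of-vocabulary
# token; label-like namespaces usually want neither.
PADDING, UNKNOWN = "@@PADDING@@", "@@UNKNOWN@@"

def build_index(tokens, padded=True):
    index = {PADDING: 0, UNKNOWN: 1} if padded else {}
    for token in tokens:
        if token not in index:
            index[token] = len(index)
    return index

word_index = build_index(["the", "dog", "barks"], padded=True)
tag_index = build_index(["DET", "NN", "VB"], padded=False)
print(word_index)  # {'@@PADDING@@': 0, '@@UNKNOWN@@': 1, 'the': 2, 'dog': 3, 'barks': 4}
print(tag_index)   # {'DET': 0, 'NN': 1, 'VB': 2}
```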
> Let's first look at the text field embedder configuration:
>
> ```js
> "text_field_embedder": {
> ```
The indentation here is quite weird.
Would it help to have comments on these lines?
> The `"tokens"` namespace (which consists of integer encodings of the lowercased words in the input)
> gets fed into an
> [`Embedding`](http://docs.allennlp.org/en/latest/api/allennlp.modules.token_embedders.html?highlight=embedding#allennlp.modules.token_embedders.embedding.Embedding)
> module that embeds the vocabulary words in a 50-dimensional space.
append "(specified by `embedding-dim`)"?
> ```js
> "stacked_encoder": {
>     "type": "lstm",
> ```
I would only indent by 4, rather than by 8. You could also remove the initial whitespace.
> concatenated with a 50-dimensional vector for `"token_characters"`;
> that is, a 100-dimensional vector.
>
> ### The Seq2SeqEncoder
Why is this a `Seq2SeqEncoder`? The JSON below has nothing about `Seq2Seq` but rather specifies `stacked_encoder`. I'm guessing that `stacked_encoder` = `Seq2SeqEncoder`, but it's an implicit connection.
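The implicit connection flagged here is the by-name registration pattern: the config key (`stacked_encoder`) names the model's *parameter*, while `"type": "lstm"` selects which registered encoder subclass gets built. A toy sketch of that pattern (hypothetical classes, not AllenNLP's real code):

```python
# Toy sketch of a type registry. The "type" field in a config block picks a
# registered subclass; the surrounding key ("stacked_encoder") is just the
# name of the constructor argument that receives the built object.
class Seq2SeqEncoder:
    registry = {}

    @classmethod
    def register(cls, name):
        def decorator(subclass):
            cls.registry[name] = subclass
            return subclass
        return decorator

    @classmethod
    def from_params(cls, params):
        # pop "type" so the remaining params become constructor kwargs
        subclass = cls.registry[params.pop("type")]
        return subclass(**params)

@Seq2SeqEncoder.register("lstm")
class LstmEncoder(Seq2SeqEncoder):
    def __init__(self, input_size, hidden_size):
        self.input_size = input_size
        self.hidden_size = hidden_size

encoder = Seq2SeqEncoder.from_params(
    {"type": "lstm", "input_size": 100, "hidden_size": 100})
print(type(encoder).__name__)  # LstmEncoder
```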
> Finally, we'll run the training for 40 epochs;
> we'll stop prematurely if we get no improvement for 10 epochs;
> and we'll train on the CPU.
You might add instruction (or point somewhere) for how to specify a GPU, as this field is a bit opaque.
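For reference, the device is selected in the trainer block via `cuda_device` — a sketch of what that might look like (surrounding keys shown for context; values are illustrative):

```js
"trainer": {
    "num_epochs": 40,
    "patience": 10,
    // -1 means CPU; a non-negative integer is a GPU id,
    // e.g. "cuda_device": 0 to train on the first GPU
    "cuda_device": -1
}
```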
> ## What's Next
>
> TODO(joelgrus): next part of the tutorial
TODO
Love it!
> Each encoding has a "namespace", in this case `"tokens"` and `"token_characters"`.
> The `SequenceLabelField` also has a namespace, `"labels"`.
>
> ## non-padded namespaces
I was hoping this would be a topic almost all users could ignore, that just shows up in some deep dive. It's unfortunate that we need to use it here. Can you construct the validation dataset such that it's not necessary in this tutorial?
yeah, that's a good idea, I'll just filter out the lines with the stray tags
ok, I fixed the datasets and removed all references to non-padded namespaces from the tutorial and config file
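A hypothetical sketch of the fix described here — dropping dataset lines containing stray tags. The `word###tag` line format and the allowed tag set are assumptions for illustration, not the actual dataset details:

```python
# Keep only lines whose tags all belong to an allowed set; lines with
# stray tags are filtered out. Format and tag set are illustrative.
ALLOWED_TAGS = {"DET", "NN", "VB"}

def filter_lines(lines):
    kept = []
    for line in lines:
        tags = [pair.rsplit("###", 1)[1] for pair in line.split()]
        if all(tag in ALLOWED_TAGS for tag in tags):
            kept.append(line)
    return kept

lines = ["the###DET dog###NN barks###VB", "stray###FW tag###NN"]
print(filter_lines(lines))  # ['the###DET dog###NN barks###VB']
```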
👍
in addition, renamed `run` to `run.py` (I left `run` there so as not to break anyone's workflow, but it logs a "deprecated" warning, and I plan to get rid of it eventually). `python -m allennlp.run` exposes `main` as `allennlp.commands.main()` so that we can tell people to make their own run scripts by importing and calling it.
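The deprecated-alias pattern described here can be sketched generically (function names are stand-ins, not AllenNLP's actual module layout): the old name stays callable but warns and delegates to the new entry point.

```python
import warnings

def main():
    # the real entry point (stand-in for something like allennlp.commands.main)
    return "ran"

def run():
    # deprecated alias kept so existing workflows don't break:
    # log a warning, then delegate to main()
    warnings.warn("`run` is deprecated; call `main()` instead",
                  DeprecationWarning, stacklevel=2)
    return main()
```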