Language model classes for making predictions (both masked LM and next token LM) #3201

matt-gardner · 2019-08-26T19:32:15Z

These models are only tested for predictions, not for training. The masked LM might be useful for training, but also might just be too slow. The next token LM definitely is not for training. These are noted in the class docstrings.

DeNeutoy

LGTM, although having this in the main library is still a very questionable decision in my opinion. We're just creating work for ourselves when there is a clear route to having this in the demo with a separate git repo/package.

DeNeutoy · 2019-08-28T18:21:01Z

allennlp/data/dataset_readers/masked_language_modeling.py

@@ -55,7 +55,7 @@ def _read(self, file_path: str):
        import sys
        # You can call pytest with either `pytest` or `py.test`.
        if 'test' not in sys.argv[0]:
-            raise RuntimeError('_read is only implemented for unit tests at the moment')
+            logger.error('_read is only implemented for unit tests at the moment')


How come these can't be RuntimeErrors?

It worked locally, but the check above didn't work in TeamCity, for reasons I'm not really sure of. I couldn't easily find what sys.argv would be in TeamCity, and I didn't want to deal with a 20 minute debug cycle to figure it out.
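
For reference, a check that does not depend on how the test runner was launched might look like the sketch below. It is not what this PR does. The helper name `running_under_pytest` and its parameters are hypothetical, and whether it would have behaved correctly on the CI agent is untested. It relies on pytest setting the `PYTEST_CURRENT_TEST` environment variable while a test is executing, and falls back to checking whether `pytest` has been imported.

```python
import os
import sys
from typing import Mapping, Optional


def running_under_pytest(environ: Optional[Mapping[str, str]] = None,
                         modules: Optional[Mapping[str, object]] = None) -> bool:
    """Best-effort guess at whether the current process is a pytest run.

    Hypothetical helper for illustration; not part of AllenNLP.
    The parameters exist only so the logic can be exercised with fake data.
    """
    environ = os.environ if environ is None else environ
    modules = sys.modules if modules is None else modules
    # pytest sets this variable while a test is being executed.
    if "PYTEST_CURRENT_TEST" in environ:
        return True
    # Weaker fallback: pytest has been imported somewhere in this process.
    return "pytest" in modules
```

The fallback can give a false positive if some other code imports `pytest` outside a test run, so the environment variable is the stronger signal of the two.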

DeNeutoy · 2019-08-28T18:23:08Z

allennlp/models/masked_language_model.py

+            contextual_embeddings = embeddings
+
+        batch_index = torch.arange(0, batch_size).long().unsqueeze(1)
+        mask_embeddings = contextual_embeddings[batch_index, mask_positions]


Could do with a comment that the mask_positions are the ones we actually want.
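
To make the indexing in that hunk concrete, here is a small sketch with made-up shapes. Only the `batch_index` and `mask_embeddings` lines come from the diff. The tensor sizes, the values, and the contents of `mask_positions` are assumptions chosen for illustration.

```python
import torch

batch_size, sequence_length, embedding_dim = 2, 4, 3

# Stand-in for the encoder output, shape (batch_size, sequence_length, embedding_dim).
contextual_embeddings = torch.arange(
    batch_size * sequence_length * embedding_dim, dtype=torch.float
).view(batch_size, sequence_length, embedding_dim)

# Assumed positions of the masked tokens, shape (batch_size, num_masks).
mask_positions = torch.tensor([[1, 3], [0, 2]])

# Shape (batch_size, 1); it broadcasts against mask_positions so that row i of
# mask_positions selects positions from batch element i only.
batch_index = torch.arange(0, batch_size).long().unsqueeze(1)

# Shape (batch_size, num_masks, embedding_dim): one vector per masked position.
mask_embeddings = contextual_embeddings[batch_index, mask_positions]
```

In this sketch `mask_embeddings[0, 1]` is the vector at position 3 of the first sequence, so only the embeddings at the masked positions are kept.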

matt-gardner · 2019-08-28T18:38:52Z

It is precisely because of these discussions about what belongs in the library and what doesn't that I suggested splitting out the models entirely. As you know, I agree with this. But we're not set up for that yet, and actually setting things up so that we are, and still get testing and CI right and everything else, is going to take a while. We want to launch AllenNLP Interpret very soon (camera ready deadline is Friday), and so I want this merged now, before having to figure out how everything will work once we split out things into separate repos.

DeNeutoy · 2019-08-28T18:46:30Z

Thanks, "camera ready deadline is Friday" was the piece of the puzzle I was missing.

pidugusundeep · 2019-10-29T10:01:28Z

@DeNeutoy is there a tutorial explaining how to use these models?

…t token LM) (allenai#3201) * Models, tests, and doc * test fixtures * models/__init__.py * Fix test * docstrings * pylint, other cleanup * mypy; more cleanup * fix imports * change runtime errors to logger.error * pylint * Add comment

matt-gardner mentioned this pull request Aug 26, 2019

Predictors for demo LMs, update for coref predictor #3202

Merged

matt-gardner added 7 commits August 27, 2019 13:26

Models, tests, and doc

96a24a8

test fixtures

6dcde9c

models/__init__.py

1771f92

Fix test

0a56709

docstrings

a75f6e7

pylint, other cleanup

77323bd

mypy; more cleanup

6f88e6a

matt-gardner force-pushed the new_language_models branch from 98ced63 to 6f88e6a Compare August 27, 2019 20:27

matt-gardner requested a review from DeNeutoy August 27, 2019 20:38

matt-gardner mentioned this pull request Aug 27, 2019

Targeted hotflip attacks and beam search for input reduction #3206

Merged

matt-gardner added 3 commits August 27, 2019 14:35

fix imports

11e038b

change runtime errors to logger.error

cea7ea0

pylint

810f9cf

DeNeutoy approved these changes Aug 28, 2019

Add comment

149e6fd

matt-gardner merged commit d78ac70 into allenai:master Aug 28, 2019

matt-gardner deleted the new_language_models branch August 28, 2019 18:46
