This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

TypeError in forward() in language model #3324

Closed
geoffbacon opened this issue Oct 4, 2019 · 3 comments

@geoffbacon

Describe the bug
I'm trying to train language models using the built-in LanguageModelingReader and LanguageModel. I get TypeError: forward() got an unexpected keyword argument 'input_tokens', even though the LanguageModelingReader yields Instances with 'input_tokens' Fields (see here).

The full stack trace is:

2019-10-04 09:24:00,362 - INFO - allennlp.training.trainer - Training
  0%|          | 0/34685 [00:00<?, ?it/s]Traceback (most recent call last):
  File "/Users/bacon/miniconda/bin/allennlp", line 10, in <module>
    sys.exit(run())
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/run.py", line 18, in run
    main(prog="allennlp")
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/commands/__init__.py", line 102, in main
    args.func(args)
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/commands/train.py", line 124, in train_model_from_args
    args.cache_prefix)
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/commands/train.py", line 168, in train_model_from_file
    cache_directory, cache_prefix)
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/commands/train.py", line 252, in train_model
    metrics = trainer.train()
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/training/trainer.py", line 478, in train
    train_metrics = self._train_epoch(epoch)
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/training/trainer.py", line 320, in _train_epoch
    loss = self.batch_loss(batch_group, for_training=True)
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/allennlp/training/trainer.py", line 261, in batch_loss
    output_dict = self.model(**batch)
  File "/Users/bacon/miniconda/lib/python3.7/site-packages/torch/nn/modules/module.py", line 547, in __call__
    result = self.forward(*input, **kwargs)
TypeError: forward() got an unexpected keyword argument 'input_tokens'

To Reproduce

  1. My config file is:
{
    "dataset_reader": {
        "type": "language_modeling",
        "tokenizer": {
            "type": "word",
            "word_splitter": {
                "type": "just_spaces"
            }
        },
        "token_indexers": {
            "input_tokens": {
                "type": "single_id"
            }
        },
        "lazy": false
    },
    "train_data_path": "corpus.txt",
    "model": {
        "type": "language_model",
        "text_field_embedder": {
            "type": "basic",
            "token_embedders": {
                "input_tokens": {
                    "type": "embedding",
                    "embedding_dim": 100
                }
            }
        },
        "contextualizer": {
            "type": "lstm",
            "input_size": 100,
            "hidden_size": 200
        },
        "dropout": 0.2,
        "initializer": {},
        "regularizer": {}
    },
    "iterator": {
        "type": "basic"
    },
    "trainer": {
        "optimizer": {
            "type": "adam"
        },
        "patience": 2,
        "num_epochs": 10
    }
}

which was lightly hand-edited from the output of the configuration wizard.

  2. Run `allennlp train config.json -s model`, with a corpus in "corpus.txt" and an empty directory "model". As you can see from the traceback, everything up until the actual training works fine, but then it produces the error above.

Expected behavior
I would have expected that naming the token_indexer and token_embedder "input_tokens" would be all you need to do in order to use the out-of-the-box model with the out-of-the-box dataset reader.

System (please complete the following information):

  • OS: OSX
  • Python version: 3.7.4
  • AllenNLP version: v0.9.0
  • PyTorch version: v1.2.0 from AllenNLP

Additional context
This issue is very similar to the discussion in #2528. I googled around and checked StackOverflow and other related issues in allennlp, but did not find a solution. The documentation and tutorial do make it very clear that "The forward method expects dicts of tensors as input, and it expects their names to be the names of the fields in your Instance," but in this case I think I'm doing exactly that and I'm still getting the error.
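
To make the mismatch concrete, here is a minimal standalone snippet (plain PyTorch, not AllenNLP internals; the class and tensor are just placeholders) that raises the same error whenever the batch key does not match the forward() parameter name:

import torch

class ToyModel(torch.nn.Module):
    # The model's forward() declares some other parameter name...
    def forward(self, tokens):
        return {"loss": tokens.float().mean()}

model = ToyModel()
# ...but the batch dict is keyed by the Instance's field name, "input_tokens".
batch = {"input_tokens": torch.zeros(2, 5, dtype=torch.long)}
model(**batch)  # TypeError: forward() got an unexpected keyword argument 'input_tokens'

That model(**batch) call is the same pattern as the output_dict = self.model(**batch) line in the traceback.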

@brendan-ai2
Contributor

As you can see from https://github.com/allenai/allennlp/blob/master/allennlp/models/language_model.py#L248, the forward method of the language model actually wants an argument called `source`. The confusion here is the result of us having multiple dataset readers for the LM task. You actually need to use the one named `simple_language_modeling`, which you can view at https://github.com/allenai/allennlp/blob/master/allennlp/data/dataset_readers/simple_language_modeling.py.

Sorry for the confusion! I'm going to deprecate/delete the reader at https://github.com/allenai/allennlp/blob/master/allennlp/data/dataset_readers/language_modeling.py, as it's not compatible with any models we currently have.
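
For concreteness, here is a rough Python sketch of the reader I mean (constructor argument names from memory, so double-check them against the file linked above); in your JSON config the equivalent change is just swapping the dataset_reader type and keeping the token_indexers and token_embedders keys consistent with each other:

# Sketch only -- verify the names against simple_language_modeling.py linked above.
from allennlp.data.dataset_readers import SimpleLanguageModelingDatasetReader
from allennlp.data.token_indexers import SingleIdTokenIndexer
from allennlp.data.tokenizers import WordTokenizer
from allennlp.data.tokenizers.word_splitter import JustSpacesWordSplitter

# As I recall, this reader names its text field "source", which lines up with
# the `source` argument that LanguageModel.forward() wants.
reader = SimpleLanguageModelingDatasetReader(
    tokenizer=WordTokenizer(word_splitter=JustSpacesWordSplitter()),
    token_indexers={"tokens": SingleIdTokenIndexer()},
)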

@brendan-ai2
Contributor

Also, a working config for LanguageModel, which you can tweak as desired, can be found here: https://github.com/allenai/allennlp/blob/master/training_config/bidirectional_language_model.jsonnet.

@geoffbacon
Author

Thanks for your quick reply! Switching to the `simple_language_modeling` dataset reader fixed this.

brendan-ai2 added a commit that referenced this issue Dec 14, 2019
- This is confusing users who really need `SimpleLanguageModelingDatasetReader`.
- For #3324