# Lab4a Named-entity-recognition using fine-tuned transformers

Copyright: Vrije Universiteit Amsterdam, Faculty of Humanities, CLTL

Before reading this notebook, make sure you have consulted **Lab3.4 SentimentClassification using transformer models**, which contains some disclaimers and tips and explains the sentence representations obtained from the transformer models.

In this notebook we will use the simpletransformers package, which provides a simple API on top of the transformers package.

In [1]:
#Requires installing transformers, pytorch and simpletransformers
#!conda install pytorch cpuonly -c pytorch
#!pip install transformers
#!pip install simpletransformers

We load the transformer model 'bert-base-NER' from the Hugging Face repository, which is fine-tuned for named-entity recognition:

https://huggingface.co/models

We need to load both the model, which classifies each token in the sequence, and the tokenizer, which converts the sentences into tokens according to the vocabulary of the model.
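
To make the tokenizer's role more concrete, here is a minimal sketch of the greedy longest-match subword splitting that WordPiece-style tokenizers such as BERT's perform. The function `wordpiece_tokenize` and the tiny `toy_vocab` are illustrative assumptions: the real tokenizer uses the model's own vocabulary file and additional normalisation steps that are not shown here.

```python
def wordpiece_tokenize(word, vocab, unk_token="[UNK]"):
    """Greedy longest-match-first split of one word into subword pieces."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        current = None
        # Try the longest remaining substring first, then shrink it.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                # Pieces that continue a word are marked with "##".
                candidate = "##" + candidate
            if candidate in vocab:
                current = candidate
                break
            end -= 1
        if current is None:
            # No piece matched: the whole word maps to the unknown token.
            return [unk_token]
        pieces.append(current)
        start = end
    return pieces


# Toy vocabulary (hypothetical, NOT the vocabulary of bert-base-NER).
toy_vocab = {"patent", "##s", "sued", "Sam", "##sung"}

words = ["sued", "patents", "Samsung", "Apple"]
tokens = [wordpiece_tokenize(w, toy_vocab) for w in words]
print(tokens)
```

With this toy vocabulary, a word that is not listed as a whole is split into known pieces, and a word with no matching pieces becomes the unknown token. The model predicts over these pieces, not over the original whitespace-separated words.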

Loading the model takes some time and requires sufficient memory.

In [2]:
from simpletransformers.ner import NERModel
#sentences = ["Example sentence 1", "Example sentence 2"]
englishmodel = NERModel(
        model_type="bert",
        model_name="dslim/bert-base-NER",
        use_cuda=False
)

The cell above creates an instance of NERModel that can be used for training, evaluation, and prediction in named-entity recognition (NER) tasks. The main parameters of a NERModel object are:

* model_type: The type of model (bert, roberta)
* model_name: Default Transformer model name or path to a directory containing the Transformer model file (pytorch_model.bin).
* labels (optional): A list of all Named Entity labels. If not given, [“O”, “B-MISC”, “I-MISC”, “B-PER”, “I-PER”, “B-ORG”, “I-ORG”, “B-LOC”, “I-LOC”] will be used.
* args (optional): Default args will be used if this parameter is not provided. If provided, it should be a dict containing the args that should be changed in the default args.
* use_cuda (optional): Use GPU if available. Setting to False will force model to use CPU only.
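
As an illustration of how the optional parameters fit together, the sketch below collects them into keyword arguments for `NERModel`. The helper `make_ner_kwargs` is hypothetical and exists only for this example. The value `max_seq_length=128` is just a sample setting for `args`. The model construction itself is left commented out so that the snippet runs without downloading anything.

```python
# Label list in the default order quoted in the parameter list above.
DEFAULT_LABELS = [
    "O", "B-MISC", "I-MISC", "B-PER", "I-PER",
    "B-ORG", "I-ORG", "B-LOC", "I-LOC",
]


def make_ner_kwargs(model_type, model_name, labels=None, args=None, use_cuda=False):
    """Collect NERModel keyword arguments, omitting optional ones left unset."""
    kwargs = {
        "model_type": model_type,
        "model_name": model_name,
        "use_cuda": use_cuda,
    }
    if labels is not None:
        kwargs["labels"] = list(labels)
    if args is not None:
        # Only the args that should differ from the defaults are listed.
        kwargs["args"] = dict(args)
    return kwargs


ner_kwargs = make_ner_kwargs(
    "bert",
    "dslim/bert-base-NER",
    labels=DEFAULT_LABELS,
    args={"max_seq_length": 128},  # sample override, chosen for illustration
)
print(ner_kwargs)

# Uncomment to build the model (downloads the weights on first use):
# from simpletransformers.ner import NERModel
# model = NERModel(**ner_kwargs)
```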

In [3]:
predictions, raw_outputs = englishmodel.predict(["Apple sued Samsung for patents last year."])

In [4]:
predictions

[[{'Apple': 'B-ORG'},
  {'sued': 'O'},
  {'Samsung': 'B-ORG'},
  {'for': 'O'},
  {'patents': 'O'},
  {'last': 'O'},
  {'year.': 'O'}]]
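
The predictions are returned per sentence as a list of single-entry dictionaries that map each word to its tag. A downstream step often needs whole entity mentions and not per-word tags. The sketch below shows one possible way to group the B-/I- tags into mentions; `extract_entities` is a hypothetical helper written for this example, not part of simpletransformers.

```python
def extract_entities(sentence_prediction):
    """Group word-level B-/I- tags into (mention, entity type) pairs."""
    entities = []
    current_words, current_type = [], None
    for token in sentence_prediction:
        # Each token is a dict with exactly one word -> tag entry.
        (word, tag), = token.items()
        if tag == "O":
            prefix, etype = "O", None
        else:
            prefix, etype = tag.split("-", 1)
        # A B- tag, or an I- tag of a different type, starts a new mention.
        starts_new = prefix == "B" or (prefix == "I" and etype != current_type)
        if current_words and (prefix == "O" or starts_new):
            entities.append((" ".join(current_words), current_type))
            current_words, current_type = [], None
        if prefix != "O":
            current_words.append(word)
            current_type = etype
    if current_words:
        # Flush a mention that runs to the end of the sentence.
        entities.append((" ".join(current_words), current_type))
    return entities


# The prediction shown above, copied as literal data.
example = [
    {"Apple": "B-ORG"}, {"sued": "O"}, {"Samsung": "B-ORG"}, {"for": "O"},
    {"patents": "O"}, {"last": "O"}, {"year.": "O"},
]
print(extract_entities(example))
# [('Apple', 'ORG'), ('Samsung', 'ORG')]
```

Applied to the notebook's own output, the same call would be `[extract_entities(sentence) for sentence in predictions]`.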

In [6]:
dutchmodel = NERModel(
        model_type="bert",
        model_name="wietsedv/bert-base-dutch-cased-finetuned-conll2002-ner",
        use_cuda=False
)
# NERC models released by the same author (the first is the one loaded above)
#wietsedv/bert-base-dutch-cased-finetuned-conll2002-ner
#wietsedv/bert-base-multilingual-cased-finetuned-conll2002-ner
#wietsedv/bert-base-dutch-cased-finetuned-sonar-ner

In [7]:
predictions, raw_outputs = dutchmodel.predict(["Apple sleept Samsumg voor de rechter vanwege schending van patenten."])

In [8]:
predictions

[[{'Apple': 'I-MISC'},
  {'sleept': 'I-LOC'},
  {'Samsumg': 'B-PER'},
  {'voor': 'I-LOC'},
  {'de': 'I-LOC'},
  {'rechter': 'I-LOC'},
  {'vanwege': 'I-LOC'},
  {'schending': 'I-LOC'},
  {'van': 'I-LOC'},
  {'patenten.': 'I-LOC'}]]
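
These Dutch tags look implausible, since nearly every word is labelled I-LOC. One possible cause, which this notebook does not verify, is that the model's class ids are decoded with the default `labels` list while the model was trained with a different label order. The sketch below shows how a label list could be derived from an `id2label` mapping such as the one stored in a Hugging Face model configuration. The helper `labels_in_id_order` and the mapping `hypothetical_id2label` are made up for illustration and are not the real mapping of the Dutch model.

```python
def labels_in_id_order(id2label):
    """Return the label strings sorted by their integer class id."""
    # Keys may be ints or strings (as in a JSON config), so normalise them.
    pairs = sorted((int(class_id), label) for class_id, label in id2label.items())
    return [label for _, label in pairs]


# Hypothetical mapping for illustration only. A real one could be read with
# transformers.AutoConfig.from_pretrained(model_name).id2label
hypothetical_id2label = {"0": "B-LOC", "1": "B-PER", "2": "I-LOC", "3": "I-PER", "4": "O"}
labels = labels_in_id_order(hypothetical_id2label)

# First entries of the default label list quoted earlier in this notebook.
default_prefix = ["O", "B-MISC", "I-MISC", "B-PER", "I-PER"]

# Class ids that would be decoded to the wrong tag under the default order.
mismatched_ids = [i for i, (a, b) in enumerate(zip(labels, default_prefix)) if a != b]
print(labels)
print(mismatched_ids)
```

If the orders differ for a given model, passing the derived list through the `labels` parameter of `NERModel` is one thing to try.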

# End of this notebook