BERT / ELMO embeddings for NER #22

Closed · RyanDsilva opened this issue on Oct 20, 2019 · 7 comments
Labels: enhancement (New feature or request)

@RyanDsilva commented Oct 20, 2019

When are pretrained BERT and ELMo embeddings for NER planned?
Could you help me with the development process? I could try to contribute these features.

@amaiya added the enhancement label on Oct 20, 2019
@amaiya (Owner) commented Oct 20, 2019

This sounds like a good first issue if you want to take a crack at it. I'm happy to answer any questions about the development process.

One way to implement this might be to leverage TensorFlow Hub to generate the embeddings dynamically as sentences are transformed for input into the neural network.
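For illustration, here is a minimal sketch of that idea, assuming the TF1-style tensorflow_hub API and the public ELMo module on TF Hub (the module URL, sample sentences, and shapes are assumptions for this sketch, not ktrain code):

# sketch: contextual ELMo token embeddings via TensorFlow Hub (TF1-style API)
import tensorflow as tf
import tensorflow_hub as hub

elmo = hub.Module("https://tfhub.dev/google/elmo/3", trainable=False)

# each input is a space-separated sentence string; the "elmo" output is a
# (batch, max_tokens, 1024) tensor of contextual token embeddings
sentences = ["John lives in New York", "ktrain is a wrapper for Keras"]
embeddings = elmo(sentences, signature="default", as_dict=True)["elmo"]

with tf.Session() as sess:
    sess.run([tf.global_variables_initializer(), tf.tables_initializer()])
    vectors = sess.run(embeddings)  # numpy array usable as tagger features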

@RyanDsilva (Author)

Great! I'd definitely like to try this! Taking this up. Thanks @amaiya

@code4kunal commented Feb 7, 2020

Hey @amaiya, thanks a lot for this great work to date. This package has been very intuitive and helpful for most tasks. I have tested BERT with text classification, and it worked like a charm.

When can we expect NER tasks to be supported with BERT and DistilBERT embeddings?

Best Regards
Kunal

@amaiya (Owner) commented Feb 11, 2020

@code4kunal Thanks for your comments.

The user above volunteered to look into this a couple of months ago, but I don't know where it stands. The original idea was to use TensorFlow Hub for this. However, now that the Hugging Face transformers library supports TensorFlow 2, it would probably make more sense to generate the embeddings using the transformers library in ktrain. This is still on the TODO list, which is why this issue is open, but I don't have an exact timeframe, unfortunately. Thanks again.
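As a rough sketch of that transformers-based approach (the model name, tokenization call, and output indexing below are assumptions about that library's TF2 API, not ktrain's actual implementation):

# sketch: contextual BERT token embeddings with Hugging Face transformers (TF2)
from transformers import BertTokenizer, TFBertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-cased")
model = TFBertModel.from_pretrained("bert-base-cased")

inputs = tokenizer.encode_plus("John lives in New York", return_tensors="tf")
outputs = model(inputs)

# the first output is the sequence of hidden states: one vector per wordpiece
# token, shape (1, seq_len, 768) for bert-base models; these can be fed to a
# downstream sequence-labeling layer such as a BiLSTM
token_embeddings = outputs[0]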

@mdavis95

This might help someone:
huggingface/transformers#1950 (comment)

@amaiya (Owner) commented Mar 3, 2020

@mdavis95 Thanks - it shouldn't be too difficult to incorporate this for sequence-labeling.

@amaiya (Owner) commented Mar 30, 2020

As of v0.12.x of ktrain, BERT and ELMo embeddings for downstream tasks like NER are now supported:

# English NER with BERT embeddings
import ktrain
from ktrain import text

# load a CoNLL-2003-formatted dataset (filenames here are placeholders)
(trn, val, preproc) = text.entities_from_conll2003('train.txt', val_filepath='valid.txt')

# build a BiLSTM sequence tagger fed by BERT embeddings
model = text.sequence_tagger('bilstm-bert', preproc, bert_model='bert-base-cased')
learner = ktrain.get_learner(model, train_data=trn, val_data=val, batch_size=128)

# train: 2 cycles of 5 epochs each at a learning rate of 0.01
learner.fit(0.01, 2, cycle_len=5)
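Once trained, the model can be wrapped in a predictor for inference on new text; a short usage sketch (the sample sentence is illustrative):

# tag entities in new sentences with the trained model
predictor = ktrain.get_predictor(learner.model, preproc)
predictor.predict('Paul Newman is my favorite actor.')  # returns (token, tag) pairs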

@amaiya closed this as completed on Mar 30, 2020