
: Idiomfier as an NER tagger #15

Closed
1 of 6 tasks
Tracked by #4
eubinecto opened this issue Mar 12, 2022 · 1 comment

Comments


eubinecto commented Mar 12, 2022

How?

First of all, let's try the baseline approach: just a simple linear layer on top of the encoder. Performance is not what matters right now.

Could we use BART for this? We could, but BART is an auto-regressive model: when processing a token, it cannot attend to future tokens. BERT, which encodes bidirectional context, is a better choice than BART for token-level tagging.
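To make the baseline concrete, here is a minimal dependency-free sketch of the idea: one linear layer, shared across all token positions, maps each token's contextual embedding to tag scores. The dimensions, tag names, and function names here are illustrative, not taken from the repo.

```python
import random

random.seed(0)

# Illustrative sizes: 4-dim "contextual embeddings", 3 BIO-style tags.
HIDDEN, NUM_TAGS = 4, 3
TAGS = ["O", "B-IDIOM", "I-IDIOM"]

# One weight matrix and bias, shared across every token position.
W = [[random.uniform(-1, 1) for _ in range(NUM_TAGS)] for _ in range(HIDDEN)]
b = [0.0] * NUM_TAGS

def classify_tokens(hidden_states):
    """Apply the same linear layer to each token's hidden vector,
    then take the argmax tag per token."""
    preds = []
    for h in hidden_states:
        scores = [sum(h[i] * W[i][j] for i in range(HIDDEN)) + b[j]
                  for j in range(NUM_TAGS)]
        preds.append(TAGS[scores.index(max(scores))])
    return preds

# Three tokens, each with a toy 4-dim embedding (in practice these
# would come from BERT's last hidden state).
sentence = [[0.1, 0.2, -0.3, 0.5],
            [0.9, -0.1, 0.4, 0.0],
            [-0.2, 0.3, 0.1, 0.7]]
print(classify_tokens(sentence))  # one tag per token
```

In a real implementation this is exactly what `BertForTokenClassification` does: BERT's per-token hidden states feed a single `nn.Linear(hidden_size, num_labels)` head.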

To-do's

  • delete tokenizer-related fetchers and paths
  • change the builders: InputsBuilder
    • make sure the tokens are not split any further than they are now
  • explore_inputs_builder
  • change the builders: LabelsBuilder
  • explore_labels_builder
  • rewrite Idiomifier to learn NER with BERT
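As a sketch of what the `LabelsBuilder` step might produce, here is a hypothetical BIO-labelling function: given the tokens and the token-index span of the idiom, it emits one tag per token. The function name, tag set, and span convention (inclusive start, exclusive end) are assumptions for illustration, not the repo's actual API.

```python
def build_labels(tokens, idiom_span):
    """Hypothetical LabelsBuilder logic: BIO-tag the idiom span.
    idiom_span = (inclusive start index, exclusive end index)."""
    start, end = idiom_span
    labels = []
    for i, _ in enumerate(tokens):
        if i == start:
            labels.append("B-IDIOM")   # first token of the idiom
        elif start < i < end:
            labels.append("I-IDIOM")   # inside the idiom
        else:
            labels.append("O")         # outside
    return labels

tokens = ["He", "kicked", "the", "bucket", "yesterday"]
print(build_labels(tokens, (1, 4)))
# → ['O', 'B-IDIOM', 'I-IDIOM', 'I-IDIOM', 'O']
```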
@eubinecto eubinecto mentioned this issue Mar 12, 2022
@eubinecto eubinecto changed the title : Idiomfier as an NER tagger. : Idiomfier as an NER tagger Mar 22, 2022

eubinecto commented Mar 22, 2022

How should we handle subwords in a token-level classification task?

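A common convention for this (assumed here, not confirmed by the repo) is to keep the word's label on its first subword and mask the continuation subwords with `-100`, the default `ignore_index` of PyTorch's `CrossEntropyLoss`, so they contribute nothing to the loss. A pure-Python sketch, using the `word_ids` mapping that Hugging Face fast tokenizers expose:

```python
IGNORE = -100  # PyTorch CrossEntropyLoss's default ignore_index

def align_labels(word_ids, word_labels):
    """Map word-level labels onto subword tokens: label the first
    subword of each word, mask special tokens and continuations."""
    labels, prev = [], None
    for wid in word_ids:
        if wid is None:            # special tokens like [CLS]/[SEP]
            labels.append(IGNORE)
        elif wid != prev:          # first subword of a word
            labels.append(word_labels[wid])
        else:                      # continuation subword (##...)
            labels.append(IGNORE)
        prev = wid
    return labels

# Suppose "kicked" is split into "kick", "##ed"; word_ids would come
# from tokenizer(..., is_split_into_words=True).word_ids().
word_ids = [None, 0, 1, 1, 2, 3, None]
word_labels = [0, 1, 2, 2]         # O, B-IDIOM, I-IDIOM, I-IDIOM
print(align_labels(word_ids, word_labels))
# → [-100, 0, 1, -100, 2, 2, -100]
```

Another option is to copy the word's label onto every subword; masking continuations is the simpler choice since only one prediction per word is needed at inference time.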

eubinecto added a commit that referenced this issue Apr 10, 2022