bert-ner-cmv

Code for paper: Exploring Cross-sentence Contexts for Named Entity Recognition with BERT https://aclanthology.org/2020.coling-main.78.pdf

Dependencies:

bert: tokenization.py (added as bert_tokenization.py to this project. FullTokenizer is used instead of keras-bert tokenizer)

keras-bert (https://pypi.org/project/keras-bert/)

Pretrained BERT model, e.g. from:

input data e.g. from:

Input data is expected to be in CONLL:ish format where Token and Tag are tab separated. First string on the line corresponds to Token and second string to Tag

Quickstart

Get pretrained models and data

./scripts/get-models.sh
./scripts/get-finer.sh
./scripts/get-turku-ner.sh

Experiment on Turku NER corpus data (run-turku-ner.sh trains, use different input file and '--use_ner_model' for predicting )

./scripts/run-turku-ner.sh

Run an experiment on FiNER news data

./scripts/run-finer-news.sh

If in a Slurm environment, edit scripts/slurm-run.sh to match your setup and run

sbatch scripts/slurm-run.sh scripts/run-finer-news.sh

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
bert @ eedf571		bert @ eedf571
output		output
results		results
scripts		scripts
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
bert_tokenization.py		bert_tokenization.py
common.py		common.py
compare.py		compare.py
config.py		config.py
conlleval.py		conlleval.py
ner.py		ner.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bert @ eedf571

bert @ eedf571

output

output

results

results

scripts

scripts

.gitmodules

.gitmodules

LICENSE

LICENSE

README.md

README.md

bert_tokenization.py

bert_tokenization.py

common.py

common.py

compare.py

compare.py

config.py

config.py

conlleval.py

conlleval.py

ner.py

ner.py

Repository files navigation

bert-ner-cmv

Dependencies:

Quickstart

About

Releases

Packages

Languages

License

jouniluoma/bert-ner-cmv

Folders and files

Latest commit

History

Repository files navigation

bert-ner-cmv

Dependencies:

Quickstart

About

Resources

License

Stars

Watchers

Forks

Languages