Skip to content

DWIE-FR : A new French dataset annotated with named entities

License

Notifications You must be signed in to change notification settings

Emvista/DWIE-FR

Repository files navigation

DWIE-FR

DWIE-FR : A new French dataset annotated with named entities

In this repository, you will find a folder preprocessing to reproduce our work about the French DWIE corpus. In the second folder, experiments you will find our scripts to train and evaluate our models on DWIE-FR dataset using Flair framework.

Download DWIE-FR dataset

link: https://drive.google.com/drive/folders/1A4Q8B3VIo64Iz5pto4_3csk_5eTwZ09C?usp=share_link

Reproducing our works

Preprocessing

Packages installation

  • python=3.10
conda env create --file environment.yml

Run preprocessing

cd preprocessing
./run.sh

Training Flair models on DWIE-FR

Packages installation

  • python=3.10
conda env create --file environment.yml

Run Training

cd experiments
./scripts/run_onto_dwie.sh

Citations

Merci d'utiliser la référence suivante lorsque vous faites mention de ce projet : Sylvain Verdy, Maxime Prieur, Guillaume Gadek, Cédric Lopez (2023) DWIE-FR : Un nouveau jeu de données annoté en entités nommées pour le français. Actes de la conférence TALN’23, Paris, 2023, à paraître.

About

DWIE-FR : A new French dataset annotated with named entities

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published