Skip to content

mirfan899/Hindi-NER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hindi-NER

This is an example project to let you use your dataset and publish it on hugginface.

The dataset used in this project is IJNLP 2008 Hindi dataset. I have converted to word tag format, separated by tab.

Now split dataset into train, test and validation. You can choose whatever percentange you want for your dataset.

python split_hindi.py

Now you can publish your dataset by adding token and your account info in publish_hindi.py script.

About

Code to publish NER dataset to Huggingface

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages