Skip to content

A script for collecting the United Nations Digital Library dataset in a language modelling friendly format.

License

Notifications You must be signed in to change notification settings

cfoster0/pile_united_nations

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

pile_united_nations

A script for collecting the United Nations Digital Library dataset in a language modelling friendly format.

To run, do:

git clone https://github.com/cfoster0/pile_united_nations.git
cd pile_uspto
virtualenv env
. env/bin/activate
pip install -r requirements.txt
python main.py

About

A script for collecting the United Nations Digital Library dataset in a language modelling friendly format.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages