jtourille/oscar-corpus-downloader

OSCAR Corpus Downloader

Simple tool to download the OSCAR corpus.

1. Installation

Install the package from PyPI with pip:

$ pip install oscar-corpus-downloader

2. Usage

Submit an OSCAR access request following the procedure described on the project page.

Once you have received your credentials, export them as environment variables and use the command-line interface to download a part of the OSCAR corpus.

$ export OSCAR_USERNAME=username
$ export OSCAR_PASSWORD=password
$ oscar download --help
Usage: oscar download [OPTIONS]

Options:
  -u, --url TEXT         OSCAR corpus url  [required]
  -o, --output-dir TEXT  Output directory  [required]
  --resume               Resume download
  --help                 Show this message and exit.

$ oscar download \
  --url https://oscar-prive.huma-num.fr/2301/fr_meta \
  -o ./oscar-fr
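
The steps above can be wrapped in a small shell function. This is only a sketch: `oscar_fetch` is a hypothetical helper name, not part of the package, and it assumes that the CLI reads `OSCAR_USERNAME` and `OSCAR_PASSWORD` from the environment (as the exports above suggest) and that an existing output directory means an earlier run was interrupted and can be continued with `--resume`.

```shell
# Hypothetical helper (not part of oscar-corpus-downloader):
# check that credentials are set, then start or resume a download.
oscar_fetch() {
  oscar_fetch_url="$1"
  oscar_fetch_dir="$2"

  # Assumption: the CLI picks up credentials from these two variables.
  if [ -z "${OSCAR_USERNAME:-}" ] || [ -z "${OSCAR_PASSWORD:-}" ]; then
    echo "OSCAR_USERNAME and OSCAR_PASSWORD must be set" >&2
    return 1
  fi

  # Assumption: an existing output directory comes from an interrupted run,
  # so --resume is passed only in that case.
  if [ -d "$oscar_fetch_dir" ]; then
    oscar download --url "$oscar_fetch_url" --output-dir "$oscar_fetch_dir" --resume
  else
    oscar download --url "$oscar_fetch_url" --output-dir "$oscar_fetch_dir"
  fi
}
```

After sourcing the function, `oscar_fetch https://oscar-prive.huma-num.fr/2301/fr_meta ./oscar-fr` would run the same download as the example above, and running it again after an interruption would add `--resume`.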
