Stable hosting (+long-term archiving) of preprocessed data sets #227

Open
alexis-michaud opened this issue Mar 1, 2020 · 1 comment

Comments

@alexis-michaud

Having preprocessed data sets at hand matters a lot for easier experimenting. Links to online data can break, as happened for Persephone-related materials: #226. That issue was fixed quickly, but in the medium and long term the answer lies in stable hosting (+long-term archiving) of preprocessed data sets.

Some data sets preprocessed by @gw17 for experiments in 2020 are up here:
https://github.com/gw17/sltu_corpora

It's fine to have those in different places, hopefully with some sort of inventory somewhere (in Wiki mode?). Or could the Persephone / Elpis team also offer hosting solutions?

@oadams
Collaborator

oadams commented Mar 20, 2020

I agree it'd be good to think about something long term. I'm pretty open to where we host such things. You're a beacon of light when it comes to making data available in a stable way for sharing, so I'll defer to your best judgment on this!
