License #102
Comments
The different resources in It'll be great to package
It wouldn't be in I see now that the
This remains a problem for distributions packaging nltk. Looking at https://www.nltk.org/nltk_data/, many of the entries have a blank licence/copyright field. Would it be possible for nltk to construct a free/libre dataset which can be safely redistributed? Thanks.
Many of the NLTK data resources themselves include licensing, copyright or README files with additional information on the extent to which the data may be distributed. Perhaps that will help somewhat.
I did end up untarring the whole lot and taking a look, but many of them either had no README (etc.) or, if they did have one, it indicated they were proprietary.
For the record, I'm removing NLTK from Gentoo because of this. IANAL, but it looks like many of the corpora shouldn't be redistributed as part of nltk_data in the first place, and letting NLTK download them puts users at risk of copyright violation.
Can you clarify what license the nltk_data files are under? Is it the same license as nltk? Do the various data files have different licenses?

conda-forge would like to begin packaging nltk_data, because a few users have requested it (to make installing more uniform / track versioning / etc; conda-forge/staged-recipes#4463), but we'd need to know the license first.