-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
As an admin, I want a way to reproducibly generate a full-text corpus of all public PPA content in order to support computational research on PPA materials. #556
Comments
Developer steps before acceptance testing:
Acceptance testing checklist:
Things to recheck after development:
|
Skipped test for works with no pages (id: uga1.32108002998303) because suppressed during staging set up. |
Moving additional testing to the related bug issues. any changes where we need to test this script should be batched due to testing effort. |
all tests passed!! 🎊 |
rlskoeser
changed the title
Adapt Vineet's script to export plain text corpus
As an admin, I want a way to reproducibly generate a full-text corpus of all public PPA content in order to support computational research on PPA materials.
Jan 9, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Adapt Vineet's script to export plain text corpus
The text was updated successfully, but these errors were encountered: