Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Thanks for sharing, I've been meaning to look at beautiful soup for a while.
Looks like there is a typo on line 50 of collect_texts.py
Added requirements.txt to capture dependencies to easily install via
pip install -r requirements.txt
especially convenient if using conda environments to avoid cluttering main python env.This also required
liblept5
andfirefox-geckodriver
installed via apt on ubuntu before anything would run. Maybe I'll try to document the full set via aDockerfile
as I'm sure there are other dependencies I already had installed.There appear to be other issues I'm struggling through, not sure if user error, documentation, or something other. I'll try to grok things a bit better so I can articulate the other issues and either create issues or PRs.