You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
current/toolkit-walkthrough.md
109: .keepValidPagesDF()
121:- tells it only to keep the
122: "[valid](https://github.com/archivesunleashed/aut-docs/blob/master/aut-0.50.0/filters.md#keep-valid-pages)"
178: .keepValidPagesDF()
205: .keepDomainsDF(domains)
237: .keepDomainsDF(domains)
260:in and around the `.keepDomainsDF` line. Check out the
266:- **Keep URL Patterns**: Instead of domains, what if you wanted to have text
267: relating to just a certain pattern? Substitute `.keepDomainsDF` for a command
269: `.keepUrlPatternsDF(Set("(?i)http://geocities.com/EnchantedForest/.*".r))`
271: following command after `.webpages()`: `.keepDateDF(List("2006"), "YYYY")`
273: `.keepDomainsDF` add a new line: `.keepLanguagesDF(Set("fr"))`.
286: .keepDomainsDF(domains)
287: .keepLanguagesDF(languages)
300: .keepDateDF(List("2006"), "YYYY")
369: .keepValidPages()
379:By now this should be seeming pretty straightforward! (remember to keep using
The text was updated successfully, but these errors were encountered:
Looks like we still have some keep/discard examples with DataFrames that are still in the docs, and should have been taken care of in #48
The text was updated successfully, but these errors were encountered: