Commit
Stop doing this fancy-schmancy parsing of the page text in order to get the sections dynamically. Instead, pull the cached HTML and read that. This cuts the run-time from 40 minutes to about 6 minutes. \o/

Also, exclude redirects in the form of "#REDIRECT [[Foo|Bar]]". These are technically valid, but wildly silly and should be fixed in one or two sweeps. This could be fixed in the database population script as well, but these cases are rare enough that it probably isn't worth it.
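The diff itself is not shown here, so the following is only a minimal sketch of the two ideas in the message, not the code from this commit. It assumes that the cached HTML marks section headings as `<span class="mw-headline" id="...">` elements (the markup older MediaWiki versions emit) and that a "piped" redirect is `#REDIRECT` followed by a wikilink containing `|`. The names `is_piped_redirect`, `HeadlineIdParser`, and `section_ids_from_html` are hypothetical.

```python
import re
from html.parser import HTMLParser

# Assumption: a piped redirect looks like "#REDIRECT [[Target|Label]]".
PIPED_REDIRECT_RE = re.compile(
    r"^\s*#REDIRECT\s*:?\s*\[\[[^\]|]+\|[^\]]*\]\]",
    re.IGNORECASE,
)


def is_piped_redirect(wikitext):
    """Return True if the wikitext starts with a redirect whose link has a pipe."""
    return PIPED_REDIRECT_RE.match(wikitext) is not None


class HeadlineIdParser(HTMLParser):
    """Collect the id of every <span class="mw-headline"> in rendered HTML."""

    def __init__(self):
        super().__init__()
        self.section_ids = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        classes = (attr_map.get("class") or "").split()
        # Assumption: section anchors live on spans carrying this class.
        if tag == "span" and "mw-headline" in classes and attr_map.get("id"):
            self.section_ids.append(attr_map["id"])


def section_ids_from_html(html):
    """Return section anchor ids in document order, read from cached HTML."""
    parser = HeadlineIdParser()
    parser.feed(html)
    parser.close()
    return parser.section_ids
```

Reading anchors from the rendered HTML avoids re-deriving them from the wikitext, which is presumably where the speed-up comes from. How the cached HTML is fetched is left out because the message does not say.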
Showing 1 changed file with 12 additions and 22 deletions.