@MatthiasWiesmann
Contributor

Generating all the HTML pages is CPU-intensive and slow (~14 minutes on my laptop).
To reduce this, I added logic to spawn one process per CPU.

This reduces the time to generate the pages to ~8 minutes. The speedup is not linear, because each process needs to load and set up the whole data structure.
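
For readers who want the general shape of the approach, here is a minimal sketch of sharding page generation across one process per CPU. It is not the code in this PR: `shard_of`, `generate_shard`, `generate_all`, the round-robin assignment, and the placeholder rendering step are all hypothetical names and choices made for illustration.

```python
import multiprocessing
import os
from typing import List, Optional


def shard_of(index: int, shard_count: int) -> int:
    """Assign the page at position `index` to a shard, round-robin (assumed scheme)."""
    return index % shard_count


def generate_shard(shard: int, shard_count: int, pages: List[str]) -> List[str]:
    """Render only the pages that belong to `shard` (hypothetical worker).

    In a real generator each worker would first load the full data
    structure, which is the fixed per-process cost that keeps the
    speedup below linear.
    """
    return [
        f"<html>{page}</html>"  # stand-in for the real rendering step
        for index, page in enumerate(pages)
        if shard_of(index, shard_count) == shard
    ]


def generate_all(pages: List[str], shard_count: Optional[int] = None) -> List[str]:
    """Start one worker process per CPU and collect what each one rendered."""
    shard_count = shard_count or os.cpu_count() or 1
    jobs = [(shard, shard_count, pages) for shard in range(shard_count)]
    with multiprocessing.Pool(processes=shard_count) as pool:
        per_shard = pool.starmap(generate_shard, jobs)
    return [html for shard_pages in per_shard for html in shard_pages]


if __name__ == "__main__":
    rendered = generate_all([f"page-{n}" for n in range(10)])
    print(len(rendered))
```

Each worker receives the complete page list and filters it by shard number, so no coordination between processes is needed; the trade-off is that every process pays the full setup cost.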

I also updated the logging classes to add the shard number to each message, so you can see which shard does what.
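
One way to tag log messages with a shard number is the standard library's `logging.LoggerAdapter`. The sketch below assumes that approach; `ShardAdapter`, `make_shard_logger`, the logger name, and the message format are hypothetical and may differ from the logging classes changed in this PR.

```python
import io
import logging


class ShardAdapter(logging.LoggerAdapter):
    """Prefix every message with the shard number (hypothetical helper)."""

    def process(self, msg, kwargs):
        return f"[shard {self.extra['shard']}] {msg}", kwargs


def make_shard_logger(shard: int, stream) -> ShardAdapter:
    """Build a logger for one shard that writes to `stream`."""
    logger = logging.getLogger(f"generate.shard{shard}")  # assumed naming scheme
    logger.setLevel(logging.INFO)
    logger.propagate = False  # avoid duplicate output via the root logger
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("%(levelname)s %(message)s"))
    logger.handlers = [handler]
    return ShardAdapter(logger, {"shard": shard})


# Usage: capture the output in memory to show the resulting format.
buffer = io.StringIO()
log = make_shard_logger(3, buffer)
log.info("rendering %d pages", 42)
print(buffer.getvalue(), end="")  # prints "INFO [shard 3] rendering 42 pages"
```

Because the prefix is added in the adapter, call sites keep using the usual `log.info(...)` calls and do not need to know which shard they run in.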

@MatthiasWiesmann MatthiasWiesmann added the python Pull requests that update Python code label Oct 11, 2024
@MatthiasWiesmann MatthiasWiesmann merged commit a2776a3 into main Oct 14, 2024
@jlndr jlndr deleted the generate_take_4 branch November 20, 2025 11:35
