-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance issues #62
Comments
I don't think so – unless this is something that Apart from this you might find that using a multiprocessing library such as |
If you only care about parsing and if the data you extract is simple enough, you might have better luck with |
What @nathaniel-daniel said, and also it's not unexpected that in that program that parsing would take 97% of the running time of your program, given that your program is primarily parsing (with a little io). As for the 9ms figure, that seems fine – how big are your documents? |
I am trying to parse a large number of HTML documents, and I have noticed that the parsing took most of the time, around 97% of the program. Is there any way to speed up the parsing process?
To give you a perspective, the average parsing time is around 9ms per document.
Code example
The text was updated successfully, but these errors were encountered: