RustHtml
is a small html parser written in rust.
Warning: this library is not production-ready. Many of the tags and standard are not implemented (mainly located in
tag_optimize()
). PRs are welcomed.
Completed & planned features:
- Parse simple html
- Parse html with void elements
- Parse html with javascript
- Parse html with complicated elements (such as html without
head
ending tag, etc)
Run cargo bench
to benchmark the program.
On my local device, parsing a 43833
lines html requires 18.697 ms
to complete
Distributed under the GPL-3.0-Only License. See LICENSE
for more information.