Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Updates Python soup_adapter to use BeautifulSoup 4 #368
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).
Once you've signed, please reply here (e.g.
referenced this pull request
Jun 8, 2017
If you really need BS4 support, I'd recommend Sigil fork: https://github.com/Sigil-Ebook/sigil-gumbo
However, according to my benchmarks, gumbo + BS4 does not offer any speed benefits over BS4 + html5lib, probably because of ctypes bindings overhead. I even tried to write native Python bindings with Pybind11 library but speed gain was not that significant. Anyway, the library seems abandoned, and html5lib is a safe bet if you need a compliant html5 Python parser.