HTMLParser lacking a few features to reconstruct input exactly #70197
Comments
The HTMLParser class (https://docs.python.org/2/library/htmlparser.html) is lacking a few features to reconstruct input exactly. For the most part it can do this, but I found two items where it falls short (there may be others):
Suggested changes:
Sample file attached containing VerbatimParser.
Sample file test1.html attached. When running test2.py on it, the output is identical except for two things:
- test1.html contains <!DAMMIT HTML PUBLIC CRAP>
- test1.html contains end tags that are capitalized, e.g. </P>, or have spaces, e.g. </ goober >
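The end-tag difference can be reproduced with a small sketch. It assumes Python 3's `html.parser` module (the report links the Python 2 `HTMLParser` docs), and `EchoParser` and `echo` are hypothetical names for illustration, not the attached VerbatimParser. Start tags can be echoed verbatim through `get_starttag_text()`, but `handle_endtag` only receives the lowercased tag name, so the original end-tag text has to be rebuilt and may not match the input.

```python
from html.parser import HTMLParser


class EchoParser(HTMLParser):
    """Hypothetical parser that tries to echo its input back unchanged."""

    def __init__(self):
        # convert_charrefs=False reports entity references as separate events
        super().__init__(convert_charrefs=False)
        self.out = []

    def handle_starttag(self, tag, attrs):
        # The original start-tag text is available from the parser.
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        # Only the lowercased name arrives here, so the tag is rebuilt.
        self.out.append("</%s>" % tag)

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append("&%s;" % name)

    def handle_charref(self, name):
        self.out.append("&#%s;" % name)


def echo(html):
    """Feed html through EchoParser and return the reconstructed text."""
    parser = EchoParser()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

With this sketch, `echo("<P>text</P>")` returns `<P>text</p>`: the start tag survives as written, while the capitalized end tag comes back lowercased. Declarations and end tags containing spaces are not handled here, and their treatment may differ between Python versions.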
What is your use case?
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.