htmldiff: Preserve whitespace after a tag #125

wants to merge 8 commits into from


2 participants

commented Aug 1, 2013

Whitespace after a tag (but before the first bit of non-whitespace text) is currently lost while diffing. These changes modify the `start_tag` function to keep any whitespace that occurs after the tag. I also renamed `split_trailing_whitespace` to `split_whitespace` and made it return both the start and end whitespace of a string as a tuple.

As part of this, `token.trailing_whitespace` is now renamed to `token.whitespace` and is a tuple containing the start and end whitespace, although at the moment the start whitespace is always an empty string.
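
To illustrate the tuple shape described above, here is a minimal sketch of what a `split_whitespace` helper could look like. It is not the code from this pull request: the return layout (the stripped word plus a `(start, end)` whitespace pair) is an assumption made only for illustration.

```python
def split_whitespace(text):
    """Split text into its stripped core and surrounding whitespace.

    Hypothetical sketch: returns (core, (start_ws, end_ws)). The actual
    helper in the pull request may use a different return layout.
    """
    # Strip the left side once and derive the leading whitespace from
    # the length difference, so the text is not scanned twice.
    stripped_left = text.lstrip()
    start_ws = text[:len(text) - len(stripped_left)]

    # Strip the right side of the remainder; whatever follows the core
    # is the trailing whitespace.
    core = stripped_left.rstrip()
    end_ws = stripped_left[len(core):]

    return core, (start_ws, end_ws)
```

With this layout, `split_whitespace("  foo \n")` yields `("foo", ("  ", " \n"))`, and a string made up only of whitespace reports all of it as start whitespace.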



commented on src/lxml/html/ in d9186f6 Aug 1, 2013

Since you're already stripping the word above anyway, it should be possible to avoid the first slicing here.

Owner Author

replied Aug 1, 2013



Contributor Author

commented Sep 2, 2013

I've updated the request to stop the test from failing, and it's all green on Travis. Is there anything else you would like me to include in this request, or is this enough?

@orf orf closed this Jan 9, 2014
