uppercase umlauts get replaced by lowercase umlaut entities #31

Closed
homeworkprod opened this issue Aug 11, 2010 · 2 comments
Labels
bug Confirmed bug that we should fix

Comments

@homeworkprod

The line

System.out.println(Jsoup.clean("<h1>Überschrift</h1>", Whitelist.none()));

should print

&Uuml;berschrift

but prints

&uuml;berschrift

This used to work correctly in v0.3.1, but fails in v1.2.3.

The baseArray table in Entities.java does distinguish between lowercase and uppercase umlauts, yet the above call still emits the lowercase entity.
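For illustration, here is a minimal, self-contained sketch of the behaviour I would expect. It is not jsoup's actual code: the class name `EntityEscapeSketch`, the `ENTITY_BY_CHAR` table and the `escape` method are all hypothetical, and the table only covers the German umlauts. The point is that the lookup is keyed on the exact character, so uppercase and lowercase umlauts cannot collapse into the same entity.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, NOT jsoup's implementation: a case-preserving
// character-to-entity lookup limited to the German umlauts.
class EntityEscapeSketch {
    // Keyed by the exact character, so 'Ü' and 'ü' are distinct entries.
    private static final Map<Character, String> ENTITY_BY_CHAR = new HashMap<>();
    static {
        ENTITY_BY_CHAR.put('\u00C4', "Auml"); // Ä
        ENTITY_BY_CHAR.put('\u00E4', "auml"); // ä
        ENTITY_BY_CHAR.put('\u00D6', "Ouml"); // Ö
        ENTITY_BY_CHAR.put('\u00F6', "ouml"); // ö
        ENTITY_BY_CHAR.put('\u00DC', "Uuml"); // Ü
        ENTITY_BY_CHAR.put('\u00FC', "uuml"); // ü
    }

    // Replaces each mapped character with its named entity and
    // passes every other character through unchanged.
    static String escape(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            String name = ENTITY_BY_CHAR.get(c);
            if (name != null) {
                out.append('&').append(name).append(';');
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(escape("\u00DCberschrift")); // prints &Uuml;berschrift
    }
}
```

If the lookup key or the entity name were lowercased anywhere along the way, the uppercase entry would be lost and the output would match what I am seeing. That is only a guess at the cause.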

@jhy
Owner

jhy commented Aug 11, 2010

Thanks for reporting this -- I'll fix it for the next release.

@jhy
Owner

jhy commented Aug 13, 2010

Fixed support for case-sensitive HTML escape entities.

Closed by 17d07c5

michael-simons pushed a commit to michael-simons/jsoup that referenced this issue Jul 12, 2011
zazi pushed a commit to dswarm/jsoup that referenced this issue Oct 15, 2015
This issue was closed.