Join GitHub today
GitHub is home to over 36 million developers working together to host and review code, manage projects, and build software together.Sign up
IDN support #27
In Denmark, all our domains are harvested and indexed with punycode, but users also want to search using the accented domains, like øx.dk etc.
I think this should be done server-side since different frontends shouldn't be implementing the same thing.
I had a look at how we could achieve this.
There are probably more places to modify in other configurations like Proxy mode, but that should be easy enough to do.
The thing with this solution is that I convert to puny-version before the canonicalazion. I don't know if that is a problem, but it is then not the case that the canonicalizer know that it is processing the unmodified URL.
I've made a possible solution here: https://github.com/iipc/openwayback/tree/issue27_IDN