I've been working on a music library project that includes lyrics. In French we have words with ligatures, like "œuvre" and "æquo", which are often written "oeuvre" and "aequo". This causes problems when searching: one person may enter a CD or work using the ligature, while another person searches for it without. Hence this change replaces each ligature with its non-ligature version during indexing.
I used http://en.wikipedia.org/wiki/Typographic_ligature to find the ligatures used in French. I did not dare implement those from other languages, not knowing the actual grammatical rules of those languages.
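To illustrate the idea, here is a minimal sketch of the replacement step in plain Ruby. It is not the code from this pull request: the names `FRENCH_LIGATURES`, `LIGATURE_PATTERN` and `expand_french_ligatures` are hypothetical, and mapping the uppercase ligatures to "OE" and "AE" is an assumption (an index that lowercases its input would not care either way).

```ruby
# Hypothetical mapping of the French ligatures to their two-letter forms.
# The uppercase targets ("OE", "AE") are an assumption for this sketch.
FRENCH_LIGATURES = {
  "œ" => "oe",
  "Œ" => "OE",
  "æ" => "ae",
  "Æ" => "AE"
}.freeze

# One pattern that matches any of the ligature characters above.
LIGATURE_PATTERN = Regexp.union(FRENCH_LIGATURES.keys)

# Returns a copy of +text+ with every ligature replaced by its
# non-ligature spelling; all other characters are left untouched.
def expand_french_ligatures(text)
  text.gsub(LIGATURE_PATTERN, FRENCH_LIGATURES)
end

puts expand_french_ligatures("œuvre")                      # prints "oeuvre"
puts expand_french_ligatures("Une œuvre classée ex æquo")  # prints "Une oeuvre classée ex aequo"
```

Applying the same function to both the indexed text and the search query would make "œuvre" and "oeuvre" match each other.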
I hope this improves the gem.
Added decoding of French ligatures