Replace decomposed unicode characters with precomposed #72

Merged
walkermatt merged 1 commit into main from normalize-unicode on Feb 22, 2023

Conversation

@walkermatt
Member

As per https://www.postgresql.org/message-id/1523.1407287675%40sss.pgh.pa.us, Postgres doesn't support decomposed UTF-8. In practice, decomposed characters such as 'a' followed by a combining acute accent can be stored, but they cause an issue when the data is output as LATIN1. Replacing decomposed sequences with their precomposed counterparts resolves the issue.
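
For illustration only, here is a minimal Python sketch of the idea, not the code from this PR. It assumes the replacement is equivalent to Unicode NFC normalization and uses the standard library's `unicodedata.normalize`. The helper name `to_precomposed` and the sample string are hypothetical.

```python
import unicodedata


def to_precomposed(text: str) -> str:
    """Return text with decomposed sequences replaced by precomposed
    characters where Unicode defines one (NFC normalization).

    Hypothetical helper for illustration; not taken from the PR.
    """
    return unicodedata.normalize("NFC", text)


# 'e' followed by U+0301 COMBINING ACUTE ACCENT (decomposed form).
decomposed = "cafe\u0301"

# The combining accent has no LATIN1 code point, so encoding fails.
try:
    decomposed.encode("latin-1")
    decomposed_encodes = True
except UnicodeEncodeError:
    decomposed_encodes = False

# After normalization the pair becomes U+00E9, which LATIN1 can represent.
precomposed = to_precomposed(decomposed)
latin1_bytes = precomposed.encode("latin-1")
```

Sequences with no precomposed equivalent are left unchanged by NFC, so they would still fail to encode as LATIN1 and would need separate handling.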

@walkermatt walkermatt merged commit 28d9f78 into main Feb 22, 2023