Skip to content

Excel often has a hard time opening and saving files that are UTF-8 encoded. Often this corrupts characters that are not ASCII. Here is a dictionary of the characters excel often messes up on and their approximate conversions. This is not all inclusive, but it probably gets you 90% of the way there.

Notifications You must be signed in to change notification settings

macklin-fluehr/Excel_UTF-8_character_conversions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Excel_UTF-8_character_conversions

Fix Excel Encoding

This is a simple script to repair a csv. It replaces the junky excel characters with UTF-8 characters.

For best results, to ensure that excel doesn't mess up again in the future, this module also uses the unidecode library to transform the UTF-8 characters into closely approximated ASCII.

Weird_excel_character_dictionary

The dictionary of junky excel stuff. This is not an inclusive dictionary, but it contains many of the characters you are most likely to come across. Feel free to add to it.

About

Excel often has a hard time opening and saving files that are UTF-8 encoded. Often this corrupts characters that are not ASCII. Here is a dictionary of the characters excel often messes up on and their approximate conversions. This is not all inclusive, but it probably gets you 90% of the way there.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages