Skip to content

mapmeld/osm-unicode-coverage

Repository files navigation

OSM Unicode Coverage

Using global OSM extracts from https://download.geofabrik.de/, find areas using each Unicode Block

Excludes Basic Latin, Latin-1 Supplement, and Punctuation. Includes Latin Extended

Lat/lng rounded to nearest two-digits to avoid over-density

Uses

  • Removing errors or vandalism (for example, "Čzech Republik🇨🇿" in New Zealand)
  • Locating local hotspots for languages (Antarctic bases, shops catering toward local communities or foreign visitors)
  • Highlighting use of less common scripts (for example, some Indian scripts are used only rarely on OSM names)
  • Measuring use of dual scripts
  • Measuring use of local script / Latin script in foreigner-mapped areas

Examples

Latin Extended-B hotspot in Guyana

Locating an Antarctic base browsing for Devanagari script

Unicode flag vandalism (found in New Zealand)

License

Open source / CC-Zero

About

Mapping use of different Unicode scripts by block

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published