Skip to content

A realization of a fully-reversible transliteration scheme, Cyr2Url.

License

Notifications You must be signed in to change notification settings

matan-bebop/translit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A realization of Cyr2Url transliteration scheme, which provides fully reversible 
romanization of cyrillic text. It can be suited for the romanization for URLs or filenames.

FEATURES:
- transliterates all characters from Ukrainian, Russian and Belorussian alphabets + space character
- fully reversible (including the lettercase for cyr-lat-cyr)
- fully URL-compatible (latin characters, dot, underscore, tilde)
- based on the transliteration standard developed by the Terminological Comission of the Natural Sciences (TCNS) of the Kyiv National University
- decently readable transliterated form 

Encoding of non-Ukrainian characters:
э = eh, ў = w                  # TCNS standard
ы = yh, ё = joh, ъ = .h

Examples:
  Ukrajina    = Україна        # TCNS standard
  cvtitenj    = цвітень        # TCNS
  горщик      = ghorshhyk      # TCNS
  p.jatj      = п'ять
  z~jorzhenyj = зйоржений
  ob.hjavytj  = объявить       # Russian alphabet
  johzhyk     = ёжик           # Russian 
  ehlektron   = электрон       # Russian & Belorussian 
  pownyh       = поўны         # Belorussian

Implemented translitaration scheme, Ukrainian URL (UURL) or Cyr2Url, was
invented by Andriy V. Makukha. Author's document describing the scheme can be
found on the Web,

http://nbuv.gov.ua/sites/default/files/all_files/201310_artilces_field_dopmat_files/wikiconf_2011_ukraine_-_makukha_-_article.pdf

as well as the original realization,

http://code.google.com/p/cyr2url/