Chechen Transliterator

A Python library for transliterating Chechen text from Cyrillic to Latin script using the Chechen Latin alphabet.

Installation

pip install ce-translit

Quick Start

import ce_translit

# Simple usage - transliterate Chechen text
text = "Нохчийн мотт"
result = ce_translit.transliterate(text)
print(result)  # Outputs: "Noxçiyŋ mott"

Features

Simple API: Clean, single-function interface
Linguistically Accurate: Handles all Chechen-specific rules
Context-Aware: Special handling for letter position rules
Customizable: Advanced options for specialized use cases
Pure Python: No external dependencies
Memory Efficient: Uses minimal memory and efficient string handling

Detailed Usage

Basic Usage

import ce_translit

# Transliterate a single word
word_result = ce_translit.transliterate("дош")  # "doş"

# Transliterate a sentence
sentence = "Муха ду хьал де?"
sentence_result = ce_translit.transliterate(sentence)  # "Muxa du ẋal de?"

Advanced Usage with Custom Rules

from ce_translit import Transliterator

# Create a custom transliterator with your own rules
custom_transliterator = Transliterator(
    # Custom letter mapping
    mapping={
        **Transliterator()._mapping, # First define base mapping
        # Then override specific mappings
        "й": "j",
        # Append completely new mappings
        "1": "j"
    },
    # Override blacklist (Words that should keep the regular 'н' at the end)
    blacklist=["дин", "гӏан", "сан"],
    # Override unsurelist (Words that should use 'ŋ[REPLACE]' at the end)
    unsurelist=["шун", "бен", "цӏен"]
)

# Use the custom transliterator
result = custom_transliterator.transliterate("1аж дин шун")

If you omit **Transliterator()._mapping** from the custom mapping, the custom transliterator will only use the custom mappings you provide.

Oveeride just one of list by defining a list outside

from ce_translit import Transliterator

# Define your own list
my_blacklist = ["дин", "гӏан", "сан"]

# Create a custom transliterator with defined blacklist
custom_transliterator = Transliterator(blacklist=my_blacklist)
result = custom_transliterator.transliterate("дин")

Special Transliteration Rules

The library handles several special rules in Chechen transliteration:

Letter 'е':
- At the start of a word → 'ye' (ex: "елар" → "yelar")
- After 'ъ' → 'ye' (ex: "шелъелча" → "şelyelça")
- In other positions → 'e' (ex: "мела" → "mela")
Letter 'н' at end of words:
- Regular handling → 'ŋ' (ex: "сан" → "saŋ")
- Blacklisted words keep 'n' (ex: "хан" → "xan")
- Unsurelist words use 'ŋ[REPLACE]' (ex: "шун" → "şuŋ[REPLACE]")
Standalone 'а':
- When 'а' is a standalone word → 'ə' (ex: "а" → "ə")
Special Character Combinations:
- 'къ' → 'q̇'
- 'хь' → 'ẋ'
- 'гӏ' → 'ġ'

Technical Details

Performance

The library is optimized for both startup time and runtime performance:

Data is loaded once at import time
Efficient string handling for minimal memory usage
Uses sets for O(1) lookups in blacklists and unsure lists

Development

Setting up the Development Environment

# Create and activate a virtual environment
python -m venv .venv
source .venv/bin/activate

# Install development tools
pip install --upgrade hatch pytest

# Run tests
hatch run test

# Build the package
hatch build

# Test the built package
pip install --force-reinstall dist/ce_translit-1.0.0-py3-none-any.whl

Running Tests

# Install test dependencies
pip install pytest

# Run tests
pytest

Repository Structure

ce-translit-py/
├── src/
│   └── ce_translit/
│       ├── __init__.py         # Public API
│       ├── _transliterator.py  # Core implementation
│       ├── data/
│       │   └── cyrl_latn_map.json  # Character mapping
├── tests/
│   └── test_transliterator.py
├── LICENSE
├── README.md
└── pyproject.toml

License

This project is licensed under the MIT License.

Contributing

Contributions are welcome! Feel free to submit issues or pull requests on the GitHub repository.

Related Projects

ce-translit-js - JavaScript version of this library

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
src/ce_translit		src/ce_translit
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Chechen Transliterator

Installation

Quick Start

Features

Detailed Usage

Basic Usage

Advanced Usage with Custom Rules

Oveeride just one of list by defining a list outside

Special Transliteration Rules

Technical Details

Performance

Development

Setting up the Development Environment

Running Tests

Repository Structure

License

Contributing

Related Projects

About

Uh oh!

Releases 2

Packages

Uh oh!

Languages

License

chechen-language/ce-translit-py

Folders and files

Latest commit

History

Repository files navigation

Chechen Transliterator

Installation

Quick Start

Features

Detailed Usage

Basic Usage

Advanced Usage with Custom Rules

Oveeride just one of list by defining a list outside

Special Transliteration Rules

Technical Details

Performance

Development

Setting up the Development Environment

Running Tests

Repository Structure

License

Contributing

Related Projects

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Languages

Packages