🧹 Python package for text cleaning
-
Updated
May 9, 2023 - Python
🧹 Python package for text cleaning
Chinese text normalization for speech processing
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
Code and model files for paper: I. Lourentzou et al., Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSM'19
This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.
Convert English text from written expressions into spoken forms
📢 Tha (ថា) - A Khmer Text Normalization and Verbalization Toolkit
An online text normalization tool for Chinese-English mixed text-to-speech system
Our source code for the paper "Transformer-based Joint Learning Approach for Text Normalization in Vietnamese ASR"
Utility for string normalization
Small Python wrapper class for the CAB webservice.
Implementation of the paper on Text normalization by Choudhury et al.
Cryptocurrency Market Analysis and Question Answering System
Implementing text normalization for Farsi(Persian) language.
Text Normalization on tweets (Tweet Normalization)
Code, models, and data for "Exploiting Dialect Identification in Automatic Dialectal Text Normalization". ArabicNLP 2024, ACL.
Simple tool to check if Unicode text files are Unicode-normalized
Clipboard Translator is a lightweight desktop application built with PyQt5 that automatically translates text copied to the clipboard into Persian using the Google Translate API. The application features a modern and minimalistic UI, custom styling, and real-time text normalization and tokenization.
Add a description, image, and links to the text-normalization topic page so that developers can more easily learn about it.
To associate your repository with the text-normalization topic, visit your repo's landing page and select "manage topics."