Text Normalization on tweets (Tweet Normalization)
Updated Nov 14, 2018 · Python
Code, models, and data for "Exploiting Dialect Identification in Automatic Dialectal Text Normalization". ArabicNLP 2024, ACL.
Simple tool to check if Unicode text files are Unicode-normalized
Small Python wrapper class for the CAB webservice.
Implementation of the paper on text normalization by Choudhury et al.
Implementation of text normalization for the Farsi (Persian) language.
Utility for string normalization
Our source code for the paper "Transformer-based Joint Learning Approach for Text Normalization in Vietnamese ASR"
An online text normalization tool for Chinese-English mixed text-to-speech systems
📢 Tha (ថា) - A Khmer Text Normalization and Verbalization Toolkit
Convert English text from written expressions into spoken forms
This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.
Code and model files for the paper: I. Lourentzou et al., "Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSM'19
Russian text normalization pipeline for speech-to-text and other applications, based on tagging s2s networks
Chinese text normalization for speech processing
🧹 Python package for text cleaning