Finite-state script normalization and processing utilities
-
Updated
May 25, 2024 - Python
Finite-state script normalization and processing utilities
Anuvaad - Open Sourced Document Translation Platform for Indic Languages
Software and Resources for Mitigating Online Gender Based Violence in India
Fcitx5 wrapper for Varnam input method. Easily type Indian languages on Linux desktops.
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
Resources and tools for Indian language Natural Language Processing
A comprehensive platform that leverages AI to digitize and streamline patient health data. Supports text, voice, and image input across multiple languages, enabling efficient data entry. Includes an advanced data visualization dashboard for real-time health insights.
AISumNews or shortened for AI Summarized News is an open source application for displaying summarized news content from around the web.
ASCII <-> Unicode conversion library
Source code for marwari.info - A multi-lingual dictionary for Marwari language that is spoken by 8 million people in India. https://marwari.info by @ManasMadrecha
This Django project on PythonAnywhere digitizes catalogs via text, Indic text, voice, OCR for images. Supports image-text, audio modes for data capture. User-friendly interface for viewing digitized records.
Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"
a LoRA trained on English-Hindi text pair for machine translation that is suitable for conversational inputs.
Indic evals for quantised models AWQ / GPTQ / EXL2
Truthfulqa_indic, Available in Hindi, Punjabi, Kannada, Tamil and Telugu
Javascript Bidirectional Transliteration Libarary
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Web Interface for Transliteration for Indic languages.
A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagari script.
Add a description, image, and links to the indic-languages topic page so that developers can more easily learn about it.
To associate your repository with the indic-languages topic, visit your repo's landing page and select "manage topics."