A simple, consistent and extendable toolkit for IndicTrans2 tokenizer
-
Updated
Jun 27, 2024 - Python
A simple, consistent and extendable toolkit for IndicTrans2 tokenizer
Benchmarking Study of Bloomz-560m, mBART-large, IndicBART on the Indic Languages
Resources and tools for Indian language Natural Language Processing
A python package to run contextualized topic modeling for Indic Languages. indicCTMs combine contextualized embeddings (e.g., IndicBERT) with topic models to get coherent topics in Hindi, English, and Tamil.
Code for Evaluating Inter-Bilingual Semantic Parsing for Indian Languages Paper at NLP4ConvAI at ACL 2023
Add a description, image, and links to the indicnlp topic page so that developers can more easily learn about it.
To associate your repository with the indicnlp topic, visit your repo's landing page and select "manage topics."