NONWESTLIT Project Codebase
-
Updated
Jul 23, 2024 - Python
NONWESTLIT Project Codebase
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
This repository highlights the LLMs reasoning capabilities of ✨ Mistral / LLaMA-3 / Phi-3 / Gemma / Flan-T5 / GPT-4o ✨ in Targeted Sentiment Analysis in Russian / Translated to English mass-media 📊
English-Sinhala multilingual word embedding alignment resources
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly
Fine-tune LLM for early Middle English lemmatization with data from LAEME.
Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models (EMNLP 2024)
Hausa Natural Language Processing Repository
Pashto Natural Language Processing Toolkit
Code for AACL23 paper "Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish"
A comprehensive overview of research regarding Natural Language Processing (NLP) of Manipuri language.
This is the official repository contains the code, data, and models of the paper titled "Shironaam: Bengali News Headline Generation using Auxiliary Information", accepted for publication in Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL’23), May 2-6, 2023.
This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper (arxiv:1808.08932)
Unsupervised Contextualized Document Representation, to appear in SustaiNLP 2021 EMNLP 2021
Must-read papers on relation extraction.
Crowdsource Platform for Low Resourced Language Annotation and Corpus Contribution
This repository provides HAWP: a dataset for Hindi Word Problem Solving and a baseline (LREC 2022)
Wen Lai's Blog related to MT/NLP/ML
Awesome Lao Natural Language Processing
Implementation of NAACL 2024 main conference paper: Named Entity Recognition Under Domain Shift via Metric Learning for Life Science
Add a description, image, and links to the low-resource-nlp topic page so that developers can more easily learn about it.
To associate your repository with the low-resource-nlp topic, visit your repo's landing page and select "manage topics."