Chatbot Solution for Resource-Poor Languages. Contains code and data for Journal Article 'Focused domain contextual AI chatbot framework for resource poor languages'.
-
Updated
Jul 25, 2021 - Python
Chatbot Solution for Resource-Poor Languages. Contains code and data for Journal Article 'Focused domain contextual AI chatbot framework for resource poor languages'.
Unsupervised Contextualized Document Representation, to appear in SustaiNLP 2021 EMNLP 2021
[ACL 2021] OntoED: Low-resource Event Detection with Ontology Embedding
Code and datasets for the ACL 2021 paper "OntoED: Low-resource Event Detection with Ontology Embedding"
This repository contains the code, data, and associated models of the paper titled "BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset", accepted in Proceedings of the Asia-Pacific Chapter of the Association for Computational Linguistics: AACL 2022.
Enhanced awesome-align for low-resource languages and noise simulation: https://arxiv.org/abs/2301.09685
[SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction
This repository is an open-source colleciton of various low-resource machine translation experiments.
This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper (arxiv:1808.08932)
Fine-tune LLM for early Middle English lemmatization with data from LAEME.
Code for AACL23 paper "Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish"
Official implementation of the EACL Findings 2024 paper: Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
English-Sinhala multilingual word embedding alignment resources
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Efficient Information Extraction in Few-Shot Relation Classification through Contrastive Representation Learning. NAACL 2024.
Implementation of NAACL 2024 main conference paper: Named Entity Recognition Under Domain Shift via Metric Learning for Life Science
NONWESTLIT Project Codebase
NLP pipelines for Tagalog using spaCy
Add a description, image, and links to the low-resource-nlp topic page so that developers can more easily learn about it.
To associate your repository with the low-resource-nlp topic, visit your repo's landing page and select "manage topics."