NLP deep learning model for multilingual toxicity detection in text 📚
Updated Aug 10, 2020 - Jupyter Notebook
Dataset and dataset processing for a CMU 11-785 Deep Learning project
BERT classification of Myers-Briggs personality types based on tweets in four different European languages.
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Cross-lingual alignment model for creating a corpus of Hindi sentences aligned with English fact triples.
Evaluating the Efficacy of Summarization Evaluation across Languages. In Findings of ACL 2021.
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
Master's thesis with code investigating methods for incorporating long-context reasoning in low-resource languages without pre-training from scratch. We investigated whether multilingual models could inherit these properties by converting them into an Efficient Transformer (such as the Longformer architecture).
Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"
Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society
A multilingual lexicon of words to hurt.
Code for the shared task on homophobia/transphobia detection at LT-EDI Workshop @ ACL 2022
A model-based cleaner that uses LASER sentence embeddings to filter misaligned segment pairs. The product is scaled by building the task queues asynchronously, dispatching tasks in round-robin order, and adding multiple workers on the RabbitMQ server for consumption.
Code for "Multilingual Sentiment Elicitation System for Social Media Data" @ IEEE Intelligent Systems
Code for a master's thesis investigating approaches for building a multilingual, knowledge-grounded dialogue system via cross-task and cross-lingual transfer learning.
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
PyTorch implementation of sentiment analysis of long texts written in Serbian (an underused language), using a pretrained multilingual RoBERTa-based model (XLM-R) on a small dataset.
Multilingual Speech to Speech (STS) Translator, described as the first code-mixed English-Arabic speech to Bangla-Arabic speech translator