Dataset & dataset processing for (CMU 11-785 Deep Learning Project)
-
Updated
Sep 23, 2020 - Python
Dataset & dataset processing for (CMU 11-785 Deep Learning Project)
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"
A multilingual lexicon of words to hurt.
A model-based cleaner using Laser sentence embeddings to exploit embeddings to filter misaligned segment pairs. Product scaled by asynchronously building the Task Queues, dispatching the tasks in a Round Robin method and adding multiple workers on the RabbitMQ server for consumption.
Code for "Multilingual Sentiment Elicitation System for Social Media Data" @ IEEE Intelligent Systems
Codes for master's thesis investigating approaches for building a multilingual, knowledge-grounded dialogue system via cross-task and cross-lingual transfer learning.
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
PicQ: Demo for MiniCPM Llama3 to answer questions about images using natural language.
Add a description, image, and links to the multilingual-models topic page so that developers can more easily learn about it.
To associate your repository with the multilingual-models topic, visit your repo's landing page and select "manage topics."