Multilingual sentence alignment using sentence embeddings
Updated Nov 4, 2024 - Python
A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.
Cross-lingual word embeddings are crucial for various multilingual NLP tasks. This project focuses on aligning monolingual word embeddings from English and Hindi to create a shared cross-lingual embedding space.
Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
ISCC - Semantic Code Text
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
A cross-lingual toxicity detection model that works for over 100 languages. Powered by the XLM-R model, it is described by its authors as achieving state-of-the-art performance.
Cross-lingual language models for building search engines for the Holy Quran and Sahih Hadiths
Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
Cross-lingual word embeddings for Turkic languages
Code for the paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph" (Findings of EMNLP 2021).
A curated list of papers and experiments in the field of Natural Language Processing (NLP)
Weakly-supervised bilingual lexicon induction
Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries (ACL 2020)
Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization (ACL 2019)
Improving cross-lingual word embeddings by meeting in the middle
Bilingual n-gram Phrase Table Induction with Dynamax-Jaccard
A monolingual and cross-lingual meta-embedding generation framework
Automating healthcare QA in a noisy multilingual low-resource setting
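Several of the projects above align two monolingual embedding spaces into one shared space (for example, the English and Hindi alignment project). As a rough illustration of what that can look like, here is a minimal sketch using orthogonal Procrustes, a common supervised alignment technique. It is not taken from any of the listed repositories: the function names `procrustes_align` and `nearest_target` are hypothetical, and the "embeddings" are random toy vectors standing in for real word vectors and a seed dictionary.

```python
import numpy as np

def procrustes_align(src, tgt):
    """Return the orthogonal matrix W minimizing ||src @ W - tgt||_F.

    src, tgt: (n_pairs, dim) arrays; row i of src is assumed to be
    a translation pair with row i of tgt (a seed dictionary).
    """
    u, _, vt = np.linalg.svd(src.T @ tgt)
    return u @ vt

def nearest_target(vec, W, tgt_matrix):
    """Map a source vector into the target space and return the index
    of the most cosine-similar row of tgt_matrix."""
    mapped = vec @ W
    tgt_unit = tgt_matrix / np.linalg.norm(tgt_matrix, axis=1, keepdims=True)
    sims = tgt_unit @ (mapped / np.linalg.norm(mapped))
    return int(np.argmax(sims))

# Toy data (assumption): the "target language" space is an exact
# rotation of the "source language" space. Real spaces are only
# approximately isomorphic, so real alignments are noisier.
rng = np.random.default_rng(0)
dim, n_words = 8, 50
src_vecs = rng.normal(size=(n_words, dim))
true_rot, _ = np.linalg.qr(rng.normal(size=(dim, dim)))
tgt_vecs = src_vecs @ true_rot

# Learn the mapping from the first 30 pairs only, then look up
# a word that was held out of the seed dictionary.
W = procrustes_align(src_vecs[:30], tgt_vecs[:30])
held_out_match = nearest_target(src_vecs[40], W, tgt_vecs)
```

Constraining W to be orthogonal preserves distances and angles within the source space, which is one reason this formulation is popular. Several entries above (iterative normalization, retrofitting, contrastive learning) address cases where the two spaces are not close to a rotation of each other.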