Skip to content

A curated list of resources for Cross-lingual Information Retrieval (CLIR).

Notifications You must be signed in to change notification settings

flamato/awesome-clir

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

Awesome Cross-lingual Information Retrieval

A curated list of resources for Cross-lingual Information Retrieval (CLIR).

Contributing

Please feel free to email Rui Zhang (r.zhang@yale.edu).

Table of Contents

Tutorials

CLIR

  • Cross-Language Information Retrieval [book]

Cross-lingual Word Embeddings

  • Cross-Lingual Word Representations: Induction and Evaluation (EMNLP 2017 Tutorial) [slides]
  • A Survey of Cross-lingual Word Embedding Models (JAIR) (paper)

Neural Networks for IR

  • Neural Networks for Information Retrieval (SIGIR 2017, ECIR 2018, WSDM 2018 Tutorial) [slides]
  • Neural Models for Information Retrieval [video][paper][slides]
  • An Introduction to Neural Information Retrieval [paper]
  • Neural Text Embeddings for Information Retrieval (WSDM 2017) [slides]
  • Neural Information Retrieval: At the End of the Early Years (Information Retrieval Journal 2018) [slides]
  • Neural Information Retrieval: A Literature Review [paper]
  • Deep Learning for Information Retrieval (SIGIR 2016 Tutorial) [paper]
  • Learning to Rank for Information Retrieval and Natural Language Processing [book]

Other Books

  • Information Retrieval [book]
  • Estimating the Query Difficulty for Information Retrieval [book]

Blogs

Papers

Neural CLIR

  • Unsupervised Cross-Lingual Information Retrieval using Monolingual Data Only (SIGIR 2018) [paper][code]
  • Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings (SIGIR 2015) [paper]
  • Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval (ACL 2018) [paper][code]
  • Cross-lingual Learning-to-Rank with Shared Representations (NAACL 2018) [paper][data]
  • Learning to Match using Local and Distributed Representations of Text for Web Search (WWW 2017) [paper][code]
  • PACRR: A Position-Aware Neural IR Model for Relevance Matching (EMNLP 2017) [paper][code]
  • Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval (WSDM 2018) [paper][code]
  • Learning to Translate: A Query-Specific Combination Approach for Cross-Lingual Information Retrieval (EMNLP 2014) [paper]
  • Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval (NAACL 2015)[paper]
  • A Dual Embedding Space Model for Document Ranking (WWW 2016)[paper]
  • Query Expansion with Locally-Trained Word Embeddings (ACL 2016)[paper]
  • A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval (CIKM 2014)[paper]
  • Toward Incorporation of Relevant Documents in word2vec (NeuIR-SIGIR 2017)[paper]
  • Using Word Embeddings for Automatic Query Expansion (NeuIR-SIGIR 2016)[paper]
  • Query Expansion Using Word Embeddings (CIKM 2016)[paper]
  • Improved Cross-Language Retrieval using Backoff Translation (HLT 2001)[paper]

Crosslingual Word Embeddings

  • (MUSE) Word Translation Without Parallel Data (ICLR 2018) [paper][github][blog]
  • (Babylon - fastText_multilingual) Offline bilingual word vectors, orthogonal transformations and the inverted softmax (ICLR 2017) [paper][github]
  • Learning Crosslingual Word Embeddings without Bilingual Corpora (EMNLP 2016) [paper][github]
  • Multilingual Training of Crosslingual Word Embeddings (EACL 2017)[paper]
  • Unsupervised Multilingual Word Embeddings (EMNLP 2018)[paper][code]
  • (multiCluster and multiCCA) Massively Multilingual Word Embeddings [paper]
  • (bivec) Bilingual Word Representations with Monolingual Quality in Mind (NAACL 2015 workshop)[paper][github]
  • Cross-lingual Models of Word Embeddings: An Empirical Comparison (ACL 2016) [paper]
  • Bilingual Embeddings with Random Walks over Multilingual Wordnets (Knowledge-Based Systems, 2018) [paper]
  • Cross-lingual Word Clusters for Direct Transfer of Linguistic Structure (NAACL 2012) [paper]
  • Improving Cross-Lingual Word Embeddings by Meeting in the Middle (EMNLP 2018) [paper]
  • CLUSE: Cross-Lingual Unsupervised Sense Embedding (EMNLP 2018) [paper][code]
  • Unsupervised Cross-lingual Transfer of Word Embedding Spaces (EMNLP 2018) [paper]

Cross-lingual Learning

  • Cross-Lingual Morphological Tagging for Low-Resource Languages (ACL 2016) [paper]
  • Phrase-Based & Neural Unsupervised Machine Translation (EMNLP 2018) [paper]
  • Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision (EMNLP 2018) [paper]
  • XL-NBT: A Cross-lingual Neural Belief Tracking Framework (EMNLP 2018) [paper]
  • Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization (EMNLP 2018) [paper]
  • Cross-lingual Lexical Sememe Prediction (EMNLP 2018) [paper]
  • Joint Multilingual Supervision for Cross-lingual Entity Linking (EMNLP 2018) [paper]
  • Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks (EMNLP 2018) [paper]
  • Neural Cross-lingual Named Entity Recognition with Minimal Resources (EMNLP 2018) [paper]
  • Multi-lingual Common Semantic Space Construction via Cluster-Consistent Word Embedding (EMNLP 2018) [paper]
  • XNLI: Evaluating Cross-lingual Sentence Representations (EMNLP 2018) [paper]

Miscellaneous

  • Morphological Word-Embeddings (NAACL 2015) [paper]
  • Discriminative Reranking for Machine Translation (NAACL 2004) [paper]
  • Discriminative Reranking for Natural Language Parsing (Computational Linguistics) [paper]
  • Probabilistic Structured Query Methods (SIGIR 2003) [paper]
  • Strong Baselines for Neural Semi-supervised Learning under Domain Shift (ACL 2018) [paper]
  • Transfer Learning for Low-Resource Neural Machine Translation (EMNLP 2016) [paper]
  • Meta-Learning for Semi-Supervised Few-Shot Classification (ICLR 2018) [paper]
  • One-shot Learning with Memory-Augmented Neural Networks [paper]
  • Low-Resource Semantic Role Labeling (ACL 2014) [paper]
  • Universal Neural Machine Translation for Extremely Low Resource Languages (NAACL 2018) [paper]
  • An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages (LREC 2018) [paper]

Lists

  • Information Retrieval [link]
  • Awesome Information Retrieval [link]

About

A curated list of resources for Cross-lingual Information Retrieval (CLIR).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published