
Starred repositories
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf
SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Python library & examples for Masked Language Model Scoring (ACL 2020)
cookiecutter template for web API with python
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
The code of a graph neural network (GNN) for molecules, which is based on learning representations of r-radius subgraphs (i.e., fingerprints) in molecules.
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)
🌈 Implementation of Neural Network based Named Entity Recognizer (Lample+, 2016) using Chainer.
Implementations of Embedding-based methods for Knowledge Base Completion tasks
The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"
Python implementation of "Data-dependent Learning of Symmetric/Antisymmetric Relations for Knowledge Base Completion [Manabe+. 2018]"
A Japanese tokenizer based on recurrent neural networks
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Counter-fitting Word Vectors to Linguistic Constraints
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"