ruyimarone/character-eyes


Character Eyes

Code for our project analyzing character-level taggers, accepted at BlackboxNLP 2019.

[Figure: example activations]

Contents

  • model.py - A fully character-level tagger model, implemented in DyNet. It supports asymmetric bidirectional RNNs, whose effect on performance we found to depend on linguistic properties of the language.
  • Pretrained models for 6 of our 24 languages
  • Ready-to-train datasets (from Universal Dependencies 2.3) for all 24 languages
  • This notebook, which produces some of the figures in our paper and gives an overview of our method and metrics. This is the best place to start if you want to use our project.
  • An interactive notebook for visualizing the character-level activations of our models
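
To illustrate what an asymmetric bidirectional RNN over characters might look like, here is a minimal plain-Python sketch. It is not the DyNet code in model.py: it assumes that "asymmetric" means the forward and backward directions have different numbers of hidden units, it uses a simple tanh recurrence rather than whichever cell the repository uses, and all names and sizes (make_rnn, run_rnn, asymmetric_birnn, FWD_UNITS, BWD_UNITS) are hypothetical.

```python
import math
import random


def make_rnn(input_dim, hidden_dim, rng):
    """Create random weights for a simple tanh RNN, stored as nested lists."""
    def matrix(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {"W_in": matrix(hidden_dim, input_dim),
            "W_rec": matrix(hidden_dim, hidden_dim),
            "hidden_dim": hidden_dim}


def run_rnn(rnn, inputs):
    """Return the hidden state after each input vector, in the order given."""
    h = [0.0] * rnn["hidden_dim"]
    states = []
    for x in inputs:
        h = [math.tanh(sum(w * xi for w, xi in zip(w_in, x)) +
                       sum(w * hi for w, hi in zip(w_rec, h)))
             for w_in, w_rec in zip(rnn["W_in"], rnn["W_rec"])]
        states.append(h)
    return states


def asymmetric_birnn(char_vectors, fwd_rnn, bwd_rnn):
    """Concatenate forward and backward states at each character position."""
    fwd = run_rnn(fwd_rnn, char_vectors)
    # Run the backward RNN right-to-left, then realign to input order.
    bwd = run_rnn(bwd_rnn, char_vectors[::-1])[::-1]
    return [f + b for f, b in zip(fwd, bwd)]


rng = random.Random(0)
EMBED_DIM, FWD_UNITS, BWD_UNITS = 8, 96, 32  # hypothetical, unequal on purpose
word = "tagger"
# Random stand-ins for learned character embeddings.
char_table = {c: [rng.uniform(-1, 1) for _ in range(EMBED_DIM)]
              for c in sorted(set(word))}
fwd_rnn = make_rnn(EMBED_DIM, FWD_UNITS, rng)
bwd_rnn = make_rnn(EMBED_DIM, BWD_UNITS, rng)
outputs = asymmetric_birnn([char_table[c] for c in word], fwd_rnn, bwd_rnn)
```

Each entry of outputs is one per-character vector of FWD_UNITS + BWD_UNITS values. Under the assumption above, shifting units between the two directions keeps the total size fixed while changing how much capacity reads the word left-to-right versus right-to-left.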

Much of the code is modified from Mimick, a character-level system that can replace out-of-vocabulary words (OOVs or UNKs) with learned representations approximating a closed-vocabulary set of word embeddings.

Citation format

When using our work, please cite it with the following BibTeX entry:

@article{charactereyes,
  title={Character Eyes: Seeing Language through Character-Level Taggers},
  author={Pinter, Yuval and Marone, Marc and Eisenstein, Jacob},
  journal={arXiv preprint arXiv:1903.05041},
  year={2019}
}
