-
Notifications
You must be signed in to change notification settings - Fork 15
/
README.md
23 lines (18 loc) · 1.74 KB
/
README.md
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
# NeuroDecipher
This repo hosts the codebase for reproduce the results for the ACL paper [Neural Decipherment via Minimum-Cost Flow: from Ugaritic to Linear B](http://people.csail.mit.edu/j_luo/assets/publications/NeuroDecipher.pdf).
# Data
Data for linear B and Ugaritic are included in the `data` folder:
* `uga-heb.no_spe.cog` is the entire Ugaritic-Hebrew data obtained from [Ben Snyder](http://people.csail.mit.edu/bsnyder/papers/bsnyder_acl2010.pdf). `no_spe` stands for no special symbols since the original file contains special symbols that mark the morphological segmentations and affixes.
* `uga-heb.small.no_spe.cog` is the exact random subset of Ugaritic data I used in the paper for training the model. Around one tenth of the original file.
* `linear_b-greek.cog` is the linear B data used in the paper. `notebooks/Linear_b_simplified.ipynb` is the same notebook I used for preparing the linear B data.
* `linear_b-greek.names.cog` is the linear B data that only included names on the Greek side.
Note that you might need to install [fonts](https://www.google.com/get/noto/#sans-linb) in order to render Linear B scripts properly in your computer.
# Install
* Run `git submodule init && git submodule update` to obtain all the submodules.
* Install `pytorch`. Any version `>= 1.3` should work.
* `pip install -r requirements.txt` for other libraries.
* Go to the three folders (`editdistance`, `arglib`, and `dev_misc`) and run `pip install .` in each folder to install the dependency.
* Run `pip install .` in the root folder.
# Run
The recommended way to run this program is to write down a configuration class first. See `nd/config/decipher_config.py` for an example, and
accordingly you can use `python nd/main.py --cfg UgaHebSmallNoSpe` to start running.