Skip to content

Commit

Permalink
Add placenames from Pleiades to Latin dictionary
Browse files Browse the repository at this point in the history
  • Loading branch information
ryanfb committed Apr 20, 2015
1 parent fa50456 commit 29b71a0
Show file tree
Hide file tree
Showing 3 changed files with 22,154 additions and 1 deletion.
2 changes: 1 addition & 1 deletion Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ lat.rigaudon.word.txt: tools/rigaudonparseword.sh wordlist.rigaudon
lat.perseus.word.txt: tools/wordlistparseword.sh wordlist.perseus
tools/wordlistparseword.sh < wordlist.perseus > $@

lat.word.txt: lat.perseus.word.txt lat.rigaudon.word.txt
lat.word.txt: lat.perseus.word.txt lat.rigaudon.word.txt lat.pleiades.word.txt
LC_ALL=C cat $^ | sort | uniq > $@

seed:
Expand Down
13 changes: 13 additions & 0 deletions Makefile.pleiades
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
all: lat.pleiades.word.txt

lat.pleiades.word.txt: latin-names-historic.csv
csvcut -c 27 latin-names-historic.csv | tr -d '",…' | tr ' ' "\n" | grep -v "[][?'’()+ŚĀ.0-9-]" | grep -v '^[a-z]' | grep -v '^$$' | sort -u > $@

latin-names-historic.csv: pleiades-names-latest.csv
csvgrep -c 15 -r '^(la)?$$' pleiades-names-latest.csv | csvgrep -c 24 -i -r '[MOSN]'|csvgrep -c 7 -i -r '.*TAVO Index.*' > $@

pleiades-names-latest.csv: pleiades-names-latest.csv.gz
gunzip -k pleiades-names-latest.csv.gz

pleiades-names-latest.csv.gz:
wget 'http://atlantides.org/downloads/pleiades/dumps/pleiades-names-latest.csv.gz'
Loading

0 comments on commit 29b71a0

Please sign in to comment.