OEDict

Old English dictionary parser

The goal of this project is to parse the Bosworth and Toller Anglo-Saxon dictionary into modern formats.

We are basing the parser on the digital text version available here:
https://www.ling.upenn.edu/~kurisuto/germanic/oe_bosworthtoller_about.html

Instructions

First, install the loco-lab fork of pyglossary from its at_delim branch:

git clone git@github.com:loco-lab/pyglossary.git
cd pyglossary
git checkout at_delim
pip install .

From this repository, you can then use csv_to_apple.py to create an Apple XML dictionary source, which can be compiled and imported into the macOS Dictionary app. Compiling requires the Dictionary Development Kit from Xcode.

python csv_to_apple.py oe_bt.csv oe_bt.xml
cd oe_bt.xml
make
make install
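
The commands above can also be scripted. The following is a minimal sketch, not part of this repository: the helper names build_steps and run_steps are hypothetical, and it assumes it is run from the repository root and that csv_to_apple.py writes a directory named after its second argument containing a Makefile (as the `cd oe_bt.xml` step suggests).

```python
import subprocess
import sys
from pathlib import Path


def build_steps(csv_path="oe_bt.csv", out_path="oe_bt.xml"):
    """Return (command, working_directory) pairs mirroring the manual steps.

    Hypothetical helper: the file names default to those used in this README.
    """
    out_dir = Path(out_path)
    return [
        # Step 1: convert the CSV into Apple dictionary sources.
        ([sys.executable, "csv_to_apple.py", csv_path, out_path], Path(".")),
        # Steps 2 and 3: compile and install from inside the output directory.
        (["make"], out_dir),
        (["make", "install"], out_dir),
    ]


def run_steps(steps, dry_run=False):
    """Run each step in order; check=True stops at the first failing command."""
    for command, cwd in steps:
        print(f"[{cwd}] {' '.join(command)}")
        if not dry_run:
            subprocess.run(command, cwd=cwd, check=True)
```

Calling run_steps(build_steps(), dry_run=True) prints the three commands without executing them, which is a cheap way to check the paths before compiling.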

pyglossary can be used to convert to a variety of other formats as well.

Credits

We acknowledge the following contributions to the creation of the text version:

Scanning and initial OCR were funded by Joel Dean grants at Swarthmore College, summer 2001 and summer 2002.
Jason Burton: Scanned the entire text during summer 2001, and performed preliminary OCR.
B. Dan Fairchild: Revised OCR work and performed a preliminary round of automated corrections, summer 2002.
Margaret Hoyt: Image postprocessing on pages b0400-b1000, summer 2003.
Grace Mrowicki and Michael O'Keefe: Hand correction of some pages (prior to initiation of web-based correction system), summer 2003.
Sarah Hartman, Finlay Logan, and Sean Crist: Correction of headwords in the 1921 supplement, spring/summer 2005.
Thomas McFadden: Correction of page headers, summer 2005.
Kari Swingle: Joel Dean grantee, summer 2001
David Harrison: Joel Dean grantee, summer 2002
Sean Crist: Initiated project; Principal Investigator under Joel Dean grantees, summer 2001/2002; major software design and programming; global correction; image postprocessing except on pages b0401-b1000.
Thanks to Ian Sandy for pointing out a few image files which contained the wrong page image.

