This repository has been archived by the owner on Mar 17, 2021. It is now read-only.

A Python script that converts the Online Plain Text English Dictionary to JSON format.

johnridesabike/OPTED-to-JSON

OPTED to JSON Converter

The Online Plain Text English Dictionary (OPTED) is a public domain dictionary based on Project Gutenberg's digitization of the 1913 Webster Unabridged Dictionary.

This is a Python script that converts the OPTED from HTML to JSON. The code can probably be modified to convert the OPTED to other formats as well.
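As a rough illustration of the kind of conversion involved, here is a minimal sketch. It is not the script from this repository. It assumes each OPTED entry is a single HTML line of the form `<P><B>word</B> (<I>part of speech</I>) definition text</P>`, which is not confirmed by this README; the `ENTRY` pattern and the `entries_to_dict` helper are hypothetical names.

```python
import json
import re
from collections import defaultdict

# Assumed entry layout (an assumption, not taken from this README):
#   <P><B>word</B> (<I>part of speech</I>) definition text</P>
ENTRY = re.compile(r"<P><B>(.*?)</B> \(<I>(.*?)</I>\) (.*?)</P>", re.IGNORECASE)


def entries_to_dict(html):
    """Group every matched entry under its headword."""
    words = defaultdict(list)
    for word, pos, text in ENTRY.findall(html):
        words[word].append({"partOfSpeech": pos.strip(), "text": text.strip()})
    return dict(words)


# Hand-written sample in the assumed layout, not real OPTED data.
sample = (
    "<P><B>Word</B> (<I>n.</I>) definition one</P>\n"
    "<P><B>Word</B> (<I>v.</I>) definition two</P>\n"
)

converted = entries_to_dict(sample)
print(json.dumps(converted, indent=2))
```

A regular expression is enough here only because the assumed layout is flat and one entry per line; a real HTML parser would be the safer choice if the source files turn out to be less regular.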

Output files are written to the /json/ directory, one file per letter. Each file contains a JSON object whose names are the words and whose values are arrays of definitions. Each definition is an object holding the part of speech and the definition text. See this example:

{
  "word": [
    {"partOfSpeech": "noun", "text": "definition one"},
    {"partOfSpeech": "verb", "text": "definition two"}
  ]
}
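Reading the output back might look like the following minimal sketch. The file name `w.json` is hypothetical, since the naming of the per-letter files is not given here, and `lookup` is an illustrative helper rather than part of the repository. To stay self-contained, the sketch writes a small stand-in file to a temporary directory instead of reading from /json/.

```python
import json
import tempfile
from pathlib import Path


def lookup(data, word):
    """Return (part of speech, text) pairs for a word, or [] if it is absent."""
    return [(d["partOfSpeech"], d["text"]) for d in data.get(word, [])]


# Stand-in for one per-letter output file, using the structure shown above.
sample = {
    "word": [
        {"partOfSpeech": "noun", "text": "definition one"},
        {"partOfSpeech": "verb", "text": "definition two"},
    ]
}

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "w.json"  # hypothetical file name
    path.write_text(json.dumps(sample), encoding="utf-8")
    data = json.loads(path.read_text(encoding="utf-8"))

result = lookup(data, "word")
missing = lookup(data, "absent")
print(result)
```

Because each word maps to an array, a word with several senses or parts of speech yields several pairs, and an unknown word yields an empty list rather than an error.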

There are several other scripts out there that convert the Webster Unabridged Dictionary to different formats. I wasn't satisfied with any of the ones I found, so I wrote my own.

Depending on your needs, the GNU Collaborative International Dictionary of English (GCIDE) may be more useful, since it's much more complete. The GCIDE is licensed under the GPL, so its use comes with conditions, whereas the text of the OPTED is public domain.

The Python code in this repository is MIT licensed.
