Skip to content

A simple crawler which pulls down word/origin pairs from etymonline.com

Notifications You must be signed in to change notification settings

HaveF/etymonline_scraper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Online Etymology Dictionary Scraper

It works! I have plans for building an HMM capable of recognizing language of origin from a word's orthographical form.

After you have scrapy installed you can run it with the following command while in the project directory. This will create a large JSON file of word and origins pairs.

scrapy crawl etymonline.com -o etymonline_data.json -t json -s LOG_FILE=etymonline_data.log -L WARNING

-L used to set Log levels.

About

A simple crawler which pulls down word/origin pairs from etymonline.com

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%