Skip to content

Transforms popular web dictionaries into word lists. Made for NLP data analysis. Only oxfordreference available currently.

Notifications You must be signed in to change notification settings

SarahKay99/URLtoDict

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

URLtoDict

Transforms popular web dictionaries into word lists. Made for NLP data analysis. Only oxfordreference available currently.

DEPENDENCIES: furl, bs4, requests, urllib

Future sites: Wiktionary

Takes 4-6 seconds per webpage.

LEARN IN 30 SECONDS:

from URLtoDict import URLtoDict
convert = URLtoDict()
oxford_url = r"https://www.oxfordreference.com/view/10.1093/acref/9780199688975.001.0001/acref-9780199688975" # dictionary of compsci
url, words = convert.oxfordreference(oxford_url)

print(words)  # should be an array

TRY WITH ANY DICTIONARY FROM OXFORDREFERENCE.

About

Transforms popular web dictionaries into word lists. Made for NLP data analysis. Only oxfordreference available currently.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages