
meekr/english-words-by-frequency

 
 


About

This repo contains lists of English words extracted from Wikipedia articles. The lists were built by Lexipedia.

  • wikipedia_words.zip: over 2.9M words. Contains all words from English Wikipedia articles, including nouns, pronouns, etc. Also contains non-English words.

  • wiktionary_words.zip: 280k words. Contains only words found in English Wiktionary.

Each line has 4 values separated by spaces: word, length, frequency, and document frequency (the number of Wikipedia articles in which the word occurs).
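
The line format described above could be parsed along these lines. This is a minimal sketch, not part of the repo: the names `WordEntry`, `parse_entries`, and `iter_zip_lines` are hypothetical, the sample lines are made up for illustration, and the name of the text file inside each zip is not stated here, so the helper simply reads the first archive member and assumes UTF-8.

```python
import io
import zipfile
from typing import Iterable, Iterator, NamedTuple


class WordEntry(NamedTuple):
    word: str
    length: int          # number of characters in the word
    frequency: int       # total occurrences
    doc_frequency: int   # number of articles containing the word


def parse_entries(lines: Iterable[str]) -> Iterator[WordEntry]:
    """Yield one WordEntry per well-formed line; skip anything else."""
    for line in lines:
        parts = line.split()
        if len(parts) != 4:
            continue  # blank or malformed line
        word, length, freq, doc_freq = parts
        try:
            yield WordEntry(word, int(length), int(freq), int(doc_freq))
        except ValueError:
            continue  # a numeric field was not an integer


def iter_zip_lines(zip_path: str) -> Iterator[str]:
    """Assumption: the first archive member is the UTF-8 word list."""
    with zipfile.ZipFile(zip_path) as zf:
        member = zf.namelist()[0]
        with zf.open(member) as raw:
            yield from io.TextIOWrapper(raw, encoding="utf-8")


# Made-up sample lines in the documented format, for illustration only.
sample = ["the 3 1000 900", "", "cat 3 50 40", "bad line"]
entries = list(parse_entries(sample))
```

To read a real archive, something like `parse_entries(iter_zip_lines("wiktionary_words.zip"))` should work, provided the assumptions about the archive layout hold.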
