Skip to content

avinassh/gre-classics

Repository files navigation

An attempt to find out frequency of Top 800 GRE words in Classic books.

#Requirements:

  • Check requirements.txt. Install it using pip:

      pip install -r requirements.txt
    
  • NTLK WordNet corpus

#Log:

  1. 16/Feb/2014:

    • High frequency words and corpus, were not lemmatized
    • Output is saved in output.json
  2. 18/Feb/2014:

    • High frequency words and corpus both were lemmatized

    • Lemmatization was done using WordNetLemmatizer:

        from nltk.stem import WordNetLemmatizer
        wnl = WordNetLemmatizer()
        wnl.lemmatize(word)
      
    • Output is saved in output-wnl.json

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages