Skip to content

ppouke/NLPProject

Repository files navigation

Requirements:
- NLTK
   - Stopwords, Reuters
- BNC Corpus ("download" folder) as per project instructions
- WordNet as per project instructions ("wn-domains-3.2" folder)

Running main.py:
- Change variable "usingReut" to determine if Reuters or BNC Corpus is used
- At the bottom of the file, comment and uncomment the parts you do and don't want to run/print

When running first time, stopworded.pkl or stopwordedReut.pkl will be generated. Subsequent runs will utilize these saved files and save processing time.

About

Oulu university NLP project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages