Deployed at https://learnddit-v3.herokuapp.com. If the link or app does not work, the most likely cause is that our Google Cloud Platform trial has expired. We store several years of Reddit comments there because Heroku had a 10k limit and we need to store around 300k comments.
This project was done for Cornell's CS4300, a course on information retrieval. Learnddit is an app that searches /r/IWantToLearn and retrieves the comments most relevant to the search query. This differs from a regular Reddit search, which only does a boolean search on post titles, does not search comments, and does not weight terms. We rank comments by cosine similarity over tf-idf scores, and use NLTK to identify the more important query terms so that they can be weighted more heavily.
- Flask
- ReactJS
- NLTK
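
The ranking idea described above (tf-idf vectors compared by cosine similarity, with some query terms weighted more heavily) can be illustrated with a minimal standard-library sketch. This is not the project's actual code: the function names, the whitespace tokenizer, the smoothed idf formula, and the `boosts` dictionary are all assumptions made for illustration. In particular, `boosts` is a hand-written stand-in for whatever term importance NLTK assigns in the real app.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive lowercase whitespace tokenizer (an assumption; the app may use NLTK here).
    return text.lower().split()


def build_idf(docs):
    # Smoothed inverse document frequency for every term seen in the corpus.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}


def tfidf_vector(text, idf, boosts=None):
    # Sparse tf-idf vector as a dict; terms missing from the corpus are dropped.
    # `boosts` (hypothetical) multiplies the weight of selected terms.
    boosts = boosts or {}
    counts = Counter(tokenize(text))
    return {
        term: count * idf[term] * boosts.get(term, 1.0)
        for term, count in counts.items()
        if term in idf
    }


def cosine(a, b):
    # Cosine similarity between two sparse vectors; 0.0 if either is empty.
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, docs, boosts=None):
    # Return (score, doc_index) pairs sorted from most to least similar.
    idf = build_idf(docs)
    query_vec = tfidf_vector(query, idf, boosts)
    scored = [(cosine(query_vec, tfidf_vector(doc, idf)), i) for i, doc in enumerate(docs)]
    return sorted(scored, reverse=True)
```

For example, calling `rank("learn guitar", comments, boosts={"guitar": 3.0})` on a list of comment strings would favor comments that mention "guitar" over those that only mention "learn". A real deployment over roughly 300k comments would precompute the document vectors rather than rebuild them per query.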