Question Recommendation and Topic Modelling engine.
The application is written in Python.
The following libraries were used for this project:
- NumPy - Basic linear algebra
- SciPy - Basic linear algebra
- Gensim - Topic Modelling
- nltk - Natural Language Processing and stop words
- Stack Overflow API - Collection of ~40,000 questions retrieved from the Stack Overflow website.
- scikit-learn - Machine Learning library (Document Clustering)
- CGI - Web page was hosted on Apache server using Python CGI.
Algorithms used here:
- Top-N Recommendation.
- TF-IDF Vector Similarity.
- Latent Dirichlet Allocation (LDA)
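
To illustrate how the first two algorithms fit together, here is a minimal sketch of Top-N recommendation over TF-IDF vectors using cosine similarity. It is not the project's actual code: it uses only the Python standard library so it runs without dependencies, whereas the project itself relied on scikit-learn and Gensim. The function names (`tokenize`, `tfidf_vectors`, `cosine`, `top_n_similar`), the smoothed IDF formula, and the sample questions are all assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on whitespace. A real pipeline would also strip
    # punctuation and stop words (for example with nltk).
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an assumed variant) so every term keeps a positive weight.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1 for t, c in df.items()}
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({t: freq * idf[t] for t, freq in tf.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_n_similar(docs, query_index, n=2):
    """Return the n (index, score) pairs most similar to docs[query_index]."""
    vectors = tfidf_vectors(docs)
    query = vectors[query_index]
    scores = [
        (i, cosine(query, v))
        for i, v in enumerate(vectors)
        if i != query_index
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:n]


# Hypothetical sample questions, not taken from the collected data set.
questions = [
    "how to sort a list in python",
    "python sort list of tuples by second item",
    "undefined reference error when linking c++ program",
    "how to reverse a list in python",
]
recommendations = top_n_similar(questions, query_index=0, n=2)
for index, score in recommendations:
    print(f"{score:.3f}  {questions[index]}")
```

For the first question, the two Python list questions are ranked ahead of the C++ linking question, which shares no terms with the query and therefore scores zero.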