This project scrapes REI kayak product pages for product info and reviews. Kayaks vary considerably in design and what seems like a small change can drastically change how a kayak performs in different conditions. Even though a single kayak can be used in different settings (for example, whitewater kayaking versus open lake kayaking), different designs perform better in different settings. Performance here means speed, stability, and maneuverability. The goal of my project is to analyze the different technical specifications (dimensions, ratings, price, and advertised primary use (flatwater, whitewater, racing, sea, etc.)) and see how reviews compare.
The results of this project can be seen here: https://nycdatascience.com/blog/student-works/web-scraping-without-a-paddle/
References:
https://towardsdatascience.com/end-to-end-topic-modeling-in-python-latent-dirichlet-allocation-lda-35ce4ed6b3e0 https://www.analyticsvidhya.com/blog/2016/08/beginners-guide-to-topic-modeling-in-python/ https://www.analyticsvidhya.com/blog/2018/10/stepwise-guide-topic-modeling-latent-semantic-analysis/ https://www.analyticsvidhya.com/blog/2018/10/mining-online-reviews-topic-modeling-lda/ https://en.wikipedia.org/wiki/Kayak