Skip to content

This project scrapes kayak product data, then analyzes the data for differences between types of kayaks and consumer preferences.

Notifications You must be signed in to change notification settings

bbotzheim/kayak_scraper

Repository files navigation

kayak_scraper

This project scrapes REI kayak product pages for product info and reviews. Kayaks vary considerably in design and what seems like a small change can drastically change how a kayak performs in different conditions. Even though a single kayak can be used in different settings (for example, whitewater kayaking versus open lake kayaking), different designs perform better in different settings. Performance here means speed, stability, and maneuverability. The goal of my project is to analyze the different technical specifications (dimensions, ratings, price, and advertised primary use (flatwater, whitewater, racing, sea, etc.)) and see how reviews compare.

The results of this project can be seen here: https://nycdatascience.com/blog/student-works/web-scraping-without-a-paddle/

References:

https://towardsdatascience.com/end-to-end-topic-modeling-in-python-latent-dirichlet-allocation-lda-35ce4ed6b3e0 https://www.analyticsvidhya.com/blog/2016/08/beginners-guide-to-topic-modeling-in-python/ https://www.analyticsvidhya.com/blog/2018/10/stepwise-guide-topic-modeling-latent-semantic-analysis/ https://www.analyticsvidhya.com/blog/2018/10/mining-online-reviews-topic-modeling-lda/ https://en.wikipedia.org/wiki/Kayak

About

This project scrapes kayak product data, then analyzes the data for differences between types of kayaks and consumer preferences.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published