Beer Scraper in Scala
Simple scrapper, developed using
scalatest to work in TDD.
Scrapping the website http://craftcans.com/db.php?search=all&sort=beerid&ord=desc&view=text.
Followed the tutorial http://blog.kaggle.com/2017/01/31/scraping-for-craft-beers-a-dataset-creation-tutorial/ to clean the data (work in progress)
How does it work?
Run it yourself!
git clone https://github.com/ColinLeverger/beer-scraper-scala` sbt test sbt run
What about the process?
- Connect to the website and download the HTML
- Parse it with the library
- Create a list of
Beers case class objects
- Write CSV
See this file. Variables cleaned.