Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
src
 
 
 
 
 
 
 
 
 
 

README.md

Beer Scraper in Scala

Simple scrapper, developed using scala 2.12.1, ruippeixotog/scala-scraper and scalatest.

Use of FlatSpecs from scalatest to work in TDD.

Scrapping the website http://craftcans.com/db.php?search=all&sort=beerid&ord=desc&view=text.

Followed the tutorial http://blog.kaggle.com/2017/01/31/scraping-for-craft-beers-a-dataset-creation-tutorial/ to clean the data (work in progress)

How does it work?

Run it yourself!

git clone https://github.com/ColinLeverger/beer-scraper-scala`
sbt test
sbt run

What about the process?

  1. Connect to the website and download the HTML
  2. Parse it with the library scala-scraper
  3. Create a list of Beers case class objects
  4. Write CSV
  5. Done!

Sample output

See this file. Variables cleaned.

About

Beer Scraper in Scala

Topics

Resources

Releases

No releases published

Packages

No packages published

Languages

You can’t perform that action at this time.