Skip to content

FilipeTheAnalyst/boardgames_xml_api

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BoardGameGeek BoardGames Web Scraping

I made this project to learn how to use Scrapy framework with Python to perform web scraping. The developed spider extracts all the board games from BoardGameGeek website and exported into a csv file (around 137 thousand games) with the following data:

  • Id
  • Rank
  • Name
  • Url
  • Rating
  • Number of Votes
  • Year
  • Description
  • Date

Use Boardgamegeek XML API to scrape data details from boardgames

I use the following code here to scrape data using XML API from Boardgamegeek website to get the following details for each game that was previously web scraped from their website:

  • Thumbnail
  • Language dependency
  • Minimum number of players
  • Maximum number of players
  • Best player number
  • Minimum playing time
  • Maximum playing time
  • Number of users rating the game
  • Number of users that own the game
  • Number of comments
  • Number of votes for the weight of the game

Prerequisites to run the project

Need to have installed the following Python packages:

  1. scrapy
  2. pandas
  3. beautifulsoup4

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages