Skip to content
Nishikori's tennis match data
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data
LICENSE
README.md
ScrapeNishikori.py

README.md

NishikoriBoardData

This script gathers comment data of Kei Nishikori comment board and formats it as match data and outputs it as csv file.

錦織実況掲示板 Nishikori comment board https://jbbs.shitaraba.net/sports/34934/

Collected data

Year2016-2018 https://github.com/taikoma/NishikoriBoardData/tree/master/data

Match Data

The csv file contains the following data.

Point by point data.

  • Serve direction.
  • Ace or DoubleFault
  • 1stServe or 2ndServe
  • Server
  • ServeSpeed
  • Won or Lost
  • Score
  • Side

The Picture below is a screenshot of the csv file. default

Requirement

  • python3
  • urllib2
  • re
  • unicodedata
  • pandas
  • BeautifulSoup
  • selenium
  • lxml.html
  • time

Usage

  1. Clone this repository or download ScrapeNishikori.py to your working directory.

  2. Before executing the ScrapeNishikori.py, open the file and edit the following two.

  3. Change to the url of the bulletin board you want to obtain data

url = "https://jbbs.shitaraba.net/bbs/read.cgi/sports/34934/1547509776/"
  1. Change the filename of the output file
df.to_csv("201901Australian.csv")
  1. Execute the script file
python ScrapeNishikori.py
You can’t perform that action at this time.