Skip to content
Extract data from an HTML table and store results to a csv file.
Python
Find file
Latest commit 730fb6d @hernamesbarbara Merge pull request #2 from Candunc/master
Changed the seperator to a comma, removed some errors

README.rst

table2csv

Simple script for downloading html tables as csv.

Installation

pip install -U table2csv

Usage

table2csv http://en.wikipedia.org/wiki/List_of_Super_Bowl_champions > dump.txt
python -m table2csv.main http://en.wikipedia.org/wiki/List_of_Super_Bowl_champions > dump.txt

Use --nth=[int] to grab a certain table from the page.

Features

  • accepts a URL
  • Identifies all the tables
  • Merges tables that share same structure (e.g. same column headers get merged)
  • Figures out which table is the biggest
  • extracts text
  • extracts links

TODO

  • detect the data types found within each column
  • add support for tables with hierarchical indices on the rows and/or columns

View on Github

Something went wrong with that request. Please try again.