Skip to content

crubba/htmltab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

htmltab: Hassle-free HTML tables in R

HTML tables are a valuable data source but extracting and recasting these data into a useful format can be tedious. htmltab is a package for extracting structured information from HTML tables. It is similar to readHTMLTable() of the XML package but provides two major advantages. First, the function automatically expands row and column spans in the header and body cells. Second, users are given more control over the identification of header and body rows which will end up in the R table. Additionally, the function preprocesses table code, removes unneeded parts and so helps to alleviate the need for tedious post-processing.

Installation

The package is available from CRAN and Github. For the stable release version, download from CRAN:

install.packages("htmltab")

For the developer version, download from my GitHub repo. You can install the package directly from inside R:

install.packages("devtools")
devtools::install_github("crubba/htmltab")

Usage

To see htmltab in action, take a look at the case studies in the package vignette, this blog post or consult the package manual.

Travis status

travis status

Report issues

If you experience problems with htmltab, I would like to hear about it to improve the project. Please use my github repo to report the issue.

About

An R package for assembling data frames from HTML tables (fka htmltable)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages