Tidybrowse is a meta package containing different packages easing web scrapping and the deployment of docker containers from R.
A list of use-cases
- Searching Citations on Google Scholar: Use Selenium to scrape citations from Google Scholar
# install.packages("devtools")
devtools::install_github("benjaminguinaudeau/tidybrowse")
dockeR wraps up docker command line tools and allows to manage docker containers from R. It can be use to deploy selenium servers, shiny-app servers, rstudio-servers, etc…
This wraps up RSelenium function in a pipable way. It also offers function to easily communicate with a selenium server running inside a docker container.
Tidyweb allows to represent xml-tree data in a tidy way. It works as well with xml-nodes as with selenium elements.
Selinput wraps up the python library pyautogui, which emulates mouse and keyboard input. It allows to easily type, click and scroll inside a docker container, with a running selenium server.
library(tidybrowse)
#> ── Attaching packages ────────────────────────────────────── tidybrowse 0.0.1 ──
#> ✓ RSelenium 1.7.7 ✓ tidyweb 0.0.0.9000
#> ✓ dockeR 0.1.0 ✓ selinput 0.0.0.9000
#> ✓ tidyselenium 0.0.0.9000 ✓ rvest 0.3.6
#> ── Conflicts ───────────────────────────────────────── tidybrowse_conflicts() ──
#> x dockeR::system() masks base::system()
A huge thank you to Favstats for designing each of the hex-stickers.