Simple scripts to setup a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on an EC2
Shell Python
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.

Data Science Box

I often have to spin-up EC2 instances to do various data science(y) things with said instance. These scripts are the result of having done that many times and needing many of the same tools to be available on those boxes.

Running these scripts will turn an Ubuntu 12.04.* LTS 64-bit server (tested) running on EC2 into a fully functioning data science box. Along with several base development libraries, the scripts installs and configures:


Stick these two scripts in the same directory on your freshly deployed instance and type:

$ ./

Then follow the on-screen instructions to configure the software.


The script takes care of all the on-box configuration, but if you wish to access RStudio Server, shiny-server, and IPython notebook server via a browser you will need to make sure the Security Groups for the server you deploy allows in-bound traffic to the follow ports (defaults):

  • RStudio: 8787
  • shiny-server: 3838
  • IPython notebook: 8888