From 70c71edd8f3ec11be352044e8eee12f46a61e088 Mon Sep 17 00:00:00 2001 From: Sebastian Nagel Date: Thu, 15 Sep 2016 10:35:50 +0200 Subject: [PATCH] add note how to install requirements in Hadoop cluster --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index a8a39bc..6b11366 100644 --- a/README.md +++ b/README.md @@ -20,6 +20,8 @@ In addition, a bash script `index_env.sh` is used to specify all the relevant pa You can simply run `cp index_env.sample.sh index_env.sh` to copy the provided sample. Please refer to the file for more details and to fill in the actual paths. Note that on EMR paths to AWS S3 have to be given as `s3://bucket/path` while on Hadoop (no EMR) paths must start with `s3a://`. +Requirements have to be installed on all nodes of the cluster. The script `bootstrap.sh` installs everything needed on EMR, including Python and packages necessary to compile the requirements. + #### Local No additional setup is necessary. See [building a local cluster](#building-a-local-cluster).