Skip to content

datitran/emr-bootstrap-pyspark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EMR Bootstrap PySpark with Anaconda

This code should help to jump start PySpark with Anaconda on AWS.

Getting Started

  1. conda env create -f environment.yml
  2. Fill in all the required information e.g. aws access key, secret acess key etc. into the config.yml.example file and rename it to config.yml
  3. Run it python emr_loader.py

Requirements

Copyright

See LICENSE for details. Copyright (c) 2016 Dat Tran.