-
How to execute a Spark application via EMR step non-interactively, examples with spark-submit
-
Using Spark HiveContext with cluster deployment mode on YARN
-
Reading input with custom Hadoop input formats, newAPIHadoopFile() (Java)
-
Using Hive and SparkSQL together with Parquet storage on EMR AMI 3.x
- AWS CLI installation and configuration, http://docs.aws.amazon.com/cli/latest/userguide/cli-chap-welcome.html
- AWS EMR documentation, http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/
- AWS EMR AMI version information, http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/ami-versions-supported.html
- Running Spark on YARN, https://spark.apache.org/docs/latest/running-on-yarn.html