Can be run in cluster mode with Mesos, YARN or standalone mode:
GCP provides Dataproc for Spark, AWS provides EMR
RDD - Resilient Distributed Dataset
Can be run in cluster mode with Mesos, YARN or standalone mode:
GCP provides Dataproc for Spark, AWS provides EMR
RDD - Resilient Distributed Dataset