Skip to content

Spark jobs in EMR on EKS through Spark Operator and spark-submit #54365

@adityamohan93

Description

@adityamohan93

Description

In Airflow, for EMR on EKS virtual cluster, EmrContainerOperator is advised to start a new Spark job. From doc, seems it needs sparkSubmitJobDriver as input[1] where we provide location for script. EMR on EKS also support Spark Operator to submit jobs[2]. Are these supported in Airflow because from documentation seems it doesn't?

  1. https://airflow.apache.org/docs/apache-airflow-providers-amazon/stable/operators/emr/emr_eks.html#submit-a-job-to-an-amazon-emr-virtual-cluster
  2. https://docs.aws.amazon.com/emr/latest/EMR-on-EKS-DevelopmentGuide/spark-operator-gs.html

Use case/motivation

Run Spark job on AWS EMR on EKS. This question is relevant to anyone doing the same.

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions