feat: EMR Serverless Stage #390

Merged: 14 commits, Jul 11, 2023

@malachi-constant malachi-constant commented Jul 10, 2023

Closes #384

Todo

  • Implementation
  • Testing
  • Docs
  • Unit tests

Details

  • Adds a Step Functions (SFN) stage that triggers a job on an EMR Serverless application and polls the job run until it finishes
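
For readers unfamiliar with the trigger-and-poll pattern, the following is a minimal sketch of what such a state machine could look like in Amazon States Language, built as a plain TypeScript object. It is not the definition this PR generates: the function name `buildTriggerAndPollDefinition`, the state names, the `pollSeconds` option, and the `$.start` / `$.status` result paths are all hypothetical, and the `aws-sdk:emrserverless` integration ARNs are written as I understand the Step Functions AWS SDK integration naming.

```typescript
// Sketch only: a hand-written Amazon States Language (ASL) definition.
// It is NOT the definition that EMRServerlessJobStage produces.
type AslState = Record<string, unknown>;

interface JobStageInput {
  applicationId: string;
  executionRoleArn: string;
  jobDriver: Record<string, unknown>;
  pollSeconds?: number; // hypothetical option, defaults to 30 below
}

function buildTriggerAndPollDefinition(input: JobStageInput) {
  const jobState = "$.status.JobRun.State";
  const states: Record<string, AslState> = {
    // 1. Trigger: submit the job run and keep its id under $.start.
    StartJobRun: {
      Type: "Task",
      Resource: "arn:aws:states:::aws-sdk:emrserverless:startJobRun",
      Parameters: {
        ApplicationId: input.applicationId,
        ExecutionRoleArn: input.executionRoleArn,
        JobDriver: input.jobDriver,
        "ClientToken.$": "States.UUID()",
      },
      ResultPath: "$.start",
      Next: "Wait",
    },
    // 2. Poll: wait, fetch the job run, then branch on its state.
    Wait: { Type: "Wait", Seconds: input.pollSeconds ?? 30, Next: "GetJobRun" },
    GetJobRun: {
      Type: "Task",
      Resource: "arn:aws:states:::aws-sdk:emrserverless:getJobRun",
      Parameters: {
        ApplicationId: input.applicationId,
        "JobRunId.$": "$.start.JobRunId",
      },
      ResultPath: "$.status",
      Next: "CheckState",
    },
    CheckState: {
      Type: "Choice",
      Choices: [
        { Variable: jobState, StringEquals: "SUCCESS", Next: "Succeeded" },
        { Variable: jobState, StringEquals: "FAILED", Next: "Failed" },
        { Variable: jobState, StringEquals: "CANCELLED", Next: "Failed" },
      ],
      // Any other state (e.g. PENDING, RUNNING) loops back to Wait.
      Default: "Wait",
    },
    Succeeded: { Type: "Succeed" },
    Failed: { Type: "Fail", Error: "JobRunFailed" },
  };
  return { StartAt: "StartJobRun", States: states };
}
```

Writing results under separate paths (`$.start`, `$.status`) keeps the job run id available on every pass through the loop, which is why the sketch does not let `GetJobRun` overwrite the whole state input.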

Testing

Tested with the infrastructure created in #385 and the following code:

import * as cdk from "aws-cdk-lib";
import * as ddk from "aws-ddk-core";

const app = new cdk.App();
const stack = new cdk.Stack(app, "EmrServerlessTest");

// EMR Serverless application created with the construct from #385.
const emrCluster = new ddk.EMRServerlessCluster(stack, "MyEmrApp", {
  releaseLabel: "emr-6.11.0",
  type: "SPARK",
  vpcCidr: "10.60.0.0/16",
});

// Stage that submits the Spark job to the application and polls it.
new ddk.EMRServerlessJobStage(
  stack,
  "EMR Serverless Job",
  {
    applicationId: emrCluster.emrServerlessApplication.ref,
    executionRoleArn: emrCluster.role.roleArn,
    jobDriver: {
      SparkSubmit: {
        // Script and output location live in the bucket created with the cluster.
        EntryPoint: `s3://${emrCluster.s3Bucket.bucketName}/scripts/wordcount.py`,
        EntryPointArguments: [`s3://${emrCluster.s3Bucket.bucketName}/emr-serverless-spark/output`],
        SparkSubmitParameters: "--conf spark.executor.cores=1 --conf spark.executor.memory=4g --conf spark.driver.cores=1 --conf spark.driver.memory=4g --conf spark.executor.instances=1",
      },
    },
  },
);
[Screenshot attached: 2023-07-10, 6:14 PM]
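
The `SparkSubmitParameters` value in the test code is a single long string of `--conf` flags. As a readability aid, here is a small sketch of a helper that assembles that string from a key/value map. The helper `toSparkSubmitParameters` is hypothetical and not part of aws-ddk-core; it only handles `--conf key=value` pairs.

```typescript
// Hypothetical helper (not part of aws-ddk-core): turns a map of Spark
// settings into a string of "--conf key=value" flags, in insertion order.
function toSparkSubmitParameters(conf: Record<string, string | number>): string {
  return Object.entries(conf)
    .map(([key, value]) => `--conf ${key}=${value}`)
    .join(" ");
}

// The same settings used in the test code above.
const sparkSubmitParameters = toSparkSubmitParameters({
  "spark.executor.cores": 1,
  "spark.executor.memory": "4g",
  "spark.driver.cores": 1,
  "spark.driver.memory": "4g",
  "spark.executor.instances": 1,
});
```

The resulting `sparkSubmitParameters` string could then be passed as `SparkSubmitParameters` inside `jobDriver.SparkSubmit`.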

@malachi-constant malachi-constant added this to the 1.2.0 milestone Jul 10, 2023
@malachi-constant malachi-constant self-assigned this Jul 10, 2023
@malachi-constant malachi-constant linked an issue Jul 10, 2023 that may be closed by this pull request
@malachi-constant malachi-constant marked this pull request as draft July 10, 2023 22:16
Signed-off-by: github-actions <github-actions@github.com>
@malachi-constant

AWS CodeBuild CI Report

  • CodeBuild project: DDKIntegrationTestsD8472609-5SCHWjldmWcf
  • Commit ID: 89d80f1
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

malachi-constant and others added 5 commits July 10, 2023 19:45
Signed-off-by: github-actions <github-actions@github.com>
Signed-off-by: github-actions <github-actions@github.com>
@malachi-constant

AWS CodeBuild CI Report

  • CodeBuild project: DDKIntegrationTestsD8472609-5SCHWjldmWcf
  • Commit ID: 9af8149
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)


@malachi-constant

AWS CodeBuild CI Report

  • CodeBuild project: DDKIntegrationTestsD8472609-5SCHWjldmWcf
  • Commit ID: 00b9d06
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)


@malachi-constant malachi-constant merged commit cfd2252 into main Jul 11, 2023
5 checks passed
@malachi-constant malachi-constant deleted the 384-core-emr-serverless-stage branch July 11, 2023 21:14
Development

Successfully merging this pull request may close these issues.

core: EMR Serverless Stage