Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EMR Serverless Adding Option to Boto3 for Glue Catlog #53

Closed
soumilshah1995 opened this issue May 9, 2023 · 1 comment
Closed

EMR Serverless Adding Option to Boto3 for Glue Catlog #53

soumilshah1995 opened this issue May 9, 2023 · 1 comment

Comments

@soumilshah1995
Copy link

soumilshah1995 commented May 9, 2023

AWS has announced AWS EMR CLI

https://aws.amazon.com/blogs/big-data/build-deploy-and-run-spark-jobs-on-amazon-emr-with-the-open-source-emr-cli-tool/

 

I have tried and CLi works great simplifies submitting jobs
image

However, could you tell us how to enable the Glue Hive meta store when submitting a job via CLI or in Boto3
i have looked at documentation i don't see an argument for supplying use Glue CatLog option on boto3

image

Here is a sample of how we are submitting jobs qith EMR-CLI

emr run     --entry-point entrypoint.py
    --application-id     --job-role <arn>
    --s3-code-uri s3:///emr_scripts/     --spark-submit-opts "--conf spark.jars=/usr/lib/hudi/hudi-spark-bundle.jar --conf spark.serializer=org.apache.spark.serializer.KryoSerializer"
    --build `
    --wait

 

Created A Github Issue awslabs/amazon-emr-cli#18

If you can kindly get back to us on issue that would be great 😃

@Neubauer-A
Copy link

I've been able to get this to work by specifying in sparkSubmitParameters. For example:

job_run = client.start_job_run(
    applicationId=application_id,
    executionRoleArn=job_role_arn,
    jobDriver={
        'sparkSubmit': {
            'entryPoint': 's3://bucket/your_script.py',
            'entryPointArguments': [],
            'sparkSubmitParameters': '--conf spark.hadoop.hive.metastore.client.factory.class=com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory'
        },
    }
)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants