Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use with AWS Glue? #103

Closed
kasunnirmala opened this issue Jul 18, 2022 · 9 comments
Closed

How to use with AWS Glue? #103

kasunnirmala opened this issue Jul 18, 2022 · 9 comments
Labels
question Further information is requested

Comments

@kasunnirmala
Copy link

How to add the extension and the way to use it in AWS Glue? Either Notebook or shell script

@EnricoMi
Copy link
Contributor

You should be able to use this as any other Spark package. Try to find out how this is done in general, it should apply to spark-extension as well.

https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-python-libraries.html#aws-glue-programming-python-libraries-job
https://stackoverflow.com/questions/53382166/how-to-import-spark-packages-in-aws-glue

@EnricoMi EnricoMi added the question Further information is requested label Jul 19, 2022
@EnricoMi
Copy link
Contributor

@leewalter78 do you have some suggestions, looks like you have used this with Glue.

@leewalter78
Copy link

#64 (comment)

@kasunnirmala
Copy link
Author

image

I get an error something like this.

@EnricoMi
Copy link
Contributor

You are using Python? Which spark-extension version did you use? Which Glue version?

Do you have step-by-step instructions for me to reconstruct this error on Glue?

@kasunnirmala
Copy link
Author

You are using Python? Which spark-extension version did you use? Which Glue version?

Do you have step-by-step instructions for me to reconstruct this error on Glue?

I was using spark-extension_2.12:2.1.0-3.3 with Glue 3.0, and python 3

@EnricoMi
Copy link
Contributor

Glue 3.0 uses Spark 3.1, so you should use spark-extension_2.12:2.1.0-3.1, not ...-3.3.

@kasunnirmala
Copy link
Author

Glue 3.0 uses Spark 3.1, so you should use spark-extension_2.12:2.1.0-3.1, not ...-3.3.

@EnricoMi Thanks. And you have inserted the s3 path of jar file for Python library path or Dependent JARs path ?

@EnricoMi
Copy link
Contributor

Glue 3.0 uses Spark 3.1, so you should use spark-extension_2.12:2.1.0-3.1, not ...-3.3.

@EnricoMi Thanks. And you have inserted the s3 path of jar file for Python library path or Dependent JARs path ?

As Python library path. I uploaded the jar file as is.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants