Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unexpected error using spline agent 0.7 with AWS Glue 3.0 #346

Closed
wajda opened this issue Nov 1, 2021 Discussed in #345 · 3 comments
Closed

Unexpected error using spline agent 0.7 with AWS Glue 3.0 #346

wajda opened this issue Nov 1, 2021 Discussed in #345 · 3 comments
Labels
bug Something isn't working
Milestone

Comments

@wajda
Copy link
Contributor

wajda commented Nov 1, 2021

Discussed in #345

Originally posted by araujo-joao October 29, 2021
Hi guys,

I've been using a custom version of the spline agent for collecting data lineage from AWS Glue spark jobs.
Now AWS Glue released a new version that uses spark 3.1 so I decided to try to update the spline agent to 0.7.0, which supports it.

I'm getting an
ERROR SplineQueryExecutionListener: Unexpected error occurred during lineage processing for application ...
and after digging a bit I could retrieve a better error message:
java.lang.NumberFormatException: For input string: "op-0"

After looking at the code, this seems to be related with the idGenerators created on the latest versions of the spline agent, but unfortunately my knowledge of scala isn't sufficient to understand why this is breaking the spline agent.

Could someone give a hint on how to solve or further debug this issue?

Thanks in advance!

Edit: replacing "spline" with "spline agent" since that's the only component used to get this error.

@wajda wajda added the bug Something isn't working label Nov 1, 2021
@wajda wajda added this to the 0.7.1 milestone Nov 1, 2021
@wajda wajda closed this as completed Nov 3, 2021
@jaldaadithya
Copy link

Guys, could you please tell us how did you pass the listener and producer URLs in the AWS glue job
For us, its not even reflected in the logs that AWS Glue is using spline Listener

@wajda
Copy link
Contributor Author

wajda commented May 19, 2022

@aman3389
Copy link

you can add job parameters in advanced options

spark.spline.lineageDispatcher.http.producer.url=http://:8080/producer --conf spark.spline.mode=REQUIRED --conf spark.sql.queryExecutionListeners=za.co.absa.spline.harvester.listener.SplineQueryExecutionListener

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: Done
Development

No branches or pull requests

3 participants