Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix the data-lakehouse-iceberg-trino-spark demo #62

Closed
Tracked by #647
NickLarsenNZ opened this issue Jul 19, 2024 · 5 comments
Closed
Tracked by #647

Fix the data-lakehouse-iceberg-trino-spark demo #62

NickLarsenNZ opened this issue Jul 19, 2024 · 5 comments
Assignees

Comments

@NickLarsenNZ
Copy link
Member

This came out of the release upgrade testing: #59 (comment)

Spark

No visualization information available for the streaming job run. but that seems to only be because the jobs completed. The active job show a visualisation, as does the completed water_level job.

Job list Missing visualization
Job list Missing visualization

You can see exceptions on the right, and the jobs didn't seem to run (0 ms duration).

Postgres

There was also some warnings from the postgres helm install:

# Postgres gave the following warnings:
# WARNING: There are "resources" sections in the chart not set. Using "resourcesPreset" is not recommended for production. For production installations, please set the following values according to your workload needs:
#     - primary.resources
#     - readReplicas.resources
@mbaeuerle
Copy link

This might not be related but the URL to the bike status endpoint is not reachable any longer: https://ip-api.mobidata-bw.de/v1/NVBW/gbfs/v2/free_bike_status.json
Used in the shared bike status: https://github.com/stackabletech/demos/blob/main/demos/data-lakehouse-iceberg-trino-spark/LakehouseKafkaIngest.xml#L1388

@NickLarsenNZ
Copy link
Member Author

This might not be related but the URL to the bike status endpoint is not reachable any longer: https://ip-api.mobidata-bw.de/v1/NVBW/gbfs/v2/free_bike_status.json Used in the shared bike status: https://github.com/stackabletech/demos/blob/main/demos/data-lakehouse-iceberg-trino-spark/LakehouseKafkaIngest.xml#L1388

Thanks for the heads-up @mbaeuerle. I'll take a look into this in the lead up to the SDP 24.11 release.

@xeniape
Copy link
Member

xeniape commented Oct 23, 2024

@NickLarsenNZ I would argue the postgres warning was related to the postgres upgrade process and not the demo itself. At least I didn't see any errors or warnings in the running postgres instances while solving this issue. Do you have more information on what was impacted by that and does that need addressing as part of this issue?

Other than that I submitted a PR for the fix of the endpoints fetching the data. Kafka topics are created as a result and the spark streaming queries are running fine now. #115

@NickLarsenNZ
Copy link
Member Author

NickLarsenNZ commented Oct 24, 2024

Thanks for the endpoint fix @xeniape.

@NickLarsenNZ
Copy link
Member Author

NickLarsenNZ commented Oct 24, 2024

Marked as closed because:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants