New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Broken relationship between audio and audio_set
at ingestion in the production API DB
#1534
Comments
audio_set
at ingestion in the production API DB
@obulat Thanks for investigating and finding the root cause of the issue! I changed the linking of issues before noticing you had updated this. Do you believe that the data refresh process will automatically update the existing data or do we have to do it manually? We can close this once confirming the existing audio files are correctly linked. |
I'm not sure if the data refresh DAG updates the materialized view. I'm sure @stacimc knows, though :) If it does, then a data refresh will fix the issue automatically. Otherwise, we will need to refresh the matview separately, before doing the data refresh. |
On a regular data refresh, the matview gets refreshed. This is the SQL that runs: https://github.com/WordPress/openverse-catalog/blob/main/openverse_catalog/dags/common/popularity/sql.py#L374 I'm not sure I follow what needs to happen here -- do you just need the view refreshed, or does a column need to be added? The If a view refresh is all that's needed, then yes this would be resolved with a data refresh. The data refreshes are not currently running, however (pending API latency investigation). |
Thank you for clarification, @stacimc! Do we normally run this manually? |
@obulat yes that's not scheduled, only triggered manually in airflow. |
Is this resolved from a code perspective? Or do we want to wait to close this until we've run |
Yes, it remains open until confirmed it's fixed once we run the data refresh again. |
We've added the column to the upstream catalog view, run an audio data refresh, and can now see
With that I believe this issue can be closed 😄 |
Problem
The API was expecting
audio_set_foreign_id
at the top level, but the catalog did not expose it. It was a part of theaudio_set
JSONB column.Description
We should add the
audio_set_foreign_id
to the materialized view.The text was updated successfully, but these errors were encountered: