Skip to content

Conversation

@rlittle08
Copy link
Collaborator

@rlittle08 rlittle08 commented Aug 23, 2023

The dedupe here should include tenant & year, to match the unique key of k_discipline_incident.

Tested on Boston, where this is needed because of their multiyear ODS -- for years 2017-2022, all of their student_discipline_incident_associations records are duplicated for each api_year, because we run repeated pulls on the multiyear ODS with different values for api_year.

Without including tenant & api_year in the dedupe, this dedupe just takes one record, assigned to whichever api_year is the last we pulled. We instead want all records to flow through stg, and handle the duplicates downstream (either with boston-specific queries, or a feature that removes incident dates that fall outside the school year)

@ejoranlienea ejoranlienea merged commit ce27edc into main Aug 24, 2023
@ejoranlienea ejoranlienea deleted the bugfix/stg_stu_behavior_dedupe_ty branch August 24, 2023 21:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants