@AngelicaLastra

Description & motivation

This PR adds base and staging models for learning standards data, to be used downstream by the student grade tables.

Changes to existing models:

  • _edfi_3__base and _edfi_3__stage: Added configurations for new base and staging models.

New models created:

  • base_ef3__learning_standards : Pulls raw learning standards data columns to be used in downstream models.
  • stg_ef3__learning_standards : Extends base_ef3__learning_standards by adding surrogate keys and extracting academic subject descriptors.
  • stg_ef3__grades__learning_standards : Extracts students' learning standard grades and performance conversions.
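
For readers unfamiliar with surrogate keys, here is a minimal Python sketch of the general idea: hash the natural-key columns into one deterministic value. It is an illustration only, not the logic of stg_ef3__learning_standards. The function name `surrogate_key` and the `tenant_code` and `api_year` values are hypothetical; the learning standard ID is one of the examples quoted in this PR.

```python
import hashlib


def surrogate_key(*parts):
    """Hash natural-key parts into one deterministic hex string (hypothetical helper)."""
    # Replace None with a placeholder so a missing value and an empty string
    # do not collapse into the same key.
    normalized = ["<null>" if part is None else str(part) for part in parts]
    return hashlib.md5("|".join(normalized).encode("utf-8")).hexdigest()


# Hypothetical natural key: tenant_code, api_year, learning_standard_id.
key = surrogate_key("tenant_a", 2024, "c86bc12e-e6e3-467b-b8a8-bacc46c95b37")
```

The same inputs always produce the same key, so the value can be recomputed in any downstream model that has the natural-key columns.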

Tests and QC done:

  • Executed a successful dbt run.
  • Queried both Boston and Jeffco data warehouses to ensure data was accurately populated in the tables.
  • Validated the granularity of the tables by querying each learning standard ID in the learning standard tables and cross-referencing it with a student, course section, school, grading period, and learning standard in the student learning standard grades table.
  • Counted rows at each step to confirm there was no fan-out of data, and queried distinct values to confirm the data was represented as expected.
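
The grain and fan-out checks described above can be sketched as follows. This is a self-contained illustration using an in-memory SQLite database, not the queries actually run against the Boston and Jeffco warehouses. The table names, column names, and sample rows are all assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table learning_standards (learning_standard_id text);
    create table grades (
        student_id text, course_section_id text, school_id text,
        grading_period text, learning_standard_id text, grade text
    );
    insert into learning_standards values ('LS1'), ('LS2');
    insert into grades values
        ('s1', 'sec1', 'sch1', 'Q1', 'LS1', 'A'),
        ('s1', 'sec1', 'sch1', 'Q1', 'LS2', 'B'),
        ('s2', 'sec1', 'sch1', 'Q1', 'LS1', 'C');
""")


def duplicate_grain_rows(conn):
    """Return grain combinations that appear more than once in grades."""
    return conn.execute("""
        select student_id, course_section_id, school_id,
               grading_period, learning_standard_id, count(*) as n
        from grades
        group by student_id, course_section_id, school_id,
                 grading_period, learning_standard_id
        having count(*) > 1
    """).fetchall()


def join_fans_out(conn):
    """Return True if joining grades to learning_standards changes the row count."""
    before = conn.execute("select count(*) from grades").fetchone()[0]
    after = conn.execute("""
        select count(*)
        from grades g
        left join learning_standards ls
            on g.learning_standard_id = ls.learning_standard_id
    """).fetchone()[0]
    return after != before


duplicates = duplicate_grain_rows(conn)
fans_out = join_fans_out(conn)
```

An empty `duplicates` list means the assumed grain holds, and a `False` result from `join_fans_out` means the join neither adds nor drops rows.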

Future ToDos & Questions:

  • I want to confirm that the yml configurations are consistent with the rest of the models in the yml file. Are there any other tests or configs that should be added? Is anything needed for the foreign keys?
  • There is an open question about the grade range a student can earn for a learning standard (what is the max? the min?). We have not yet reached a resolution, since this data doesn't seem to be coming through from the learning standard source (there is an ongoing conversation between Jordan, Rob, and me about this).
  • Academic subject values can sometimes appear cryptic (ex: F1FC8A52-3B53-11E0-B042-495E9DFF4B22 ); in other instances they are readable, such as "Science." We should consider whether there is a way to transform them into more readable subject names, or whether they are helpful as is.
  • Some learning_standard_id values include helpful descriptions (ex: Math-Effort_rbcX200MATH00000010001111000), while others are less helpful (ex: c86bc12e-e6e3-467b-b8a8-bacc46c95b37). For now we don't do anything with this column, but it is something to consider for the future.
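
As a possible starting point for the last two items, the sketch below flags values that look like UUIDs, so the cryptic ones can be counted or routed to a lookup. It is only an illustration: the function name `looks_like_uuid` is hypothetical, and the sample values are the ones quoted above.

```python
import re

# Matches the 8-4-4-4-12 hexadecimal layout of a UUID, in either case.
UUID_PATTERN = re.compile(
    r"[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{12}"
)


def looks_like_uuid(value):
    """Return True if the whole value, ignoring surrounding spaces, is UUID-shaped."""
    return UUID_PATTERN.fullmatch(value.strip()) is not None


samples = [
    "F1FC8A52-3B53-11E0-B042-495E9DFF4B22",
    "Science",
    "Math-Effort_rbcX200MATH00000010001111000",
    "c86bc12e-e6e3-467b-b8a8-bacc46c95b37",
]
cryptic = [value for value in samples if looks_like_uuid(value)]
```

Here `cryptic` holds the two UUID-shaped samples, while "Science" and the Math-Effort ID are left as readable values.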

PR Merge Priority:

  • Low
  • Medium
  • High

@AngelicaLastra AngelicaLastra self-assigned this Sep 22, 2023
@rlittle08 rlittle08 merged commit 3f99ad0 into main Sep 27, 2023
@ejoranlienea ejoranlienea deleted the build/learning_standards branch October 2, 2023 23:57
