Contributing

Contributions are welcome from the community. Questions can be asked on the CEDS Community of Practice website.

Submitting Code Changes

If you have direct contributions you would like considered for incorporation into the project, fork this repository and submit a pull request for review. Along with your pull request, please create a use case ticket that details the parameters for your changes.

Changes to the CEDS Data Warehouse Parquet standard must originate from a change to the CEDS Data Warehouse. Development guidelines for the CEDS Data Warehouse schema can be found here.

Data Warehouse Parquet Design Guidelines

Naming Conventions

The CEDS Data Warehouse Parquet standard is based on the CEDS Data Warehouse data standard fact tables.
The fields within the files are derived from the table and column names in the Fact and Dimension tables, in this format: "[Fact table column name minus the trailing 'Id']_[Dimension table column name]". For example, RDS.FactK12StudentEnrollments.EnrollmentEntryDateId, which links to RDS.DimDates, translates to the following fields in the Parquet standard:

FactK12StudentEnrollmentsParquet

  • EnrollmentEntryDate_DateValue
  • EnrollmentEntryDate_Day
  • EnrollmentEntryDate_DayOfWeek
  • EnrollmentEntryDate_DayOfYear
  • EnrollmentEntryDate_Month
  • EnrollmentEntryDate_MonthName
  • EnrollmentEntryDate_SubmissionYear
  • EnrollmentEntryDate_Year
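
The naming rule above can be sketched in a few lines of Python. This is only an illustration, not part of the standard or its tooling: the helper name `parquet_field_names` is hypothetical, and the dimension column list is a subset copied from the example above rather than read from the warehouse.

```python
def parquet_field_names(fact_column: str, dimension_columns: list[str]) -> list[str]:
    """Build field names as <fact column without trailing 'Id'>_<dimension column>.

    Hypothetical helper for illustration; not part of the CEDS tooling.
    """
    # Drop the trailing key suffix. The check is case-insensitive so that
    # both 'Id' and 'ID' endings are handled the same way.
    if fact_column.lower().endswith("id"):
        prefix = fact_column[:-2]
    else:
        prefix = fact_column
    return [f"{prefix}_{column}" for column in dimension_columns]


# Illustrative subset of RDS.DimDates columns, taken from the example above.
dim_dates_columns = ["DateValue", "Day", "DayOfWeek", "Year"]

fields = parquet_field_names("EnrollmentEntryDateId", dim_dates_columns)
for name in fields:
    print(name)
```

Running the same helper over each foreign-key column of a fact table, paired with the columns of the dimension it links to, would give a candidate field list to check against the metadata workbook.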

Each of the CEDS Data Warehouse Parquet file standards is documented in CEDS-Data-Warehouse-Parquet-File-Metadata.xlsx