This repository was archived by the owner on Mar 28, 2025. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 2
Home
echang97 edited this page Jun 26, 2019
·
60 revisions
This is the wiki page for mapping out the Data Quality scripts
- Compare two Excel files to determine if data was added and/or deleted
- Determine if an Excel file follows its predefined format
- Check if numbers are in line with older records
- Maybe make a GUI to make it more User Friendly?
- Credit to Matthew Kudija for the Source Code
- Highlights differences between two Excel files
- Exports file with highlighted differences
Running Excel Diff through Terminal: python diff.py "oldFile.xlsx" "newFile.xlsx"
- A Python script that accepts an Excel file
- Creates formats based on sample Excel files
- Checks given Excel file for:
- New or Missing Fields
- Unexpected Units of measurement or New items
- Number of Withheld rows
Running Setup through Terminal: python formatcheck.py setup "filename.xlsx"
Running Format Check through Terminal: python formatcheck.py "filename.xlsx"