A Python repository that repairs and analyzes tabular data
This module provides the capability to extract and repair blocks of data from 2D tables. These blocks can then be individually processed, stitched together, or filtered as needed by a particular program.
Autoconversions of cells, along with a multi-tier flagging system for each magnitude of change, allow for a wide variety of error handling. Additionally, missing titles can be repaired from surrounding cells in order to generate complete blocks from implied headings.
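The idea of converting cells while flagging how large each change was can be illustrated with a minimal sketch. This is not this package's API: `ChangeLevel`, `convert_to_number`, and `convert_cells` are hypothetical names, and the choice of which cleanups count as minor or major is an assumption made for the example.

```python
from enum import IntEnum


class ChangeLevel(IntEnum):
    """Hypothetical flag tiers, ordered by magnitude of change."""
    NONE = 0    # value already numeric, nothing changed
    MINOR = 1   # plain numeric string parsed, e.g. " 42 " -> 42.0
    MAJOR = 2   # symbols stripped before parsing, e.g. "$1,200" -> 1200.0
    FAILED = 3  # no conversion possible, original value kept


def convert_to_number(cell):
    """Return (value, level) for a single cell."""
    if isinstance(cell, bool):
        return cell, ChangeLevel.FAILED
    if isinstance(cell, (int, float)):
        return cell, ChangeLevel.NONE
    if not isinstance(cell, str):
        return cell, ChangeLevel.FAILED
    text = cell.strip()
    try:
        return float(text), ChangeLevel.MINOR
    except ValueError:
        pass
    # Assumed "major" repair: drop thousands separators and a leading "$".
    try:
        return float(text.replace(",", "").lstrip("$")), ChangeLevel.MAJOR
    except ValueError:
        return cell, ChangeLevel.FAILED


def convert_cells(cells, tolerate=ChangeLevel.MINOR):
    """Convert cells and collect the indexes whose change exceeds `tolerate`."""
    values, flagged = [], []
    for index, cell in enumerate(cells):
        value, level = convert_to_number(cell)
        values.append(value)
        if level > tolerate:
            flagged.append((index, level))
    return values, flagged


values, flagged = convert_cells([3, " 42 ", "$1,200", "n/a"])
```

Because the tiers are ordered, a caller can pick a single tolerance: a strict program might reject anything above `NONE`, while a lenient one accepts everything short of `FAILED`.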
- allset
- pydatawrap
From source:
python setup.py install
From pip:
pip install carpenter
- Block detection
- Title repairing
- Tunable cell conversions
- Column re-orienting
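As a rough illustration of the first two features, the sketch below splits a 2D table into blocks and repairs missing titles. It is a simplification under stated assumptions, not the package's implementation: the helper names are hypothetical, blocks are assumed to be separated by fully empty rows, and a missing title is assumed to inherit the nearest title to its left.

```python
def is_empty(cell):
    """Treat None and blank strings as empty cells."""
    return cell is None or (isinstance(cell, str) and not cell.strip())


def split_blocks(table):
    """Split a list of rows into blocks separated by fully empty rows."""
    blocks, current = [], []
    for row in table:
        if all(is_empty(cell) for cell in row):
            if current:
                blocks.append(current)
                current = []
        else:
            current.append(row)
    if current:
        blocks.append(current)
    return blocks


def repair_titles(title_row):
    """Fill empty titles from the nearest non-empty title to the left."""
    repaired, last = [], None
    for cell in title_row:
        if not is_empty(cell):
            last = cell
        repaired.append(last)
    return repaired


table = [
    ["Year", "Revenue", ""],
    [2012, "100", "110"],
    ["", "", ""],
    ["Dept", "Staff"],
    ["Parks", 12],
]
blocks = split_blocks(table)
titles = repair_titles(blocks[0][0])
```

Here the empty third title is implied by the "Revenue" heading beside it, which is the kind of case title repair is meant to cover.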
The top level/front facing objects/functions
The block processing implementation detail files for the repository
The regex suite used to perform cell type identification
All unit tests for the repo.
- Google Style Guide
- Object Oriented (with a few exceptions)
- Refactor top-level functionality
- Add new usable functions
- Separate some flagging from block iteration code
Author(s): Matthew Seal
© Copyright 2013, OpenGov