Skip to main content

A utility library which repairs and analyzes tablular data

Project description

A Python repository which repairs and analyzes tablular data

Description

This module provides the capability to extract and repair blocks of data from 2D tables. These blocks can then be individually processed, stitched together, or filtered as needed by a particular program.

Autoconversions of cells along with a multi-tier flagging system for each magnitude of change allows for a wide variety of error handling. Additionally missing titles can be repaired from surrounding cells in order to generate compelete blocks from implied headings.

Dependencies

  • allset

  • pydatawrap

Setup

Installation

From source:

python settup.py install

From pip:

pip install carpenter

Features

  • Block detection

  • Title repairing

  • Tunable cell conversions

  • Column re-orienting

Language Preferences

  • Google Style Guide

  • Object Oriented (with a few exceptions)

TODO

  • Add refactor top-level functionality

  • Add new usable functions

  • Separate flagging some from block iteration code

Author

Author(s): Matthew Seal

© Copyright 2013, OpenGov

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page