Skip to main content

Parse messy tabular data in various formats

Project description

Tabular data as published on the web is often not well formatted and structured. Messytables tries to detect and fix errors in the data. Typical examples include:

  • Finding the header of a table when there are explanations and text fragments in the first few rows of the table.

  • Guessing the type of columns in CSV data.

This library provides data structures and some heuristics to fix these problems and read a wide number of different tabular abominations.

See the full documentation at: https://messytables.readthedocs.io

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page