table2csv 0.1.3

Extract data from an HTML table and store the results in a CSV file.

A simple script for downloading HTML tables as CSV.

Installation

pip install -U table2csv

Usage

table2csv http://en.wikipedia.org/wiki/List_of_Super_Bowl_champions > dump.txt

Features

  • Accepts a URL
  • Identifies all the tables on the page
  • Merges tables that share the same structure (e.g. tables with the same column headers are merged)
  • Figures out which table is the biggest
  • Extracts text
  • Extracts links
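
The identify, merge, and pick-the-biggest steps listed above can be sketched with the Python standard library alone. This is a minimal illustration and not table2csv's actual implementation: the names TableParser, merge_by_headers, and biggest_table_csv are hypothetical, the sketch assumes flat (non-nested) tables with <th> header cells, and it extracts cell text only, not links.

```python
import csv
import io
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect every <table> as {'headers': tuple or None, 'rows': list}.

    Hypothetical helper for illustration; assumes tables are not nested.
    """

    def __init__(self):
        super().__init__()
        self.tables = []             # one dict per <table> seen so far
        self._row = None             # cells of the row currently being read
        self._cell = None            # text fragments of the current cell
        self._is_header_row = False  # True once the current row has a <th>

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append({"headers": None, "rows": []})
        elif tag == "tr" and self.tables:
            self._row, self._is_header_row = [], False
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []
            if tag == "th":
                self._is_header_row = True

    def handle_data(self, data):
        # Only keep text that appears inside a cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            table = self.tables[-1]
            if self._is_header_row and table["headers"] is None:
                table["headers"] = tuple(self._row)
            elif self._row:
                table["rows"].append(self._row)
            self._row = None


def merge_by_headers(tables):
    """Concatenate the rows of tables whose header tuples are identical."""
    merged = {}
    for index, table in enumerate(tables):
        headers = table["headers"]
        # Tables without headers get a unique key, so they are never merged.
        key = (headers, None) if headers else (None, index)
        merged.setdefault(key, []).extend(table["rows"])
    return merged


def biggest_table_csv(html):
    """Return the merged table with the most rows as CSV text."""
    parser = TableParser()
    parser.feed(html)
    merged = merge_by_headers(parser.tables)
    if not merged:
        return ""
    (headers, _), rows = max(merged.items(), key=lambda item: len(item[1]))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    if headers:
        writer.writerow(headers)
    writer.writerows(rows)
    return out.getvalue()
```

In this sketch "biggest" means the most data rows after merging; other definitions (for example, the most cells) would work just as well. Passing a page's HTML string to biggest_table_csv returns CSV text that can be written to a file.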

TODO

  • add the ability to specify which table on the page you would like to download (not just the biggest one)
  • [DONE] add support for tables that do not use proper <th> tags for headers (i.e. imperfect HTML tables)
  • detect the data types found within each column
  • add support for tables with hierarchical indices on the rows and/or columns

File                          Type    Py Version  Uploaded on  Size
table2csv-0.1.3.tar.gz (md5)  Source              2013-11-07   4KB