csv-overview
============

Get the vital statistics of a CSV file.

You can install with:
$ pip install csv_overview

For example, using Dish.csv from the NYPL menus database:

$ csv-overview some/path/Dish.csv
Dish.csv
351769 entries
(0) id: 351769 unique (100.00%)
    0 empty (0.00%),
    351769 numeric (100.00%)
    e.g. ['254113', '349976', '228305', '315656', '248210']
(1) name: 351734 unique (99.99%)
    0 empty (0.00%),
    59 numeric (0.02%)
    e.g. ['Pineapple Wheel', 'Kase, Fruchte, Kaffee', 'Cocoanut Pie per cut', 'Pernod for Absinthe', 'Hot Chicken Sandwich']
(2) description: 1 unique (0.00%)
    351769 empty (100.00%),
    0 numeric (0.00%)
    e.g. ['', '', '', '', '']
(3) menus_appeared: 481 unique (0.14%)
    0 empty (0.00%),
    351769 numeric (100.00%)
    e.g. ['357', '395', '328', '283', '195']
(4) times_appeared: 505 unique (0.14%)
    0 empty (0.00%),
    351769 numeric (100.00%)
    e.g. ['1360', '516', '155', '434', '85']
(5) first_appeared: 140 unique (0.04%)
    0 empty (0.00%),
    351769 numeric (100.00%)
    e.g. ['1913', '1943', '1982', '1908', '1865']
(6) last_appeared: 140 unique (0.04%)
    0 empty (0.00%),
    351769 numeric (100.00%)
    e.g. ['2004', '1920', '1981', '1904', '1998']
(7) lowest_price: 633 unique (0.18%)
    6211 empty (1.77%),
    345558 numeric (98.23%)
    e.g. ['19.25', '135.0', '2.19', '4.51', '12.95']
(8) highest_price: 685 unique (0.19%)
    6211 empty (1.77%),
    345558 numeric (98.23%)
    e.g. ['2.3', '850.0', '6.99', '4.0', '1100.0']
Numerics:
    id: 1.000000 - 410825.000000 (avg: 209010.0423857702, non-0 avg: 209010.0423857702, median: 209171.000000, mode: '287144')
    menus_appeared: 0.000000 - 7195.000000 (avg: 2.9667025803865603, non-0 avg: 2.9704264960378906, median: 1.000000, mode: '1')
    times_appeared: -1.000000 - 7901.000000 (avg: 3.0338801884191047, non-0 avg: 3.0363488419070053, median: 1.000000, mode: '1')
    first_appeared: 0.000000 - 2012.000000 (avg: 1744.2648584724634, non-0 avg: 1930.0559754896872, median: 1900.000000, mode: '1900')
    last_appeared: 0.000000 - 2012.000000 (avg: 1747.0906845117108, non-0 avg: 1933.1827955974545, median: 1900.000000, mode: '1900')
    lowest_price: 0.000000 - 9500.000000 (avg: 0.9507477181834131, non-0 avg: 2.526713734176425, median: 0.000000, mode: '0.0')
    highest_price: 0.000000 - 9500.000000 (avg: 1.517539747307291, non-0 avg: 3.915697196875889, median: 0.000000, mode: '0.0')
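
To make the figures in the report above easier to interpret, here is a minimal Python sketch of how per-column statistics of this kind could be computed. It is not csv-overview's actual implementation. The names `is_numeric` and `column_overview`, the rule that "numeric" means "parses as a float", the choice to take the mode over the raw strings, and the tiny `SAMPLE` table are all assumptions made for illustration.

```python
import csv
import io
import statistics
from collections import Counter


def is_numeric(value):
    """Assumed definition of 'numeric': the cell parses as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def column_overview(rows, column):
    """Summarise one column: unique/empty/numeric counts plus numeric stats."""
    values = [row[column] for row in rows]
    numeric = [v for v in values if is_numeric(v)]  # empty strings never parse
    summary = {
        "entries": len(values),
        "unique": len(set(values)),
        "empty": sum(1 for v in values if v == ""),
        "numeric": len(numeric),
    }
    if numeric:
        nums = [float(v) for v in numeric]
        non_zero = [n for n in nums if n != 0]
        summary.update(
            min=min(nums),
            max=max(nums),
            avg=statistics.fmean(nums),
            # "non-0 avg" ignores zeros, which often stand in for missing data
            non_zero_avg=statistics.fmean(non_zero) if non_zero else None,
            median=statistics.median(nums),
            # mode is taken over the raw strings, so '0.0' and '0' stay distinct
            mode=Counter(numeric).most_common(1)[0][0],
        )
    return summary


# Hypothetical data, not taken from the NYPL file.
SAMPLE = """id,name,lowest_price
1,Consomme,0.25
2,Radishes,
3,Celery,0.0
4,1900,0.25
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
overview = {col: column_overview(rows, col) for col in rows[0]}

for col, stats in overview.items():
    pct = 100 * stats["numeric"] / stats["entries"]
    print(f"{col}: {stats['unique']} unique, {stats['empty']} empty, "
          f"{stats['numeric']} numeric ({pct:.2f}%)")
```

The gap between "avg" and "non-0 avg" in the report is the useful signal here: for `first_appeared`, the plain average (about 1744) is dragged down by rows holding 0, while the non-zero average (about 1930) is closer to a plausible year.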
