tabulator

A utility library that provides a consistent interface for reading tabular data.

Maintainers

brew callmealien okfn pwalsh

tabulator-py
============

`|Travis| <https://travis-ci.org/frictionlessdata/tabulator-py>`_
`|Coveralls| <https://coveralls.io/r/frictionlessdata/tabulator-py?branch=master>`_
`|PyPi| <https://pypi.python.org/pypi/tabulator>`_
`|Gitter| <https://gitter.im/frictionlessdata/chat>`_

A utility library that provides a consistent interface for reading
tabular data.

Getting Started
---------------

Installation
~~~~~~~~~~~~

To get started (under development):

::

    $ pip install tabulator

Simple interface
~~~~~~~~~~~~~~~~

For fast access to a table, use the ``topen`` function (short for
"table open"):

::

    from tabulator import topen

    # with_headers=True extracts the headers from the table
    with topen('path.csv', with_headers=True) as table:
        for row in table:
            print(row)
            print(row.get('header'))

For most use cases the ``topen`` function is enough. It takes a
``source`` argument of the form

``<scheme>://path/to/file.<format>``

and uses the corresponding ``Loader`` and ``Parser`` to open the table
and start iterating over it. You can also pass ``scheme`` and ``format``
explicitly as function arguments. The last ``topen`` argument is
``encoding``, which forces Tabulator to use the encoding of your choice
when opening the table.
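
To make the ``<scheme>://path/to/file.<format>`` convention concrete,
here is a minimal sketch of how a scheme and a format could be guessed
from a source string. The helper name ``detect_scheme_and_format`` and
the fallback to a ``file`` scheme are assumptions made for illustration;
this is not Tabulator's own detection code:

::

    import os
    from urllib.parse import urlparse


    def detect_scheme_and_format(source):
        """Guess (scheme, format) from a source string.

        Hypothetical helper for illustration; not part of Tabulator.
        """
        parsed = urlparse(source)
        # Assumption: a source without a scheme is a local file path
        scheme = parsed.scheme or 'file'
        # The format is taken from the file extension, without the dot
        extension = os.path.splitext(parsed.path)[1]
        return scheme, extension.lstrip('.').lower()


    print(detect_scheme_and_format('http://example.com/data/table.csv'))
    print(detect_scheme_and_format('path.csv'))

When guessing like this is not possible (for example, a URL without a
file extension), passing ``scheme`` and ``format`` explicitly is the
safer choice.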

Read more about ``topen`` -
`documentation <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/topen.py>`_.

The ``topen`` function returns a ``Table`` instance. We use a context
manager to call ``table.open()`` on enter and ``table.close()`` on exit:

- the table can be iterated like a file-like object, returning row by
  row
- the table can be read row by row using the ``readrow`` method (it
  returns a row tuple)
- the table can be read into memory using the ``read`` method (it
  returns a list of row tuples), with a ``limit`` on the number of
  output rows as a parameter
- headers can be accessed via the ``headers`` property
- the table pointer can be set back to the start via the ``reset``
  method
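
The interface listed above can be illustrated with a small stand-in
class built on the standard library. ``MiniTable`` is a hypothetical
name, it reads CSV text from a string rather than from a file, and it
always treats the first row as headers; it only mimics the method names
described here and is not the real ``Table`` implementation:

::

    import csv
    import io


    class MiniTable(object):
        """Hypothetical stand-in that mimics the interface listed above."""

        def __init__(self, text):
            self._text = text
            self._rows = None
            self.headers = None

        def open(self):
            self._rows = csv.reader(io.StringIO(self._text))
            # Assumption of this sketch: the first row holds the headers
            self.headers = tuple(next(self._rows))

        def close(self):
            self._rows = None

        def __enter__(self):
            self.open()
            return self

        def __exit__(self, exc_type, exc_value, traceback):
            self.close()

        def __iter__(self):
            return self

        def __next__(self):
            return tuple(next(self._rows))

        def readrow(self):
            # Return the next row as a tuple
            return next(self)

        def read(self, limit=None):
            # Collect rows into a list, stopping once `limit` is reached
            rows = []
            while limit is None or len(rows) < limit:
                try:
                    rows.append(self.readrow())
                except StopIteration:
                    break
            return rows

        def reset(self):
            # Move the pointer back to the first data row
            self.open()


    with MiniTable('id,name\n1,english\n2,chinese\n') as table:
        print(table.headers)
        print(table.readrow())
        table.reset()
        print(table.read(limit=1))
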

Read more about ``Table`` -
`documentation <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/table.py>`_.

In the example above we use ``processors.Headers`` to extract headers
from the table (via the ``with_headers=True`` shortcut). Processors are
a powerful Tabulator concept: parsed data goes through a pipeline of
processors, which can update it before it is returned as a table row.
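
The pipeline idea can be sketched without Tabulator itself. In the
example below, the class names, the ``process(index, values)``
signature, and the convention that returning ``None`` drops a row are
all assumptions made for illustration; see the ``Processor`` API for the
real contract:

::

    class SkipEmpty(object):
        """Hypothetical processor: drops rows whose values are all empty."""

        def process(self, index, values):
            if not any(values):
                return None
            return values


    class UpperCase(object):
        """Hypothetical processor: upper-cases every string value."""

        def process(self, index, values):
            return tuple(value.upper() for value in values)


    def run_pipeline(rows, processors):
        """Pass each parsed row through every processor in order."""
        for index, values in enumerate(rows):
            for processor in processors:
                values = processor.process(index, values)
                if values is None:
                    break  # a processor dropped this row
            else:
                yield values  # no processor dropped the row


    parsed = [('id', 'name'), ('', ''), ('1', 'english')]
    for row in run_pipeline(parsed, [SkipEmpty(), UpperCase()]):
        print(row)

The order of processors matters: each one receives the output of the
previous one.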

Read more about ``Processor`` -
`documentation <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/processors/api.py>`_.

Read a processors tutorial -
`tutorial <https://github.com/frictionlessdata/tabulator-py/blob/master/docs/processors.md>`_.

Advanced interface
~~~~~~~~~~~~~~~~~~

To get full control over the process, you can pass more parameters. The
example below shows all parts of Tabulator:

::

    from tabulator import topen, processors, loaders, parsers

    # CustomIterator and CustomTable are placeholders for user-defined classes
    table = topen('path.csv',
        loader_options={'encoding': 'utf-8'},
        parser_options={'delimiter': ',', 'quotechar': '|'},
        loader_class=loaders.File,
        parser_class=parsers.CSV,
        iterator_class=CustomIterator,
        table_class=CustomTable)
    # Add the headers processor to the pipeline
    table.add_processor(processors.Headers(skip=1))
    headers = table.headers
    contents = table.read(limit=10)
    print(headers, contents)
    # No context manager is used here, so close the table explicitly
    table.close()

The ``Table`` class can also be instantiated directly (see the
documentation). The only difference from a ``topen`` call with the
extended list of parameters is that ``topen`` also calls the
``table.open()`` method.

Design Overview
---------------

Tabulator uses a modular architecture to be fully extensible and
flexible. It uses loosely coupled modules such as ``Loader``, ``Parser``
and ``Processor`` to provide a clear data flow.
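
As a rough sketch of this data flow, the example below wires a loader, a
parser, and a processor together. All names here (``TextLoader``,
``CSVParser``, ``strip_values``, ``read_table``) are hypothetical, and
the loader takes the CSV text itself as its source to keep the example
self-contained; the real ``Loader``, ``Parser`` and ``Processor`` APIs
may look different:

::

    import csv
    import io


    class TextLoader(object):
        """Hypothetical loader: turns a source into a stream of text."""

        def load(self, source):
            return io.StringIO(source)


    class CSVParser(object):
        """Hypothetical parser: turns a text stream into row tuples."""

        def parse(self, stream):
            for values in csv.reader(stream):
                yield tuple(values)


    def strip_values(values):
        """Hypothetical processor: trims whitespace around each value."""
        return tuple(value.strip() for value in values)


    def read_table(source, loader, parser, processors):
        """Wire the three stages together: load, parse, then process."""
        stream = loader.load(source)
        for values in parser.parse(stream):
            for processor in processors:
                values = processor(values)
            yield values


    rows = list(read_table('id, name\n1, english\n',
                           TextLoader(), CSVParser(), [strip_values]))
    print(rows)

Because each stage only depends on the output of the previous one, any
of them can be swapped out without touching the others.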

.. figure:: docs/diagram.png
   :align: center
   :alt: diagram

   diagram

Documentation
-------------

API documentation is presented as docstrings:

- High-level:

  - `topen <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/topen.py>`_

- Core elements:

  - `Row <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/row.py>`_
  - `Table <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/table.py>`_
  - `Iterator <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/iterator.py>`_

- Plugin elements:

  - `Loader API <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/loaders/api.py>`_
  - `Parser API <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/parsers/api.py>`_
  - `Processor API <https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/processors/api.py>`_

Contributing
------------

Please read the contribution guideline:

`How to Contribute <CONTRIBUTING.md>`_

Thanks!

.. |Travis| image:: https://img.shields.io/travis/frictionlessdata/tabulator-py/master.svg
.. |Coveralls| image:: http://img.shields.io/coveralls/frictionlessdata/tabulator-py.svg?branch=master
.. |PyPi| image:: https://img.shields.io/pypi/v/tabulator.svg
.. |Gitter| image:: https://img.shields.io/gitter/room/frictionlessdata/chat.svg


Download files
--------------

Source distribution: ``tabulator-0.3.12.tar.gz`` (18.1 kB), uploaded
Mar 29, 2016.

===========  ====================================================================
Algorithm    Hash digest
===========  ====================================================================
SHA256       ``71439890f65785b0c5ca8fe537e59fdab86a7a91b49db64673643058b2cd89ac``
MD5          ``9f4deffe4a36d19399b5762f98fb7229``
BLAKE2b-256  ``034ef9b900e23c2386368707698b5d471cbf7fcc197400eb33daf9aa15b65a66``
===========  ====================================================================

Built distribution: ``tabulator-0.3.12-py2.py3-none-any.whl`` (33.8 kB),
for Python 2 and Python 3, uploaded Mar 29, 2016.

===========  ====================================================================
Algorithm    Hash digest
===========  ====================================================================
SHA256       ``e983e9b0090b8ca4b76b011dd8d0b8d3f9487a523c791d0fe7b259b9919eca31``
MD5          ``22125fa78aff7421dcf24b5407ec192d``
BLAKE2b-256  ``b59ffa7cf6f5519bf9ede1f12bb00ab5321ff92849018a3c4b6b31cf60d865b2``
===========  ====================================================================