Python Driver for Apache Drill.
Project description
===============================
pydrill
===============================
.. image:: https://img.shields.io/travis/PythonicNinja/pydrill.svg
:target: https://travis-ci.org/PythonicNinja/pydrill
.. image:: https://img.shields.io/pypi/v/pydrill.svg
:target: https://pypi.python.org/pypi/pydrill
.. image:: https://readthedocs.org/projects/pydrill/badge/?version=latest
:target: https://readthedocs.org/projects/pydrill/?badge=latest
:alt: Documentation Status
.. image:: https://coveralls.io/repos/PythonicNinja/pydrill/badge.svg?branch=master&service=github
:target: https://coveralls.io/github/PythonicNinja/pydrill?branch=master
Python Driver for `Apache Drill <https://drill.apache.org/>`_.
*Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage*
* Free software: MIT license
* Documentation: https://pydrill.readthedocs.org.
Features
--------
* Python 2/3 compatibility,
* Support for all rest API calls inluding profiles/options/metrics `docs with full list <http://pydrill.readthedocs.io/en/latest/readme.html#supported-api-calls>`_.
* Mapping Results to internal python types,
* Compatibility with Pandas data frame,
* Drill Authentication using PAM,
Installation
------------
::
Version from https://pypi.python.org/pypi/pydrill::
$ pip install pydrill
Latest version from git::
$ pip install git+git://github.com/PythonicNinja/pydrill.git
Sample usage
------------
::
from pydrill.client import PyDrill
drill = PyDrill(host='localhost', port=8047)
if not drill.is_active():
raise ImproperlyConfigured('Please run Drill first')
yelp_reviews = drill.query('''
SELECT * FROM
`dfs.root`.`./Users/macbookair/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
LIMIT 5
''')
for result in yelp_reviews:
print("%s: %s" %(result['type'], result['date']))
# pandas dataframe
df = yelp_reviews.to_dataframe()
print(df[df['stars'] > 3])
=======
History
=======
0.3.1 (2017-03-06)
------------------
* Support for Drill Authentication using PAM
0.3 (2017-02-15)
----------------
* requests response encoding (utf-8)
* support Python 3.6 support
0.1.1 (2016-05-21)
------------------
* Anaconda requirements fixed
0.1.0 (2016-05-19)
------------------
* First minor release
* Updated docs
0.0.2 (2016-04-24)
------------------
* First release on PyPI.
* Implementation of metrics/storage/options/stats
* Builds are tested by docker container with Apache Drill running
* support for pandas with ResultQuery.to_dataframe
0.0.1 (2015-12-28)
------------------
* Project start
pydrill
===============================
.. image:: https://img.shields.io/travis/PythonicNinja/pydrill.svg
:target: https://travis-ci.org/PythonicNinja/pydrill
.. image:: https://img.shields.io/pypi/v/pydrill.svg
:target: https://pypi.python.org/pypi/pydrill
.. image:: https://readthedocs.org/projects/pydrill/badge/?version=latest
:target: https://readthedocs.org/projects/pydrill/?badge=latest
:alt: Documentation Status
.. image:: https://coveralls.io/repos/PythonicNinja/pydrill/badge.svg?branch=master&service=github
:target: https://coveralls.io/github/PythonicNinja/pydrill?branch=master
Python Driver for `Apache Drill <https://drill.apache.org/>`_.
*Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage*
* Free software: MIT license
* Documentation: https://pydrill.readthedocs.org.
Features
--------
* Python 2/3 compatibility,
* Support for all rest API calls inluding profiles/options/metrics `docs with full list <http://pydrill.readthedocs.io/en/latest/readme.html#supported-api-calls>`_.
* Mapping Results to internal python types,
* Compatibility with Pandas data frame,
* Drill Authentication using PAM,
Installation
------------
::
Version from https://pypi.python.org/pypi/pydrill::
$ pip install pydrill
Latest version from git::
$ pip install git+git://github.com/PythonicNinja/pydrill.git
Sample usage
------------
::
from pydrill.client import PyDrill
drill = PyDrill(host='localhost', port=8047)
if not drill.is_active():
raise ImproperlyConfigured('Please run Drill first')
yelp_reviews = drill.query('''
SELECT * FROM
`dfs.root`.`./Users/macbookair/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
LIMIT 5
''')
for result in yelp_reviews:
print("%s: %s" %(result['type'], result['date']))
# pandas dataframe
df = yelp_reviews.to_dataframe()
print(df[df['stars'] > 3])
=======
History
=======
0.3.1 (2017-03-06)
------------------
* Support for Drill Authentication using PAM
0.3 (2017-02-15)
----------------
* requests response encoding (utf-8)
* support Python 3.6 support
0.1.1 (2016-05-21)
------------------
* Anaconda requirements fixed
0.1.0 (2016-05-19)
------------------
* First minor release
* Updated docs
0.0.2 (2016-04-24)
------------------
* First release on PyPI.
* Implementation of metrics/storage/options/stats
* Builds are tested by docker container with Apache Drill running
* support for pandas with ResultQuery.to_dataframe
0.0.1 (2015-12-28)
------------------
* Project start
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pydrill-0.3.1.tar.gz
(21.3 kB
view hashes)
Built Distribution
Close
Hashes for pydrill-0.3.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fed0db6c56de98f02ae0afa143cf75a8f00436b932532fb2e755c272f7cca06f |
|
MD5 | add10aff7a5a8c2b8c47a4297e3f52a8 |
|
BLAKE2b-256 | a1101933ab8f4e479ba3092f7344b86aeafd72bcf90bf6897eab1c5133b46e8b |