drill

A small Python library for quickly traversing XML data.

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: BSD License
Programming Language
- Python
- Python :: 3
Topic
- Text Processing :: Markup

Project description

## Basic Usage

    import drill
    doc = drill.parse(path_or_url_or_string)

    # Drill down to a specific element.
    # (Python 3 syntax; on Python 2 use unicode() in place of str().)
    print(str(doc.book.title))

    # Iterate through all "title" tags in the document.
    for t in doc.iter('title'):
        print(t.attrs, t.data)

    # Find all "bar" nodes with a "baz" child and a "foo" parent.
    q = doc.find('//foo/bar[baz]')
    # Easily access the first and last elements of matching results.
    print(q.first(), q.last())
    # Iterate over results.
    for e in q:
        do_something(e)

    # Parse only elements matching some path.
    for e in drill.iterparse(filelike, xpath='root/*/something'):
        print(e.tagname, e.data)
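For comparison, here is a rough sketch of the same two ideas (a path query with a child predicate, and streaming over elements as they are parsed) using only the standard library's ElementTree, which this README benchmarks against. The sample document and variable names are invented for illustration, and the ElementTree path syntax differs slightly from the drill queries above.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical sample document, invented for this illustration.
SAMPLE = """<root>
  <foo>
    <bar id="1"><baz/></bar>
    <bar id="2"/>
  </foo>
  <other>
    <bar id="3"><baz/></bar>
  </other>
</root>"""

root = ET.fromstring(SAMPLE)

# ElementTree spells "anywhere below this element" as './/' rather than '//'.
# The [baz] predicate keeps only <bar> elements that have a <baz> child.
matches = root.findall(".//foo/bar[baz]")
matched_ids = [e.get("id") for e in matches]

# findall() returns a plain list, so first/last are ordinary indexing.
first, last = matches[0], matches[-1]

# Streaming: handle each <bar> as soon as its end tag has been read.
streamed_ids = []
stream = io.BytesIO(SAMPLE.encode("utf-8"))
for event, elem in ET.iterparse(stream, events=("end",)):
    if elem.tag == "bar":
        streamed_ids.append(elem.get("id"))
        elem.clear()  # drop the element's children, attributes and text

print(matched_ids, streamed_ids)
```

Unlike the `xpath=` argument shown for `drill.iterparse`, ElementTree's `iterparse` has no path filter, so the sketch filters on the tag name inside the loop.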

## Features

* Runnable test suite
* Python 3 support

## Advantages

* Pure Python
* Faster, more efficient parsing than ElementTree
    * Using ElementTree, a ~150 MB XML file (~3,000,000 elements) took ~46 seconds to parse, consuming ~1.3 GB of RAM
    * Parsing the same file using drill took ~24 seconds and consumed ~950 MB of RAM
    * Very unscientific benchmarks performed on a Core i5 @ 2.8 GHz, running Windows 7. YMMV.
* Lots of convenience methods for accessing elements quickly:
    * doc.response.resultCode.data
    * root[2].child['attr']
    * first/last/prev/next methods for traversing siblings
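The benchmark figures above are not accompanied by a script. As a minimal sketch of how such a comparison could be reproduced, the following times a parse and records peak memory with the standard library. The synthetic document, the `make_document` and `measure` helpers, and the element count are all assumptions made for this example, not the original benchmark setup. Note that `tracemalloc` only sees memory allocated through Python's allocators, so its numbers will not match process-level RAM figures like those quoted above.

```python
import time
import tracemalloc
import xml.etree.ElementTree as ET


def make_document(n_items):
    """Build a synthetic XML string with n_items <item> elements (invented test data)."""
    rows = "".join(
        '<item id="%d"><name>n%d</name></item>' % (i, i) for i in range(n_items)
    )
    return "<root>" + rows + "</root>"


def measure(parse, text):
    """Return (result, elapsed seconds, peak traced bytes) for parse(text)."""
    tracemalloc.start()
    started = time.perf_counter()
    result = parse(text)
    elapsed = time.perf_counter() - started
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, elapsed, peak


text = make_document(10000)
root, elapsed, peak = measure(ET.fromstring, text)

# Count every element in the parsed tree, including the root itself.
element_count = sum(1 for _ in root.iter())
print("parsed %d elements in %.3f s, peak %.1f MiB"
      % (element_count, elapsed, peak / 2**20))
```

With drill installed, passing `drill.parse` to `measure` in place of `ET.fromstring` should give the matching figure for drill, since the usage section shows `drill.parse` accepting a string.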

Release history

- 1.2.0 (Dec 3, 2018), this version
- 1.1.3 (Mar 30, 2016)
- 1.1.2 (Sep 24, 2015)
- 1.1.1 (Aug 5, 2013)
- 1.1.0 (Mar 8, 2013)

Download files

Source Distribution

drill-1.2.0.tar.gz (7.5 kB), uploaded Dec 3, 2018, source

Built Distribution

drill-1.2.0-py2.py3-none-any.whl (8.4 kB), uploaded Dec 3, 2018, Python 2 and Python 3

Hashes for drill-1.2.0.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `d2645ed6d3cfc925bd7bf5328982d8a5aff7cda9c7e56107c7a74482f7037b7d` |
| MD5 | `8b995f9ce6739ee3f2722b4aff6c065e` |
| BLAKE2b-256 | `e4213d1dec8958c74c3d1f46a6f264e12b146a4b97458240a68ad10ab3a41031` |

Hashes for drill-1.2.0-py2.py3-none-any.whl

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `fb5a1eae68993076d033034dd446a832f2b9757222478c47f0cf129bafe70a74` |
| MD5 | `0183625d1a2b40b1ab2d9465c6e16448` |
| BLAKE2b-256 | `736c2871f4b4ad4dbc2d0fc7078e1b91fbdd2a62b33b9ffe9cba3dc610fd669b` |
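
The SHA256 digests listed above can be checked against a downloaded file with the standard library. This is a minimal sketch: the helper names `sha256_of_stream`, `sha256_of_file` and `matches_published` are made up for this example, and the local file path is whatever you saved the download as.

```python
import hashlib
import io

# SHA256 digest published above for drill-1.2.0.tar.gz.
SDIST_SHA256 = "d2645ed6d3cfc925bd7bf5328982d8a5aff7cda9c7e56107c7a74482f7037b7d"


def sha256_of_stream(stream, chunk_size=65536):
    """Hash a binary file-like object in chunks and return the hex digest."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def sha256_of_file(path):
    """Hash the file at path without loading it into memory at once."""
    with open(path, "rb") as handle:
        return sha256_of_stream(handle)


def matches_published(actual_hex, expected_hex=SDIST_SHA256):
    """Compare a computed digest with the published one, ignoring case."""
    return actual_hex.lower() == expected_hex.lower()


# Small self-check on in-memory data (not the real archive).
demo_digest = sha256_of_stream(io.BytesIO(b"abc"))
print(demo_digest, matches_published(demo_digest))
```

For the real archive, `matches_published(sha256_of_file("drill-1.2.0.tar.gz"))` should return `True` if the download is intact.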