piculet

XML/HTML scraper using XPath queries.

Project description

Piculet is a module for extracting data from XML or HTML documents using XPath queries. It consists of a single source file with no dependencies other than the standard library, which makes it easy to integrate into applications. It also provides a command line interface.
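To illustrate the general idea of XPath-based extraction, here is a minimal sketch that uses only the standard library's xml.etree.ElementTree. It does not use Piculet's own API. The sample document, the QUERIES mapping, and the extract function are all hypothetical names made up for this illustration, and ElementTree supports only a limited subset of XPath.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample document (not the real shining.html).
DOCUMENT = """
<html>
  <body>
    <h1 class="title">The Shining</h1>
    <ul class="cast">
      <li>Jack Nicholson</li>
      <li>Shelley Duvall</li>
    </ul>
  </body>
</html>
"""

# Hypothetical mapping from output keys to XPath queries.
# This is NOT Piculet's specification format.
QUERIES = {
    "title": ".//h1[@class='title']",
    "cast": ".//ul[@class='cast']/li",
}


def extract(document, queries):
    """Return, for each key, the text of every element matching its query."""
    root = ET.fromstring(document.strip())
    return {
        key: [element.text for element in root.findall(path)]
        for key, path in queries.items()
    }


print(extract(DOCUMENT, QUERIES))
```

Keeping the queries in a plain mapping, separate from the extraction code, loosely mirrors the idea of driving a scraper from a specification file.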

Getting started

Piculet has been tested with Python 3.5+ and compatible versions of PyPy. You can install the latest version using pip:

pip install piculet

Installing Piculet creates a script named piculet which can be used to invoke the command line interface:

$ piculet -h
usage: piculet [-h] [--version] [--html] (-s SPEC | --h2x)

For example, say you want to extract some data from the file shining.html. An example specification is given in movie.json. Download both of these files and run the command:

$ cat shining.html | piculet -s movie.json
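The same pipeline can be driven from Python. The following is a sketch, assuming the piculet script is on the PATH and that its output is UTF-8 text; build_command and run_piculet are hypothetical helper names, not part of Piculet.

```python
import subprocess


def build_command(spec_path):
    """Return the argument list equivalent to `piculet -s SPEC`."""
    return ["piculet", "-s", spec_path]


def run_piculet(document_path, spec_path):
    """Feed the document to piculet on standard input and return its output."""
    with open(document_path, "rb") as document:
        completed = subprocess.run(
            build_command(spec_path),
            stdin=document,          # plays the role of `cat shining.html |`
            stdout=subprocess.PIPE,
            check=True,              # raise CalledProcessError on a non-zero exit
        )
    # Assumption: the command writes UTF-8 text to standard output.
    return completed.stdout.decode("utf-8")


# Usage (requires piculet to be installed and both files to be present):
#     print(run_piculet("shining.html", "movie.json"))
```

Passing the open file as stdin avoids reading the whole document into memory in the calling process.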

Getting help

The documentation is available at: https://piculet.tekir.org/

The source code can be obtained from: https://github.com/uyar/piculet

License

Piculet is released under the LGPL license, version 3 or later. Read the included LICENSE.txt file for details.

Release history

Version	Release date
2.0.0a1 (pre-release, this version)	Jul 23, 2019
2.0.0a0 (pre-release)	Jun 28, 2019
1.0.1	Feb 7, 2019
1.0	May 25, 2018
1.0b7 (pre-release)	Mar 21, 2018
1.0b6 (pre-release)	Jan 17, 2018
1.0b5 (pre-release)	Jan 16, 2018
1.0b4 (pre-release)	Jan 2, 2018
1.0b3 (pre-release)	Jul 25, 2017
1.0b2 (pre-release)	Jun 16, 2017
1.0b1 (pre-release)	Apr 26, 2017
1.0a2 (pre-release)	Apr 4, 2017
1.0a1 (pre-release)	Aug 24, 2016

Download files

Source Distribution

piculet-2.0.0a1.tar.gz (30.8 kB), uploaded Jul 23, 2019

Built Distribution

piculet-2.0.0a1-py3-none-any.whl (36.0 kB), uploaded Jul 23, 2019, Python 3

Hashes for piculet-2.0.0a1.tar.gz
Algorithm	Hash digest
SHA256	`76ecc8c20070770ef8c979efad0e92d16a19a7e3cadc7fae9e41eaa8e2f1d847`
MD5	`04370a5ed45cb6e44610d821d7795951`
BLAKE2b-256	`34998a5760fbd729691a951d072af88ce08e7b94bc778ef56e7c431dde1d16e5`

Hashes for piculet-2.0.0a1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dcf927dd99016aa45ac425fac3e7664aad618af949225a08a6d742c8c7efdafa`
MD5	`1809399be4239df479445e2ec17b0137`
BLAKE2b-256	`602954825cb837d7123b8dbf7f6bc7b645d3ba43aab771c101666567ff57d314`
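
The SHA256 digests listed above can be used to check a downloaded file. The following is a minimal sketch using the standard library's hashlib; sha256_of and matches are hypothetical helper names, and the expected digest is the one copied from the table for piculet-2.0.0a1.tar.gz.

```python
import hashlib

# SHA256 digest copied from the table for piculet-2.0.0a1.tar.gz.
EXPECTED_SHA256 = "76ecc8c20070770ef8c979efad0e92d16a19a7e3cadc7fae9e41eaa8e2f1d847"


def sha256_of(path, chunk_size=65536):
    """Return the hexadecimal SHA256 digest of the file at path."""
    digest = hashlib.sha256()
    with open(path, "rb") as stream:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: stream.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of(path) == expected_hex.lower()


# Usage (requires the downloaded file to be present):
#     print(matches("piculet-2.0.0a1.tar.gz", EXPECTED_SHA256))
```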