django-scraper

Django application which crawls and downloads online content following instructions

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Features

Extract content of given online websites/pages using XPath queries.
Process can be started from command line (~cron job) or inside Django code
Can be called from command line (~cron job) or inside Django code
Automatically browse and download content in related pages, with given depth.
Support metadata extract along with other content
Have content refinement rules and black words filtering
Store and prevent duplication of downloaded content
Allow changing User Agent
Support proxy servers

Documentation

The full documentation is not ready yet, please go here for notes about installation and usage: https://github.com/zniper/django-scraper

Support

If you have any questions about this application, please email to me[at]zniper.net

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.3.8

May 26, 2015

0.3.0

May 7, 2015

0.2.3

Mar 12, 2015

This version

0.2.2

Oct 10, 2014

0.2.0

Jul 11, 2014

0.1

Jul 4, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django-scraper-0.2.2.tar.gz (9.8 kB view hashes)

Uploaded Oct 10, 2014 Source

Hashes for django-scraper-0.2.2.tar.gz

Hashes for django-scraper-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`a0351ed4d943e52d3a68922be0f084ec83c7fca6e94c9f6e623948a0b44259eb`
MD5	`0a739b0ca06bc295c2df0092234a32fd`
BLAKE2b-256	`565ec234f08fb4a2c66054709078e5acbb9e6323f4e7737db57842e7891c1b40`

django-scraper 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

Features

Documentation

Support

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution