skip to navigation
skip to content

Not Logged In

django-scraper 0.2.2

Django application which crawls and downloads online content following instructions

Features

  • Extract content of given online websites/pages using XPath queries.
  • Process can be started from command line (~cron job) or inside Django code
  • Can be called from command line (~cron job) or inside Django code
  • Automatically browse and download content in related pages, with given depth.
  • Support metadata extract along with other content
  • Have content refinement rules and black words filtering
  • Store and prevent duplication of downloaded content
  • Allow changing User Agent
  • Support proxy servers

Documentation

The full documentation is not ready yet, please go here for notes about installation and usage: https://github.com/zniper/django-scraper

Support

If you have any questions about this application, please email to me[at]zniper.net

 
File Type Py Version Uploaded on Size
django-scraper-0.2.2.tar.gz (md5) Source 2014-10-10 9KB
  • Downloads (All Versions):
  • 14 downloads in the last day
  • 65 downloads in the last week
  • 82 downloads in the last month