Crwy

A Simple Web Crawling and Web Scraping framework

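Introduction

Crwy is a lightweight web crawling framework modeled on the structure of Scrapy. It provides practical spider templates intended to help you implement crawling tasks quickly and develop efficiently. gevent support has been added so that crawlers run asynchronously and therefore faster.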

Runtime environment

Python 2.7

Works on Linux and Mac OS X

Dependencies

beautifulsoup4>=4.5.1

pycurl>=7.43.0

configparser>=3.5.0

SQLAlchemy>=1.0.14

pyssdb>=0.1.2

redis>=2.10.5

certifi==2016.9.26

psutil>=5.1.3

gevent>=1.2.1

Installation

Quick install:

pip install crwy

Or download it from: https://pypi.python.org/pypi/Crwy/1.0.3/

User guide

Available here: http://crwy.readthedocs.io/zh_CN/1.0.3/

Related links

Changelog

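2017-04-04 v1.0.3

Added gevent, so that pycurl requests are made asynchronously through gevent;
Added an async template;
Changed the return value of HtmlDownloader, which now returns a Response object.

2017-03-22 v1.0.2

Updated the docs to cover multiprocessing and the redis/ssdb queues.

2017-02-14 v1.0.2

Added multiprocessing support to the runspider module.

2017-02-07 v1.0.2

Changed the module path of RedisQueue and added the SsdbQueue module.

2017-01-09 v1.0.2

Fixed bugs in the templates;
Removed the mysqldb dependency; users install it themselves if they need it;
Renamed the sqlite package in utils to db and extended it into a general-purpose database connection.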

Release history

Version	Release date
1.7.1	Jan 25, 2021
1.7.0	Oct 27, 2020
1.6.0	Feb 4, 2020
1.5.7	Nov 9, 2020
1.5.6	Dec 11, 2019
1.5.5	Jul 25, 2019
1.5.4	Apr 4, 2019
1.5.3	Mar 26, 2019
1.5.2	Mar 25, 2019
1.5.1	Feb 18, 2019
1.5.0	Jan 11, 2019
1.4.0	Jan 6, 2019
1.3.2	Dec 24, 2018
1.3.1	Dec 18, 2018
1.3.0	Dec 6, 2018
1.2.0	Nov 8, 2018
1.1.4	Oct 24, 2018
1.1.3	Oct 19, 2018
1.1.2	Sep 20, 2018
1.1.1	Aug 26, 2018
1.1.0	Aug 24, 2018
1.0.9	Jul 1, 2018
1.0.8	May 10, 2018
1.0.7	Nov 14, 2017
1.0.6	Sep 22, 2017
1.0.5	Jun 13, 2017
1.0.3	Apr 4, 2017 (this version)
1.0.2	Jan 9, 2017
1.0.1	Aug 24, 2016
1.0.0	Aug 18, 2016

Download files

Source Distribution

Crwy-1.0.3.tar.gz (14.4 kB)

Uploaded Apr 4, 2017

Hashes for Crwy-1.0.3.tar.gz

Algorithm	Hash digest
SHA256	`58c4ee229655695253ef7fa92b3261b49c4b9ed12f4811f6e8a6f5bd687922f0`
MD5	`7f26a7bfd79d75cdb7f34c823a3ebf37`
BLAKE2b-256	`638e38eff1650eaf3e5f5146a524486e4f88cfea6163b9a12ffe31de5366a89e`
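
The digests above can be used to check a downloaded copy of the source distribution. The following is a minimal sketch, not part of Crwy itself: the helper names `sha256_of_file` and `verify_download` are hypothetical, and the file path is assumed to point at wherever you saved Crwy-1.0.3.tar.gz. It uses only `hashlib` from the standard library, so it should run on Python 2.7 as well as Python 3.

```python
import hashlib

# SHA256 digest listed on this page for Crwy-1.0.3.tar.gz.
EXPECTED_SHA256 = "58c4ee229655695253ef7fa92b3261b49c4b9ed12f4811f6e8a6f5bd687922f0"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path` (hypothetical helper)."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest matches `expected` (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

Calling `verify_download("Crwy-1.0.3.tar.gz")` returns `True` only when the local file's digest matches the published SHA256 value; a `False` result suggests the download is incomplete or has been altered.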