scrapy util
Project description
Scrapy util
启用数据收集功能
此功能配合spider-admin-pro 使用
# 项目名默认是(不需要设置)
BOT_NAME = 'scrapy_demo'
# 设置收集运行日志的路径,会以post方式提交json数据
STATS_COLLECTION_URL = "http://127.0.0.1:5001/api/collection"
# 启用数据收集扩展
EXTENSIONS = {
# 'scrapy.extensions.telnet.TelnetConsole': None,
'scrapy_util.extensions.SpiderItemCountExtension': 100
}
使用脚本Spider
# -*- coding: utf-8 -*-
from scrapy import cmdline
from scrapy_util.spiders import ScriptSpider
class BaiduScriptSpider(ScriptSpider):
name = 'baidu_script'
def execute(self):
print("hi")
if __name__ == '__main__':
cmdline.execute('scrapy crawl baidu_script'.split())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
scrapy-util-0.0.4.tar.gz
(4.3 kB
view hashes)
Built Distribution
Close
Hashes for scrapy_util-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c05397e239823cebdc92781bcc029c48e39eda30371ac81b12a5388826ce8b6 |
|
MD5 | ff2ccc080122aaef9bd953e48334ee43 |
|
BLAKE2b-256 | 098d1131d029ef2eb9a331ebcb9240a971f31442e20968ce6bddc65e3af9e98f |