Skip to main content

novel grab crawler module using python3 and lxml

Project description

novel grab crawler module using python3 and lxml

multiprocesssing with multithread version

winxos, AISTLAB Since 2017-02-19

INSTALL:

pip3 install aistlab_novel_grab

USAGE:

from novel_grab.novel_grab import Downloader
d = Downloader()
print(d.get_info())
if d.set_url('http://book.zongheng.com/showchapter/221579.html'):
    d.start()

TIPS

  • When d = Downloader(), d.get_info() can get supported sites info.

  • Once d.set_url(url) will return the url is valid or not.

  • Of course you can use d.get_info() to access the state of d at any time.

  • While finished, will create \(novel_name\).zip file in your current path, default zip method using zipfile.ZIP_DEFLATED

  • Just for educational purpose, take care of yourself.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

AISTLAB_novel_grab-1.2.8.tar.gz (5.7 kB view hashes)

Uploaded Source

Built Distribution

AISTLAB_novel_grab-1.2.8-py2.py3-none-any.whl (8.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page