grab · PyPI

Site Scraping Framework

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Environment
- Web Environment
Intended Audience
- Developers
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python
Topic
- Utilities

Project description

Grab is a python site scraping framework. Grab provides powerful interface to two libraries: lxml and pycurl. There are two ways how to use Grab: 1) Use Grab to configure network requests and to process fetched documents. In this way you should manually control flow of you program. 2) Use Grab::Spider to buld asynchronous site scrapers. This is how scrapy works.

Example of Grab usage:

from grab import Grab

g = Grab()
g.go('https://github.com/login')
g.set_input('login', 'lorien')
g.set_input('password', '***')
g.submit()
for elem in g.doc.select('//ul[@id="repo_listing"]/li/a'):
    print '%s: %s' % (elem.text(), elem.attr('href'))

Example of Grab::Spider usage:

from grab.spider import Spider, Task
import logging

class ExampleSpider(Spider):
    def task_generator(self):
        for lang in ('python', 'ruby', 'perl'):
            url = 'https://www.google.com/search?q=%s' % lang
            yield Task('search', url=url)

    def task_search(self, grab, task):
        print grab.doc.select('//div[@class="s"]//cite').text()


logging.basicConfig(level=logging.DEBUG)
bot = ExampleSpider()
bot.run()

Installation

Pip is recommended way to install Grab and its dependencies:

$ pip install lxml
$ pip install pycurl
$ pip install grab

Documentation

Russian docs: http://docs.grablib.org English docs in progress.

Discussion group (Russian or English): http://groups.google.com/group/python-grab/

Contribution

If you found a bug or if you want new feature please create new issue on github:

https://github.com/lorien/grab/issues

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Environment
- Web Environment
Intended Audience
- Developers
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python
Topic
- Utilities

Release history Release notifications | RSS feed

0.6.41

Jun 24, 2018

0.6.40

May 14, 2018

0.6.39

May 10, 2018

0.6.38

May 17, 2017

0.6.37

May 13, 2017

0.6.36

May 13, 2017

0.6.35

Feb 6, 2017

0.6.34

Feb 4, 2017

0.6.33

Jan 27, 2017

0.6.32

Dec 31, 2016

0.6.31

Dec 31, 2016

0.6.30

Nov 22, 2015

0.6.29

Oct 15, 2015

0.6.28

Oct 13, 2015

0.6.27

Oct 13, 2015

0.6.26

Oct 9, 2015

0.6.25

Sep 20, 2015

0.6.24

Sep 9, 2015

0.6.23

Aug 27, 2015

0.6.22

Aug 14, 2015

0.6.21

Jun 20, 2015

0.6.20

Jun 8, 2015

0.6.19

Jun 5, 2015

0.6.18

Jun 5, 2015

0.6.17

Jun 5, 2015

0.6.16

Jun 3, 2015

0.6.15

May 31, 2015

0.6.14

May 18, 2015

0.6.13

May 12, 2015

0.6.12

May 7, 2015

0.6.11

May 7, 2015

0.6.10

Apr 30, 2015

0.6.9

Apr 29, 2015

0.6.8

Apr 26, 2015

0.6.7

Apr 26, 2015

0.6.6

Apr 23, 2015

0.6.5

Apr 16, 2015

0.6.4

Apr 12, 2015

0.6.3

Apr 10, 2015

0.6.2

Apr 9, 2015

0.6.1

Apr 8, 2015

0.6.0

Apr 6, 2015

0.5.5

Mar 27, 2015

0.5.4

Mar 7, 2015

0.5.3

Mar 7, 2015

0.5.2

Feb 22, 2015

0.5.1

Feb 16, 2015

0.5.0

Feb 9, 2015

0.4.13

Sep 12, 2013

This version

0.4.12

Jul 25, 2013

0.4.11

Jun 7, 2013

0.4.10

May 1, 2013

0.4.9

Apr 27, 2013

0.4.8

Nov 18, 2012

0.4.7

Aug 31, 2012

0.4.5

Jun 27, 2012

0.4.4

Jun 21, 2012

0.4.3

Jun 10, 2012

0.4.2

May 16, 2012

0.4.1

Apr 28, 2012

0.4.0

Apr 27, 2012

0.3.33

Apr 13, 2012

0.3.32

Apr 5, 2012

0.3.31

Mar 30, 2012

0.3.30

Mar 27, 2012

0.3.29

Mar 7, 2012

0.3.28

Mar 6, 2012

0.3.27

Mar 6, 2012

0.3.26

Mar 5, 2012

0.3.25

Mar 1, 2012

0.3.24

Feb 21, 2012

0.3.23

Jan 26, 2012

0.3.22

Jan 16, 2012

0.3.21

Jan 6, 2012

0.3.20

Dec 31, 2011

0.3.19

Dec 25, 2011

0.3.18

Dec 20, 2011

0.3.17

Dec 18, 2011

0.3.16

Dec 7, 2011

0.3.15

Dec 2, 2011

0.3.14

Nov 24, 2011

0.3.13

Nov 22, 2011

0.3.12

Nov 14, 2011

0.3.11

Nov 9, 2011

0.3.10

Nov 6, 2011

0.3.9

Nov 6, 2011

0.3.8

Nov 5, 2011

0.3.7

Nov 5, 2011

0.3.6

Nov 4, 2011

0.3.4

Oct 26, 2011

0.3.3

Oct 23, 2011

0.3.2

Oct 3, 2011

0.3.1

Sep 23, 2011

0.3

Sep 2, 2011

0.2.20

Aug 21, 2011

0.2.19

Aug 14, 2011

0.2.18

Jul 31, 2011

0.2.17

Jul 31, 2011

0.2.16

Jul 23, 2011

0.2.15

Jul 23, 2011

0.2.12

Jun 17, 2011

0.2.11

Jun 13, 2011

0.2.10

May 17, 2011

0.2.9

May 11, 2011

0.2.8

May 5, 2011

0.2.7

May 5, 2011

0.2.6

Mar 23, 2011

0.2.5

Dec 5, 2010

0.2.4

Dec 5, 2010

0.2.3

Nov 10, 2010

0.2.2

Nov 8, 2010

0.2.1

Nov 1, 2010

0.2.0

Nov 1, 2010

0.1.7

Sep 12, 2010

0.1.6

Sep 8, 2010

0.1.5

Sep 8, 2010

0.1.4

Sep 4, 2010

0.1.3

Sep 4, 2010

0.1.2

Sep 3, 2010

0.1.1

Aug 14, 2010

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grab-0.4.12.tar.gz (141.5 kB view hashes)

Uploaded Jul 25, 2013 Source

Hashes for grab-0.4.12.tar.gz

Hashes for grab-0.4.12.tar.gz
Algorithm	Hash digest
SHA256	`9eff4927bb4ae2442a5d2a02e6c4f67af6730e497c7af32e0c6d321fd9473216`
MD5	`a9d42f6db9f96357d18fe170176c95b4`
BLAKE2b-256	`886552362d25343b282c07b4a98bc55b2f1fb0858935378e66d71ecff8a0cb75`