Utilities for scraping Yahoo pages

Project description

Python utilities for scraping Yahoo pages

The yahooscraper package is organized into modules and submodules. Each leaf module (i.e., module without submodules) contains functions that take a single argument – HTML text of the page represented by the module and its namespacing – and return some data parsed from the page.

If the data is not found, the function returns None. Functions that return an iterable may return an empty iterable instead.
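As a rough illustration of this convention, here is a minimal sketch built only on the standard library. The names `page_title` and `link_targets` are hypothetical and are not part of yahooscraper; they only show the shape of a function that takes HTML text and returns parsed data, None, or an empty list.

```python
from html.parser import HTMLParser


class _TitleAndLinks(HTMLParser):
    """Collects the <title> text and every <a href="..."> value."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.title_parts = []
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title_parts.append(data)


def _parse(html):
    parser = _TitleAndLinks()
    parser.feed(html)
    parser.close()
    return parser


def page_title(html):
    """Return the page title, or None if no title text is found."""
    title = "".join(_parse(html).title_parts).strip()
    return title or None


def link_targets(html):
    """Return all link targets, or an empty list if there are none."""
    return _parse(html).hrefs
```

Callers can then test the result with `is None` (or check for an empty list) before using it.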

Each leaf module also includes a url() function, which returns the URL of the page represented by the module. In cases where the module represents a set of URLs, this function takes parameters.
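A parameterized url() function might look roughly like the sketch below. The host and path layout are placeholders chosen for illustration; the URL patterns yahooscraper actually uses are not given here, so treat `BASE_URL` and the path order as assumptions.

```python
from urllib.parse import quote

# Placeholder host: the real Yahoo URL pattern is not given in this document.
BASE_URL = "https://fantasy.example.invalid"


def url(sport, league_id, team_id):
    """Build the URL of one team page (hypothetical path layout)."""
    return "{}/{}/{}/{}".format(
        BASE_URL,
        quote(str(sport), safe=""),  # escape anything unsafe in a path segment
        int(league_id),              # reject non-numeric IDs early
        int(team_id),
    )
```

Keeping URL construction in the same module as the parsing functions means one module describes one page: where to fetch it and what can be read from it.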

To get data from pages that require authentication, first obtain an authenticated session using the login module’s authenticated_session() function.

For more detailed API documentation, use pydoc:

$ pydoc yahooscraper
$ pydoc yahooscraper.login
$ pydoc yahooscraper.fantasy.team

Examples

Output Fantasy NBA team name:

import yahooscraper as ys

LEAGUE_ID = 237834
TEAM_ID = 8

session = ys.login.authenticated_session()
response = session.get(ys.fantasy.team.url('nba', LEAGUE_ID, TEAM_ID))
team = ys.fantasy.team.team(response.text)
print(team)
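Since parsing functions return None when the data is not found, the example above may print None if the page layout changes or the session is not logged in. The following sketch wraps the fetch-and-parse step and reports a missing value explicitly. `fetch_and_parse` and `describe` are hypothetical helpers, not part of yahooscraper, and the session is assumed only to offer a `get()` whose result has a `.text` attribute, as in the example.

```python
def fetch_and_parse(session, page_url, parse):
    """Fetch page_url with session and return parse(html).

    session: any object with get(url) returning an object with .text
    parse:   a function taking HTML text and returning parsed data or None
    """
    response = session.get(page_url)
    return parse(response.text)


def describe(value, label="team"):
    """Format a parsed value, making a missing value visible."""
    if value is None:
        return "{} not found (page layout changed or login required?)".format(label)
    return "{}: {}".format(label, value)
```

With the names from the example, this would be called as `describe(fetch_and_parse(session, ys.fantasy.team.url('nba', LEAGUE_ID, TEAM_ID), ys.fantasy.team.team))`.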

Install

$ pip install yahooscraper

Development

Test

$ python tests.py

Deploy

  • Bump version in setup.py

  • $ python setup.py register

  • $ python setup.py sdist upload

Download files

Source Distribution

yahooscraper-0.3.0.tar.gz (3.7 kB)
