Skip to main content

Find the feed URLs for a website.

Project description

This is an asynchronous Python library for finding links feeds on a website.

It is based on the synchronous (requests based) feedfinder2, written by Dan Foreman-Mackey, which is based on feedfinder - originally written by Mark Pilgrim and subsequently maintained by Aaron Swartz until his untimely death.

Usage

Feedfinder2 offers a single public function: find_feeds. You would use it as following:

import asyncio
from aio_feedfinder2 import find_feeds

loop = asyncio.get_event_loop()
task = asyncio.ensure_future(find_feeds("xkcd.com"))
feeds = loop.run_until_complete(future)

Now, feeds is the list: ['http://xkcd.com/atom.xml', 'http://xkcd.com/rss.xml']. There is some attempt made to rank feeds from best candidate to worst but… well… you never know.

This asyncio variant is ideally suited to find feeds on multiple domains/ sites in an asynchronous way:

import asyncio
from aio_feedfinder2 import find_feeds

loop = asyncio.get_event_loop()
tasks = [find_feeds(url) for url in ["xkcd.com", "abstrusegoose.com"]]
feeds = loop.run_until_complete(asyncio.gather(*tasks))

>>> feeds
... [
...     ['http://xkcd.com/atom.xml', 'http://xkcd.com/rss.xml'],
...     ['http://abstrusegoose.com/feed.xml', 'http://abstrusegoose.com/atomfeed.xml']
... ]

License

Feedfinder2 is licensed under the MIT license (see LICENSE).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aio-feedfinder2-0.3.0.tar.gz (2.0 kB view hashes)

Uploaded Source

Built Distribution

aio_feedfinder2-0.3.0-py3-none-any.whl (6.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page