feedly

Feedly allows you to build complex feed and caching structures using Redis.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Feedly
------

Feedly allows you to build complex feed and caching structures using Redis.

**What is a feed?**

A feed is a stream of content which is created by people or subjects you follow.
Prime examples are the Facebook newsfeed, your Twitter stream or your Pinterest following page.

Feeds are commonly also called: Activity Streams, activity feeds, news streams.

**Why is it hard?**

It's very hard to split up data for social sites. You can't easily store all Facebook users in Brasil on one server and the ones in The Netherlands on another. One of the recommended approaches to this problem is to publish your activity (ie a tweet on twitter) to all of your followers. These streams of content are hard to maintain and keep up to date, but they are really fast for the user and can easily be sharded.

**Feedly**

Feedly allows you to easily use Redis and Celery (an awesome task broker) to build infinitely scalable feeds.
The core functionality is located in 3 core classes.

- Structures
- Activities
- Feeds
- Feed managers (Feedly)

Structures are basic building blocks wrapping python functionality around Redis datastructures. There are convenient objects for hashes, lists and sorted sets.

Activities is the content which is stored in a feed. It follows the nomenclatura from the [activity stream spec] [astream]
[astream]: http://activitystrea.ms/specs/atom/1.0/#activity.summary
Every activity therefor stores at least:

- Time (the time of the activity)
- Verb (the action, ie loved, liked, followed)
- Actor (the user id doing the action)
- Object (the object the action is related to)
- Extra context (Used for whatever else you need to store at the activity level)

Optionally you can also add a target (which is best explained in the activity docs)

Feeds are sorted containers of activities. They extend upon the data structures and add custom serialization logic and behavior.

Feedly classes (feed managers)
Handle the logic used in addressing the feed objects. They handle the complex bits of fanning out to all your followers when you create a new object (such as a tweet).

In addition there are several utility classes which you will encounter

- Serializers (classes handling serialization of Activity objects)
- Aggregators (utility classes for creating smart/computed feeds based on algorithms)
- Marker (FeedEndMarker, marker class allowing you to correctly cache an empty feed)

**Example**

```python
#Feedly level, on the background this spawns hundreds of tasks to update the feeds of your followers
love_feedly.add_love(love)
love_feedly.remove_love(love)
#Follow a user, adds their content to your feed
love_feedly.follow_user(follow)
love_feedly.unfollow_user(follow)

#Feed level, show the activities stored in the feed
feed = LoveFeed(user_id)
loves = feed[:20]
```

**Admin Interface**

You can find a basic admin interface at /feedly/admin/
Note that it's currently still tied into Fashiolista's use cases.
So this is one which will definitely require forking.

**Features**

Feedly uses celery and redis to build a system which is heavy in terms of writes, but
very light for reads.

- Asynchronous tasks (All the heavy lifting happens in the background, your users don't wait for it)
- Reusable components (You will need to make tradeoffs based on your use cases, Feedly doesnt get in your way)
- It supports distributed redis calls (Threaded calls to multiple redis servers)

**Tradeoffs**

*Store Serialized activities or ids in the feed*
Every feed contains a list of activities. But do you store the data for this activity per feed, or do you only store the id and cache the activity data.
If you store the activity plus data your feed's memory usage will increase.
If you store the id you will need to make more calls to redis upon reads.
In general you will want to store the id to reduce memory usage. Only for notification style feeds which require aggregation (John and 3 other people started following you) you might consider including
the data neccesary to determine the unique keys for aggregation.

*Fallback to the database?*
In general I recommend starting with the database as a fallback. This allows you to get used to running the feed system in production and rebuilt when you eventually lose data.
If your site is already quite large and you want to support multiple content types (Facebook allows pictures, messages etc. Twitter only supports messages.) it will become
impossible to rebuild from the database at some point. If that's the case you need to be sure you have the skills to properly setup persistence storage on your redis slaves.

**Background Articles**

A lot has been written about the best approaches to building feed based systems.
Here's a collection on some of the talks:

[Etsy feed scaling] [etsy]
(Gearman, separate scoring and aggregation steps, rollups - aggregation part two)

[etsy]: http://www.slideshare.net/danmckinley/etsy-activity-feeds-architecture/

[Facebook history] [facebook]

[facebook]: http://www.infoq.com/presentations/Facebook-Software-Stack

[Django project, with good naming conventions.] [djproject]
[djproject]: http://justquick.github.com/django-activity-stream/
http://activitystrea.ms/specs/atom/1.0/
(actor, verb, object, target)

[Quora post on best practises] [quora]

[quora]: http://www.quora.com/What-are-best-practices-for-building-something-like-a-News-Feed?q=news+feeds

[Quora scaling a social network feed] [quora2]

[quora2]: http://www.quora.com/What-are-the-scaling-issues-to-keep-in-mind-while-developing-a-social-network-feed

[Redis ruby example] [redisruby]

[redisruby]: http://blog.waxman.me/how-to-build-a-fast-news-feed-in-redis

[FriendFeed approach] [friendfeed]

[friendfeed]: http://backchannel.org/blog/friendfeed-schemaless-mysql

[Thoonk setup] [thoonk]

[thoonk]: http://blog.thoonk.com/

[Yahoo Research Paper] [yahoo]

[yahoo]: http://research.yahoo.com/files/sigmod278-silberstein.pdf

[Twitter’s approach] [twitter]

[twitter]: http://www.slideshare.net/nkallen/q-con-3770885

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.11.3

Sep 22, 2014

0.11.2

Sep 10, 2014

0.11.1

Sep 7, 2014

0.11.0

Sep 6, 2014

0.10.10

Jun 30, 2014

0.10.9

Jun 29, 2014

0.10.8

Jun 16, 2014

0.10.7

Jun 16, 2014

0.10.6

Jun 15, 2014

0.10.5

May 20, 2014

0.10.4

May 20, 2014

0.10.3

May 20, 2014

0.10.2

May 20, 2014

0.10.1

Apr 10, 2014

0.9.42

Mar 10, 2014

0.9.41

Mar 7, 2014

0.9.40

Feb 28, 2014

0.9.38

Feb 17, 2014

0.9.37

Feb 17, 2014

0.9.36

Feb 17, 2014

0.9.35

Feb 12, 2014

0.9.34

Jan 29, 2014

0.9.32

Dec 2, 2013

0.9.31

Dec 2, 2013

0.9.3

Nov 6, 2013

0.9.2

Oct 29, 2013

0.9.1

Oct 21, 2013

0.9.0

Oct 18, 2013

0.8.134

Oct 18, 2013

0.8.132

Oct 15, 2013

0.8.131

Oct 15, 2013

0.8.130

Oct 15, 2013

0.8.119

Sep 16, 2013

0.8.117

Sep 13, 2013

0.8.115

Sep 13, 2013

0.8.114

Sep 9, 2013

0.8.113

Sep 9, 2013

0.8.111

Sep 5, 2013

0.8.110

Sep 5, 2013

0.8.109

Sep 4, 2013

0.8.108

Sep 4, 2013

0.8.106

Sep 3, 2013

0.8.105

Sep 3, 2013

0.8.104

Sep 3, 2013

0.8.103

Sep 3, 2013

0.8.102

Aug 30, 2013

0.8.97

Aug 21, 2013

0.8.95

Aug 21, 2013

0.8.93

Aug 21, 2013

0.8.9

Aug 16, 2013

0.8.8

Aug 16, 2013

0.8.7

Aug 16, 2013

0.8.6

Aug 15, 2013

0.8.5

Aug 14, 2013

0.8.4

Aug 14, 2013

0.8.2

Aug 13, 2013

0.7.9

Aug 13, 2013

0.7.2

Aug 7, 2013

0.7.1

Aug 6, 2013

0.4.5

Jun 3, 2013

0.4.4

May 3, 2013

0.4.2

May 1, 2013

0.4.0

Apr 24, 2013

0.3.22

Apr 18, 2013

0.3.17

Apr 9, 2013

0.3.16

Apr 8, 2013

0.3.15

Apr 8, 2013

0.3.14

Apr 5, 2013

0.3.13

Apr 2, 2013

0.3.12

Apr 2, 2013

0.3.10

Mar 12, 2013

0.3.9

Mar 12, 2013

0.3.8

Mar 12, 2013

0.2.10

Feb 21, 2013

0.2.9

Feb 18, 2013

0.2.8

Feb 14, 2013

0.2.7

Feb 13, 2013

0.2.6

Feb 11, 2013

0.2.4

Feb 7, 2013

0.2.3

Feb 6, 2013

0.2.2

Jan 31, 2013

0.2.1

Jan 29, 2013

0.2.0

Jan 29, 2013

0.1.4

Jan 25, 2013

0.1.3

Jan 22, 2013

0.1.1

Jan 18, 2013

0.1.0

Jan 17, 2013

This version

0.0.12

Oct 23, 2012

0.0.11

Aug 16, 2012

0.0.10

Aug 14, 2012

0.0.9

Aug 10, 2012

0.0.8

Aug 10, 2012

0.0.7

Aug 10, 2012

0.0.6

Aug 10, 2012

0.0.5

Aug 10, 2012

0.0.4

Aug 10, 2012

0.0.3

Aug 10, 2012

0.0.1

Aug 9, 2012

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

feedly-0.0.12.zip (46.7 kB view hashes)

Uploaded Oct 23, 2012 Source

Hashes for feedly-0.0.12.zip

Hashes for feedly-0.0.12.zip
Algorithm	Hash digest
SHA256	`b75210dc33bf61c11b36d9fceba8ef746d9aa4452aa6c99c845983fd78e0c2ae`
MD5	`fe0c5f166a04821c50acb49c7db3b8a1`
BLAKE2b-256	`66c52061d1589f60ef5fc96a231357af6e7d5b86154679133964185aa3f62b7c`