django-cacheops

A slick ORM cache with automatic granular event-driven invalidation for Django.

Development Status
- 4 - Beta
Environment
- Web Environment
Framework
- Django
Intended Audience
- Developers
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
- Python
Topic
- Internet :: WWW/HTTP
- Software Development :: Libraries :: Python Modules

Project description

A slick app that supports automatic or manual queryset caching and automatic granular event-driven invalidation.

It uses Redis as the backend for the ORM cache, and either Redis or the filesystem for the simple time-invalidated cache.

And there is more to it:

- a decorator to cache any user function as a queryset
- an extension for Jinja2 to cache template fragments as querysets
- a couple of hacks to make Django faster

Requirements

Python 2.6, Django 1.2 and Redis 2.2.7.

Installation

Using pip:

$ pip install django-cacheops

Or you can get the latest version from GitHub:

$ git clone https://github.com/Suor/django-cacheops.git
$ ln -s `pwd`/django-cacheops/cacheops/ /somewhere/on/python/path/

Setup

Add cacheops to your INSTALLED_APPS before any apps that use it.

Set up the Redis connection and enable caching for the desired models:

CACHEOPS_REDIS = {
    'host': 'localhost', # redis-server is on same machine
    'port': 6379,        # default redis port
    'db': 1,             # SELECT non-default redis database
                         # using separate redis db or redis instance
                         # is highly recommended
    'socket_timeout': 3,
}

CACHEOPS = {
    # Automatically cache any User.objects.get() calls for 15 minutes
    # This includes request.user or post.author access,
    # where Post.author is a foreign key to auth.User
    'auth.user': ('get', 60*15),

    # Automatically cache all gets, queryset fetches and counts
    # to other django.contrib.auth models for an hour
    'auth.*': ('all', 60*60),

    # Enable manual caching on all news models with default timeout of an hour
    # Use News.objects.cache().get(...)
    #  or Tags.objects.filter(...).order_by(...).cache()
    # to cache particular ORM request.
    # Invalidation is still automatic
    'news.*': ('just_enable', 60*60),

    # Automatically cache count requests for all other models for 15 min
    '*.*': ('count', 60*15),
}
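
The lookup order suggested by the comments above (a specific model first, then the app wildcard, then `*.*`) can be sketched in plain Python. This is an illustration only: `resolve_profile` is a hypothetical helper, not part of cacheops, and the most-specific-first precedence is an assumption drawn from those comments.

```python
# Hypothetical sketch: how a model label could be matched against CACHEOPS keys.
# Assumes the most specific key wins; this is not cacheops' actual code.
CACHEOPS = {
    'auth.user': ('get', 60*15),
    'auth.*': ('all', 60*60),
    'news.*': ('just_enable', 60*60),
    '*.*': ('count', 60*15),
}

def resolve_profile(app_label, model_name, config=CACHEOPS):
    """Return the (ops, timeout) pair for a model, or None if nothing matches."""
    candidates = (
        '%s.%s' % (app_label, model_name.lower()),  # exact model, e.g. 'auth.user'
        '%s.*' % app_label,                         # any model in the app
        '*.*',                                      # global fallback
    )
    for key in candidates:
        if key in config:
            return config[key]
    return None

print(resolve_profile('auth', 'User'))   # ('get', 900)
print(resolve_profile('auth', 'Group'))  # ('all', 3600)
print(resolve_profile('blog', 'Post'))   # ('count', 900)
```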

Usage

Automatic caching.

It’s automatic; you just need to set it up.

Manual caching.

You can force any queryset to use the cache by calling its .cache() method:

Article.objects.filter(tag=2).cache()

Here you can specify which ops should be cached for the queryset. For example, this code:

from django.core.paginator import Paginator

qs = Article.objects.filter(tag=2).cache(ops=['count'])
paginator = Paginator(qs, ipp)  # ipp is the number of items per page
articles = list(paginator.page(page_num)) # hits database

will cache the .count() call in Paginator but not the later fetch of articles. There are three possible ops: get, fetch and count. You can pass any subset of these ops to the .cache() method, even an empty one to turn off caching. There is, however, a shortcut for that:

qs = Article.objects.filter(visible=True).nocache()
qs1 = qs.filter(tag=2)       # hits database
qs2 = qs.filter(category=3)  # hits it once more

It is useful when you want to disable automatic caching on a particular queryset.

Function caching.

You can cache and invalidate the result of a function the same way as a queryset. The cache of the following function will be invalidated on any Article change, addition or deletion:

from django.db.models import Count
from cacheops import cached_as

@cached_as(Article.objects.all())
def article_stats():
    return {
        'tags': list(Article.objects.values('tag').annotate(count=Count('id'))),
        'categories': list(Article.objects.values('category').annotate(count=Count('id'))),
    }

Note that we are using list on both querysets here because we don’t want to cache queryset objects but their results.

Also note that the cache key does not depend on the arguments of a function, so its result should not either. This is done to enable caching of view functions. If the result does depend on arguments, use a local function instead:

def articles_block(category, count=5):

    @cached_as(Article.objects.filter(category=category), extra=count)
    def _articles_block():
        qs = Article.objects.filter(category=category)
        articles = list(qs.filter(photo=True)[:count])

        if len(articles) < count:
            articles += list(qs[:count-len(articles)])

        return articles

    return _articles_block()

Using a local function gives an additional advantage: we can filter the queryset used in @cached_as() to make invalidation more granular. We also pass extra to produce different keys for calls with the same category but a different count.
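
To see why call arguments have to be captured through `extra`, here is a toy in-memory stand-in for the decorator. `toy_cached_as`, `_cache` and the key layout are all hypothetical and far simpler than what cacheops does with Redis; the only point is that the key is built from the decorated function, the sample and `extra`, never from the call arguments.

```python
import functools

_cache = {}  # stands in for redis in this toy example

def toy_cached_as(sample, extra=None):
    """Hypothetical, simplified analogue of cached_as.

    The key is built from the function name, the sample (standing in for
    the queryset conditions) and `extra`; call arguments are ignored.
    """
    def decorator(func):
        key = (func.__module__, func.__qualname__, sample, extra)

        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if key not in _cache:
                _cache[key] = func(*args, **kwargs)
            return _cache[key]
        return wrapper
    return decorator

calls = []

def articles_block(category, count=5):
    @toy_cached_as(('category', category), extra=count)
    def _articles_block():
        calls.append((category, count))  # record real computations
        return ['article-%d-%d' % (category, i) for i in range(count)]
    return _articles_block()

articles_block(1, count=2)
articles_block(1, count=2)   # served from the toy cache
articles_block(1, count=3)   # different extra, computed again
print(calls)                 # [(1, 2), (1, 3)]
```

Without `extra=count`, the third call would have returned the two-item list cached by the first one.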

Invalidation

Cacheops uses both time-based and event-driven invalidation. The event-driven one listens to model signals and invalidates the appropriate caches on Model.save() and .delete().

Invalidation tries to be granular, which means it won’t invalidate a queryset that cannot be influenced by the added, updated or deleted object, judging by the query conditions. Most of the time this will do what you want; if it doesn’t, you can use one of the following:

from cacheops import invalidate_obj, invalidate_model

invalidate_obj(some_article)  # invalidates queries affected by some_article
invalidate_model(Article)     # invalidates all queries for model

Finally, there is the invalidate command:

./manage.py invalidate articles.Article.34  # same as invalidate_obj
./manage.py invalidate articles.Article     # same as invalidate_model
./manage.py invalidate articles   # invalidate all models in articles

And this one FLUSHES the cacheops Redis database:

./manage.py invalidate all

Don’t use it if you share the Redis database between the cache and something else.
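
The idea of granular invalidation can be illustrated with a small in-memory model. This is a sketch under simplifying assumptions, not cacheops' Redis-based implementation: queries are plain dicts of `field: value` equality conditions, and `ToyCache`, `store` and this `invalidate_obj` method are hypothetical names used for this example only.

```python
class ToyCache(object):
    """Toy event-driven cache keyed by equality conditions (hypothetical)."""

    def __init__(self):
        self.results = {}     # cache key -> cached result
        self.conditions = {}  # cache key -> dict of field: value conditions

    def store(self, key, conditions, result):
        self.results[key] = result
        self.conditions[key] = dict(conditions)

    def invalidate_obj(self, obj):
        """Drop every cached query whose conditions the object satisfies.

        A query with no conditions (like .all()) matches any object.
        """
        stale = [
            key for key, conds in self.conditions.items()
            if all(obj.get(field) == value for field, value in conds.items())
        ]
        for key in stale:
            del self.results[key]
            del self.conditions[key]
        return sorted(stale)

cache = ToyCache()
cache.store('tag2', {'tag': 2}, ['a', 'b'])
cache.store('tag3', {'tag': 3}, ['c'])
cache.store('all', {}, ['a', 'b', 'c'])

# Saving an article with tag=2 cannot affect the tag=3 query, so it survives.
print(cache.invalidate_obj({'id': 7, 'tag': 2}))  # ['all', 'tag2']
print(sorted(cache.results))                      # ['tag3']
```

The sketch ignores an object's previous field values, which an update would also need to take into account, and it only understands equality conditions.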

Jinja2 extension

Add cacheops.jinja2.cache to your extensions and use:

{% cached_as queryset [, timeout=<timeout>] [, extra=<key addition>] %}
    ... some template code ...
{% endcached_as %}

{% cached [timeout=<timeout>] [, extra=<key addition>] %}
    ...
{% endcached %}

These tags work the same way as the corresponding decorators.

CAVEATS

1. Conditions other than __exact or __in don’t provide more granularity for invalidation.
2. Conditions on related models don’t provide it either.
3. Updating a related object fetched through select_related() does not invalidate the cache for the queryset.
4. Mass updates don’t trigger invalidation.
5. ORDER BY and LIMIT/OFFSET don’t affect invalidation.
6. Doesn’t work with RawQuerySet.
7. Conditions on subqueries don’t affect invalidation.

9. Aggregates are not implemented yet.
10. Timeout in queryset and @cached_as() cannot be larger than default.

Here 1, 3, 5 and 10 are part of the design compromise; trying to solve them would make things complicated and slow. 2 and 7 can be implemented if needed, but that is probably counterproductive since one can just break queries into simpler ones, which cache better. 4 is a deliberate choice: making it “right” would flush the cache too much when update conditions are orthogonal to most query conditions. 6 could be cached as SomeModel.objects.all(), but @cached_as() more or less covers that and is more flexible.

Performance tips

Here are some performance tips to make cacheops and the Django ORM faster.

1. When you use the cache, you pickle and unpickle lots of Django model instances, which can be slow. You can optimize Django model serialization with django-pickling.
2. Constructing querysets is rather slow in Django, mainly because most QuerySet methods clone self, then change the clone and return it. The original queryset is usually thrown away. Cacheops adds an .inplace() method, which makes the queryset mutating and prevents useless cloning:
```
items = Item.objects.inplace().filter(category=12).order_by('-date')[:20]
```
You can revert the queryset to the cloning state with a .cloning() call.
3. Further to 2, there is an unfixed bug in Django which sometimes makes queryset cloning very slow. You can use any patch from this ticket to fix it.
4. Use template fragment caching when possible; it is much faster because you don’t need to generate anything. Also, pickling/unpickling a string is much faster than a list of model instances. Cacheops doesn’t provide an extension for Django’s built-in templates for now, but you can adapt django.templatetags.cache to work with cacheops fairly easily (send me a pull request if you do).
5. Run a separate Redis instance for the cache with persistence disabled. You can manually call SAVE or BGSAVE to stay hot upon server restart.
6. If you filter querysets on many different or complex conditions, the cache could degrade performance (compared to uncached db calls) as a result of frequent cache misses. Disable the cache in such cases entirely, or based on some heuristics which detect whether a request is likely to be a hit. E.g. enable the cache only if some primary fields are used in the filter.

Caching querysets with a large number of filters also slows down all subsequent invalidation on that model. You can disable caching if more than a certain number of fields is used in a filter simultaneously.
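
One possible heuristic for the last two tips, sketched in plain Python. The field whitelist, the limit and the `should_cache` helper are hypothetical names and values chosen for illustration; tune them against your own hit rates.

```python
# Hypothetical settings for this sketch; pick values that suit your models.
CACHEABLE_FIELDS = frozenset(['id', 'category', 'tag'])
MAX_FILTER_FIELDS = 2

def should_cache(filter_kwargs):
    """Decide whether a filter is simple enough to be worth caching.

    Caches only when every filtered field is whitelisted and the number
    of distinct fields stays within MAX_FILTER_FIELDS.
    """
    # 'tag__in' -> 'tag': strip any lookup suffix to get the field name
    fields = set(key.split('__', 1)[0] for key in filter_kwargs)
    return len(fields) <= MAX_FILTER_FIELDS and fields <= CACHEABLE_FIELDS

print(should_cache({'category': 12}))                           # True
print(should_cache({'category': 12, 'title__icontains': 'x'}))  # False
print(should_cache({'id': 1, 'category': 2, 'tag': 3}))         # False
```

Given a queryset filtered with such a dict, the result could decide between calling .cache() and .nocache() on it.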

TODO

- docs about simple cache
- docs about file cache
- add .delete(cache_key) method to simple and file cache
- .invalidate() method on simple cached funcs
- queryset brothers
- jinja2 tag for “get random of some list” block with lazy rendering
- make a version of invalidation with scripting
- shard cache between multiple redises
- integrate with prefetch_related()

Download files

Source Distribution

django-cacheops-0.9.5.tar.gz (22.3 kB), uploaded Feb 3, 2013

Hashes for django-cacheops-0.9.5.tar.gz
Algorithm	Hash digest
SHA256	`8495eb1d9de9aa53f7513f13474ba2cc9955d1a27aa51c63fbc07d53ef59e5c8`
MD5	`be132b72437fa419a514b4bd8ecd1e98`
BLAKE2b-256	`12e8c81d59e4f647cf839b621251559fcb0cbd62151eb1c71342b3162dbb3acd`