Skip to main content

Package provides Binary-, RedBlack- and AVL-Trees in Python and Cython.

Project description

Binary Tree Package

Bintrees Development Stopped

Use sortedcontainers instead: https://pypi.python.org/pypi/sortedcontainers

see also PyCon 2016 presentation: https://www.youtube.com/watch?v=7z2Ki44Vs4E

Advantages:

  • pure Python no Cython/C dependencies

  • faster

  • active development

  • more & better testing/profiling

Abstract

This package provides Binary- RedBlack- and AVL-Trees written in Python and Cython/C.

This Classes are much slower than the built-in dict class, but all iterators/generators yielding data in sorted key order. Trees can be uses as drop in replacement for dicts in most cases.

Source of Algorithms

AVL- and RBTree algorithms taken from Julienne Walker: http://eternallyconfuzzled.com/jsw_home.aspx

Trees written in Python

  • BinaryTree – unbalanced binary tree

  • AVLTree – balanced AVL-Tree

  • RBTree – balanced Red-Black-Tree

Trees written with C-Functions and Cython as wrapper

  • FastBinaryTree – unbalanced binary tree

  • FastAVLTree – balanced AVL-Tree

  • FastRBTree – balanced Red-Black-Tree

All trees provides the same API, the pickle protocol is supported.

Cython-Trees have C-structs as tree-nodes and C-functions for low level operations:

  • insert

  • remove

  • get_value

  • min_item

  • max_item

  • prev_item

  • succ_item

  • floor_item

  • ceiling_item

Constructor

  • Tree() -> new empty tree;

  • Tree(mapping) -> new tree initialized from a mapping (requires only an items() method)

  • Tree(seq) -> new tree initialized from seq [(k1, v1), (k2, v2), … (kn, vn)]

Methods

  • __contains__(k) -> True if T has a key k, else False, O(log(n))

  • __delitem__(y) <==> del T[y], del[s:e], O(log(n))

  • __getitem__(y) <==> T[y], T[s:e], O(log(n))

  • __iter__() <==> iter(T)

  • __len__() <==> len(T), O(1)

  • __max__() <==> max(T), get max item (k,v) of T, O(log(n))

  • __min__() <==> min(T), get min item (k,v) of T, O(log(n))

  • __and__(other) <==> T & other, intersection

  • __or__(other) <==> T | other, union

  • __sub__(other) <==> T - other, difference

  • __xor__(other) <==> T ^ other, symmetric_difference

  • __repr__() <==> repr(T)

  • __setitem__(k, v) <==> T[k] = v, O(log(n))

  • __copy__() -> shallow copy support, copy.copy(T)

  • __deepcopy__() -> deep copy support, copy.deepcopy(T)

  • clear() -> None, remove all items from T, O(n)

  • copy() -> a shallow copy of T, O(n*log(n))

  • discard(k) -> None, remove k from T, if k is present, O(log(n))

  • get(k[,d]) -> T[k] if k in T, else d, O(log(n))

  • is_empty() -> True if len(T) == 0, O(1)

  • items([reverse]) -> generator for (k, v) items of T, O(n)

  • keys([reverse]) -> generator for keys of T, O(n)

  • values([reverse]) -> generator for values of T, O(n)

  • pop(k[,d]) -> v, remove specified key and return the corresponding value, O(log(n))

  • pop_item() -> (k, v), remove and return some (key, value) pair as a 2-tuple, O(log(n)) (synonym popitem() exist)

  • set_default(k[,d]) -> value, T.get(k, d), also set T[k]=d if k not in T, O(log(n)) (synonym setdefault() exist)

  • update(E) -> None. Update T from dict/iterable E, O(E*log(n))

  • foreach(f, [order]) -> visit all nodes of tree (0 = ‘inorder’, -1 = ‘preorder’ or +1 = ‘postorder’) and call f(k, v) for each node, O(n)

  • iter_items(s, e[, reverse]) -> generator for (k, v) items of T for s <= key < e, O(n)

  • remove_items(keys) -> None, remove items by keys, O(n)

slicing by keys

  • item_slice(s, e[, reverse]) -> generator for (k, v) items of T for s <= key < e, O(n), synonym for iter_items(…)

  • key_slice(s, e[, reverse]) -> generator for keys of T for s <= key < e, O(n)

  • value_slice(s, e[, reverse]) -> generator for values of T for s <= key < e, O(n)

  • T[s:e] -> TreeSlice object, with keys in range s <= key < e, O(n)

  • del T[s:e] -> remove items by key slicing, for s <= key < e, O(n)

start/end parameter:

  • if ‘s’ is None or T[:e] TreeSlice/iterator starts with value of min_key();

  • if ‘e’ is None or T[s:] TreeSlice/iterator ends with value of max_key();

  • T[:] is a TreeSlice which represents the whole tree;

The step argument of the regular slicing syntax T[s:e:step] will silently ignored.

TreeSlice is a tree wrapper with range check and contains no references to objects, deleting objects in the associated tree also deletes the object in the TreeSlice.

  • TreeSlice[k] -> get value for key k, raises KeyError if k not exists in range s:e

  • TreeSlice[s1:e1] -> TreeSlice object, with keys in range s1 <= key < e1
    • new lower bound is max(s, s1)

    • new upper bound is min(e, e1)

TreeSlice methods:

  • items() -> generator for (k, v) items of T, O(n)

  • keys() -> generator for keys of T, O(n)

  • values() -> generator for values of T, O(n)

  • __iter__ <==> keys()

  • __repr__ <==> repr(T)

  • __contains__(key)-> True if TreeSlice has a key k, else False, O(log(n))

prev/succ operations

  • prev_item(key) -> get (k, v) pair, where k is predecessor to key, O(log(n))

  • prev_key(key) -> k, get the predecessor of key, O(log(n))

  • succ_item(key) -> get (k,v) pair as a 2-tuple, where k is successor to key, O(log(n))

  • succ_key(key) -> k, get the successor of key, O(log(n))

  • floor_item(key) -> get (k, v) pair, where k is the greatest key less than or equal to key, O(log(n))

  • floor_key(key) -> k, get the greatest key less than or equal to key, O(log(n))

  • ceiling_item(key) -> get (k, v) pair, where k is the smallest key greater than or equal to key, O(log(n))

  • ceiling_key(key) -> k, get the smallest key greater than or equal to key, O(log(n))

Heap methods

  • max_item() -> get largest (key, value) pair of T, O(log(n))

  • max_key() -> get largest key of T, O(log(n))

  • min_item() -> get smallest (key, value) pair of T, O(log(n))

  • min_key() -> get smallest key of T, O(log(n))

  • pop_min() -> (k, v), remove item with minimum key, O(log(n))

  • pop_max() -> (k, v), remove item with maximum key, O(log(n))

  • nlargest(i[,pop]) -> get list of i largest items (k, v), O(i*log(n))

  • nsmallest(i[,pop]) -> get list of i smallest items (k, v), O(i*log(n))

Set methods (using frozenset)

  • intersection(t1, t2, …) -> Tree with keys common to all trees

  • union(t1, t2, …) -> Tree with keys from either trees

  • difference(t1, t2, …) -> Tree with keys in T but not any of t1, t2, …

  • symmetric_difference(t1) -> Tree with keys in either T and t1 but not both

  • is_subset(S) -> True if every element in T is in S (synonym issubset() exist)

  • is_superset(S) -> True if every element in S is in T (synonym issuperset() exist)

  • is_disjoint(S) -> True if T has a null intersection with S (synonym isdisjoint() exist)

Classmethods

  • from_keys(S[,v]) -> New tree with keys from S and values equal to v. (synonym fromkeys() exist)

Helper functions

  • bintrees.has_fast_tree_support() -> True if Cython extension is working else False (False = using pure Python implementation)

Installation

from source:

python setup.py install

or from PyPI:

pip install bintrees

Compiling the fast Trees requires Cython and on Windows is a C-Compiler necessary.

Download Binaries for Windows

https://github.com/mozman/bintrees/releases

Documentation

this README.rst

bintrees can be found on GitHub.com at:

https://github.com/mozman/bintrees.git

NEWS

Version 2.1.0 - 2020-01-02

Version 2.0.7 - 2017-04-28

  • BUGFIX: foreach (pure Python implementation) works with empty trees

  • acquire GIL for PyMem_Malloc() and PyMem_Free() calls

Version 2.0.6 - 2017-02-04

  • BUGFIX: correct deepcopy() for tree in tree

Version 2.0.5 - 2017-02-04

  • support for copy.deepcopy()

  • changed status back to Mature, there will be: bugfixes, compatibility checks and simple additions like this deep copy support, because I got feedback, that there are some special cases in which bintrees can be useful.

  • switched development to 64bit only & MS compilers - on Windows 7 everything works fine now with CPython 2.7/3.5/3.6

Repository moved to GitHub: https://github.com/mozman/bintrees.git

Version 2.0.4 - 2016-01-09

  • removed logging statements on import

  • added helper function bintrees.has_fast_tree_support()

  • HINT: pypy runs faster than CPython with Cython extension

Version 2.0.3 - 2016-01-06

  • replaced print function by logging.warning for import warning messages

  • KNOWN ISSUE: unable to build Cython extension with MingW32 and CPython 3.5 & CPython 2.7.10

Version 2.0.2 - 2015-02-12

  • fixed foreach cython-function by Sam Yaple

Version 2.0.1 - 2013-10-01

  • removed __del__() method to avoid problems with garbage collection

Version 2.0.0 - 2013-06-01

  • API change: consistent method naming with synonyms for dict/set compatibility

  • code base refactoring

  • removed tree walkers

  • removed low level node stack implementation -> caused crashes

  • optimizations for pypy: iter_items(), succ_item(), prev_item()

  • tested with CPython2.7, CPython3.3, pypy-2.0 on Win7 and Linux Mint 15 x64 (pypy-1.9)

Version 1.0.3 - 2013-05-01

  • extended iter_items(startkey=None, endkey=None, reverse=reverse) -> better performance for slicing

  • Cython implementation of iter_items() for Fast_X_Trees()

  • added key parameter reverse to itemslice(), keyslice(), valueslice()

  • tested with CPython2.7, CPython3.3, pypy-2.0

Version 1.0.2 - 2013-04-01

  • bug fix: FastRBTree data corruption on inserting existing keys

  • bug fix: union & symmetric_difference - copy all values to result tree

Version 1.0.1 - 2013-02-01

  • bug fixes

  • refactorings by graingert

  • skip useless tests for pypy

  • new license: MIT License

  • tested with CPython2.7, CPython3.2, CPython3.3, pypy-1.9, pypy-2.0-beta1

  • unified line endings to LF

  • PEP8 refactorings

  • added floor_item/key, ceiling_item/key methods, thanks to Dai Mikurube

Version 1.0.0 - 2011-12-29

  • bug fixes

  • status: 5 - Production/Stable

  • removed useless TreeIterator() class and T.treeiter() method.

  • patch from Max Motovilov to use Visual Studio 2008 for building C-extensions

Version 0.4.0 - 2011-04-14

  • API change!!!

  • full Python 3 support, also for Cython implementations

  • removed user defined compare() function - keys have to be comparable!

  • removed T.has_key(), use ‘key in T’

  • keys(), items(), values() generating ‘views’

  • removed iterkeys(), itervalues(), iteritems() methods

  • replaced index slicing by key slicing

  • removed index() and item_at()

  • repr() produces a correct representation

  • installs on systems without cython (tested with pypy)

  • new license: GNU Library or Lesser General Public License (LGPL)

Version 0.3.2 - 2011-04-09

  • added itemslice(startkey, endkey), keyslice(startkey, endkey), valueslice(startkey, endkey) - slicing by keys

  • tested with pypy 1.4.1, damn fast

  • Pure Python trees are working with Python 3

  • No Cython implementation for Python 3

Version 0.3.1 - 2010-09-10

  • runs with Python 2.7

Version 0.3.0 - 2010-05-11

  • low level functions written as c-module only interface to python is a cython module

  • support for the pickle protocol

Version 0.2.1 - 2010-05-06

  • added delslice del T[0:3] -> remove treenodes 0, 1, 2

  • added discard -> remove key without KeyError if not found

  • added heap methods: min, max, nlarges, nsmallest …

  • added Set methods -> intersection, differnce, union, …

  • added slicing: T[5:10] get items with position (not key!) 5, 6, 7, 8, 9

    T[5] get item with key! 5

  • added index: T.index(key) -> get position of item <key>

  • added item_at: T.item_at(0) -> get item at position (not key!) 0

    T.item_at(0) O(n)! <==> T.min_item() O(log(n))

Version 0.2.0 - 2010-05-03

  • TreeMixin Class as base for Python-Trees and as Mixin for Cython-Trees

Version 0.1.0 - 2010-04-27

  • Alpha status

  • Initial release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bintrees-2.2.0.zip (108.3 kB view hashes)

Uploaded Source

Built Distributions

bintrees-2.2.0-cp39-cp39-win_amd64.whl (64.2 kB view hashes)

Uploaded CPython 3.9 Windows x86-64

bintrees-2.2.0-cp39-cp39-macosx_10_14_x86_64.whl (60.7 kB view hashes)

Uploaded CPython 3.9 macOS 10.14+ x86-64

bintrees-2.2.0-cp38-cp38-win_amd64.whl (64.3 kB view hashes)

Uploaded CPython 3.8 Windows x86-64

bintrees-2.2.0-cp38-cp38-macosx_10_14_x86_64.whl (59.4 kB view hashes)

Uploaded CPython 3.8 macOS 10.14+ x86-64

bintrees-2.2.0-cp37-cp37m-win_amd64.whl (62.9 kB view hashes)

Uploaded CPython 3.7m Windows x86-64

bintrees-2.2.0-cp37-cp37m-macosx_10_14_x86_64.whl (59.0 kB view hashes)

Uploaded CPython 3.7m macOS 10.14+ x86-64

bintrees-2.2.0-cp36-cp36m-win_amd64.whl (63.0 kB view hashes)

Uploaded CPython 3.6m Windows x86-64

bintrees-2.2.0-cp36-cp36m-macosx_10_14_x86_64.whl (60.3 kB view hashes)

Uploaded CPython 3.6m macOS 10.14+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page