Skip to main content

Puristaa (Finnish for compress) - shared prefix compression of ordered string sequences.

Project description

puristaa

Puristaa (Finnish for compress)

License: MIT

Provide a memory neutral and fast implementation that is clear to understand and works for the use case of extracting a common prefix of a sequence of strings and yielding that prefix and a generator for the compressed strings of the original sequence.

The latter is not yet clear. One simple implementation path is to receive a reference to a sequence, determine min-max, compare min to max by character and break with first index of enumaeration not matching. Finally yield first the prefix, subsequently all strings of the sequence each shortened by the prefix.

Another still open question is, if a composition of functions can be construed, such that the prefix finding algorithm respects a caller provided policy representing the inner structure of the strings. An example for an inner structure is a sequence of paths as strings. A path policy would provide a separator that modifies the prefix finding algorithm to backtrack to the previous such separator token.

why? well, imagine the sequence 'bar/baz', 'bar/bazaar' and a path policy providing the boolean predicate lambda x: x == '/' should not yield 'bar/baz', '', 'aar' but instead 'bar/', 'baz', 'bazaar'.

version downloads wheel supported-versions supported-implementations

Documentation

User and developer documentation of puristaa.

Bug Tracker

Feature requests and bug reports are bested entered in the todos of puristaa.

Primary Source repository

The primary source repository of puristaa is at sourcehut a collection of tools useful for software development.

Status

Experimental

Note: The default branch is default.

Use

Commandline API

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

puristaa-2022.7.24.tar.gz (12.1 kB view hashes)

Uploaded Source

Built Distribution

puristaa-2022.7.24-py2.py3-none-any.whl (6.9 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page