Lazy dict with universally unique identifier for values

ldict

Uniquely identified lazy dict.

Overview

We consider that every value or data object is generated by a process, starting from empty. The process is a sequence of transformation steps of two types: value insertion and function application. Value insertion is done using dict-like objects. The operator >> concatenates the steps chronologically. Each value and each function has a unique deterministic identifier. Identifiers for future values are predictable, so the identifier of a result is known before the result is computed.

Function application is done in the same way. The parameter names define the input fields, while the keys in the returned dict define the output fields. The same holds for anonymous functions. Finally, the result is only evaluated on request.
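The process described above can be mimicked with a small toy class. The sketch below is not ldict's implementation: `LazyDict` is a hypothetical name, identifiers are left out entirely, and every step is simply stored until a field is first read. It only illustrates how `>>` can chain insertions and functions chronologically, how parameter names select the input fields, and how evaluation can be deferred.

```python
import inspect


class LazyDict:
    """Toy stand-in for illustration only; not how ldict is implemented."""

    def __init__(self, steps=()):
        self._steps = tuple(steps)  # chronological steps, not yet executed
        self._cache = None          # filled on first read

    def __rshift__(self, step):
        # A step is either a dict (value insertion) or a callable (function application).
        if not (isinstance(step, dict) or callable(step)):
            return NotImplemented
        return LazyDict(self._steps + (step,))

    def _evaluate(self):
        if self._cache is None:
            fields = {}
            for step in self._steps:
                if isinstance(step, dict):
                    fields.update(step)
                else:
                    # Parameter names define the input fields;
                    # keys of the returned dict define the output fields.
                    names = inspect.signature(step).parameters
                    fields.update(step(**{n: fields[n] for n in names}))
            self._cache = fields
        return self._cache

    def __getitem__(self, key):
        return self._evaluate()[key]


calls = []


def power(x, y):
    calls.append("power")
    return {"p": x ** y}


d = LazyDict() >> {"x": 3} >> {"y": 5} >> power >> (lambda p, x: {"r": p // x})
print(calls)   # [] : nothing has been evaluated so far
print(d["r"])  # 81
print(calls)   # ['power']
```

Unlike this toy, ldict knows the output field names before evaluation (the examples further down print a field such as "r" as "→(x y z)" until it is requested).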

Installation

...as a standalone lib

# Set up a virtualenv. 
python3 -m venv venv
source venv/bin/activate

# Install from PyPI...
pip install --upgrade pip
pip install -U ldict

# ...or, install from updated source code.
pip install git+https://github.com/davips/ldict

...from source

git clone https://github.com/davips/ldict
cd ldict
poetry install

Examples

Merging two ldicts

from ldict import ldict

a = ldict(x=3)
print(a)
"""
{
    "id": "kr_4aee5c3bcac2c478be9901d57fd1ef8a9d002",
    "ids": "kr_4aee5c3bcac2c478be9901d57fd1ef8a9d002",
    "x": 3
}
"""

b = ldict(y=5)
print(b)
"""
{
    "id": "Uz_0af6d78f77734fad67e6de7cdba3ea368aae4",
    "ids": "Uz_0af6d78f77734fad67e6de7cdba3ea368aae4",
    "y": 5
}
"""

print(a >> b)
"""
{
    "id": "c._2b0434ca422114262680df425b85cac028be6",
    "ids": "kr_4aee5c3bcac2c478be9901d57fd1ef8a9d002 Uz_0af6d78f77734fad67e6de7cdba3ea368aae4",
    "x": 3,
    "y": 5
}
"""

Lazily applying functions to ldict

from ldict import ldict

a = ldict(x=3)
print(a)
"""
{
    "id": "kr_4aee5c3bcac2c478be9901d57fd1ef8a9d002",
    "ids": "kr_4aee5c3bcac2c478be9901d57fd1ef8a9d002",
    "x": 3
}
"""

a = a >> ldict(y=5) >> {"z": 7} >> (lambda x, y, z: {"r": x ** y // z})
print(a)
"""
{
    "id": "8jopGVdtSEyCk1NSKcrEF-Lfv8up9MQBdvkLxU2o",
    "ids": "J3tsy4vUXPELySBicaAy-h-UK7Dp9MQBdvkLxU2o... +2 ...Ss_7dff0a161ba7462725cac7dcee71b67669f69",
    "r": "→(x y z)",
    "x": 3,
    "y": 5,
    "z": 7
}
"""

print(a.r)
"""
34
"""

print(a)
"""
{
    "id": "8jopGVdtSEyCk1NSKcrEF-Lfv8up9MQBdvkLxU2o",
    "ids": "J3tsy4vUXPELySBicaAy-h-UK7Dp9MQBdvkLxU2o... +2 ...Ss_7dff0a161ba7462725cac7dcee71b67669f69",
    "r": 34,
    "x": 3,
    "y": 5,
    "z": 7
}
"""

Parameterized functions and sampling

from random import Random

from ldict import ø
from ldict.cfg import cfg


# A function provides input fields and, optionally, parameters.
# 'a' is sampled from an arithmetic progression
# 'b' is sampled from a geometric progression
# Here, the syntax for default parameter values is borrowed with a new meaning.
def fun(x, y, a=[-100, -99, -98, ..., 100], b=[0.0001, 0.001, 0.01, ..., 100000000]):
    return {"z": a * x + b * y}


# Creating an empty ldict. Alternatively: d = ldict().
d = ø >> {}
d.show(colored=False)
"""
{
    "id": "0000000000000000000000000000000000000000",
    "ids": {}
}
"""

# Putting some values. Alternatively: d = ldict(x=5, y=7).
d["x"] = 5
d["y"] = 7
d.show(colored=False)
"""
{
    "id": "I0_39c94b4dfbc7a8579ca1304eba25917204a5e",
    "ids": {
        "x": "Tz_d158c49297834fad67e6de7cdba3ea368aae4",
        "y": "Rs_92162dea64a7462725cac7dcee71b67669f69"
    },
    "x": 5,
    "y": 7
}
"""

# Parameter values are uniformly sampled.
d1 = d >> fun
d.show(colored=False)
"""
{
    "id": "I0_39c94b4dfbc7a8579ca1304eba25917204a5e",
    "ids": {
        "x": "Tz_d158c49297834fad67e6de7cdba3ea368aae4",
        "y": "Rs_92162dea64a7462725cac7dcee71b67669f69"
    },
    "x": 5,
    "y": 7
}
"""

print(d1.z)
"""
69610.0
"""

d2 = d >> fun
d.show(colored=False)
"""
{
    "id": "I0_39c94b4dfbc7a8579ca1304eba25917204a5e",
    "ids": {
        "x": "Tz_d158c49297834fad67e6de7cdba3ea368aae4",
        "y": "Rs_92162dea64a7462725cac7dcee71b67669f69"
    },
    "x": 5,
    "y": 7
}
"""

print(d2.z)
"""
-29.93
"""

# Parameter values can also be manually set.
e = d >> cfg(a=5, b=10) >> fun
print(e.z)
"""
95
"""

# Not all parameters need to be set.
e = d >> cfg(a=5) >> fun
print(e.z)
"""
70025.0
"""

# Each run will be a different sample for the missing parameters.
e = e >> cfg(a=5) >> fun
print(e.z)
"""
95.0
"""

# The metaparameter 'rnd' defines the initial state of the random sampler used from this point onwards when processing the ldict.
e = d >> cfg(a=5)(rnd=0) >> fun
print(e.z)
"""
725.0
"""

# All runs will yield the same result, if starting from the same random number generator seed.
e = e >> cfg(a=5)(rnd=0) >> fun
print(e.z)
"""
725.0
"""

# Reproducible different runs are achievable by passing a stateful random number generator, instead of a seed.
rnd = Random(0)
e = d >> cfg(a=5)(rnd=rnd) >> fun
print(e.z)
"""
725.0
"""

e = d >> cfg(a=5)(rnd=rnd) >> fun
print(e.z)
"""
700000025.0
"""
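The listing above relies on two ideas: a list with an ellipsis stands for a whole progression, and the 'rnd' metaparameter is either a seed or a stateful generator. The following is a minimal standalone sketch of both, assuming the list always has the form [first, second, third, ..., last]. The helpers `expand` and `sample` are hypothetical names introduced here for illustration; they are not part of ldict, and ldict's actual sampling may differ.

```python
import math
from random import Random


def expand(spec):
    """Expand [a, b, c, ..., end] into the full progression (hypothetical helper)."""
    if Ellipsis not in spec:
        return list(spec)
    a, b, c, _, end = spec
    if math.isclose(b - a, c - b):
        # Arithmetic progression: constant difference between neighbours.
        step = b - a
        n = round((end - a) / step) + 1
        return [a + i * step for i in range(n)]
    if math.isclose(b / a, c / b):
        # Geometric progression: constant ratio between neighbours.
        ratio = b / a
        n = round(math.log(end / a, ratio)) + 1
        return [a * ratio ** i for i in range(n)]
    raise ValueError("neither an arithmetic nor a geometric progression")


def sample(spec, rnd):
    """Pick one value uniformly; rnd is an int seed or a stateful Random."""
    rng = rnd if isinstance(rnd, Random) else Random(rnd)
    return rng.choice(expand(spec))


a_values = [-100, -99, -98, ..., 100]
print(len(expand(a_values)))          # 201
print(expand([1, 2, 4, ..., 32]))     # [1.0, 2.0, 4.0, 8.0, 16.0, 32.0]

# A seed restarts the sampler on every call, so the sample repeats.
print(sample(a_values, 0) == sample(a_values, 0))  # True

# A shared generator advances between calls; recreating it from the
# same seed replays the same sequence of (possibly different) samples.
rng = Random(0)
first_run = [sample(a_values, rng) for _ in range(3)]
rng = Random(0)
second_run = [sample(a_values, rng) for _ in range(3)]
print(first_run == second_run)        # True
```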

Composition of sets of functions

from random import Random

from ldict import ø
from ldict.cfg import cfg


# A multistep process can be defined without applying its functions.
def g(x, y, a=[1, 2, 3, ..., 10], b=[0.00001, 0.0001, 0.001, ..., 100000]):
    return {"z": a * x + b * y}


def h(z, c=[1, 2, 3]):
    return {"z": c * z}


# In the ldict framework 'data is function',
# so the alias ø represents the 'empty data object' and the 'reflexive function' at the same time.
# In other words: 'inserting nothing' has the same effect as 'doing nothing'.
fun = ø * g * h  # ø enables the cartesian product of the subsequent sets of functions within the expression.
print(fun)
"""
g×h
"""

# The difference between 'ø * g * h' and 'ldict(x=3) >> g >> h' is that the functions in the latter are already applied
# (resulting in an ldict object). The former still has its free parameters unsampled,
# and results in an ordered set of composite functions.
# It is a set because the parameter values of the functions are still undefined.
d = {"x": 5, "y": 7} >> fun
print(d)
"""
{
    "id": "VNqIlnelOh9VJbgIBTtsB20MjhrHDycdeBrDsE9V",
    "ids": "LzRJyJ7ApyJLoJ.OVI8m.1sp56rHDycdeBrDsE9V... +1 ...Rs_92162dea64a7462725cac7dcee71b67669f69",
    "z": "→(z→(x y a b) c)",
    "x": 5,
    "y": 7
}
"""

print(d.z)
"""
2100090.0
"""

d = {"x": 5, "y": 7} >> fun
print(d.z)
"""
94.0
"""

# Reproducible different runs by passing a stateful random number generator.
rnd = Random(0)
e = d >> cfg()(rnd=rnd) >> fun
print(e.z)
"""
105.0
"""

e = d >> cfg()(rnd=rnd) >> fun
print(e.z)
"""
14050.0
"""

rnd = Random(0)
e = d >> cfg()(rnd=rnd) >> fun
print(e.z)
"""
105.0
"""

e = d >> cfg()(rnd=rnd) >> fun
print(e.z)
"""
14050.0
"""
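One way to read "set of functions" in the listing above: a function with list-valued defaults stands for one concrete function per combination of parameter values, and composing two such sets yields their cartesian product. The sketch below spells this out with the standard library only. The helpers `variants` and `pipeline` are hypothetical, the parameter lists are shortened and written out explicitly, and none of this is claimed to match ldict's internals.

```python
import inspect
from functools import partial
from itertools import product


def variants(func):
    """One partially applied function per combination of list-valued defaults."""
    params = inspect.signature(func).parameters
    free = {name: p.default for name, p in params.items() if isinstance(p.default, list)}
    return [partial(func, **dict(zip(free, combo))) for combo in product(*free.values())]


def g(x, y, a=[1, 2, 3], b=[1, 10]):
    return {"z": a * x + b * y}


def h(z, c=[1, 2, 3]):
    return {"z": c * z}


def pipeline(g_variant, h_variant):
    """Composite function: apply one variant of g, then one variant of h."""
    def run(x, y):
        return h_variant(**g_variant(x=x, y=y))
    return run


# Cartesian product of the two sets of concrete functions.
composites = [pipeline(gv, hv) for gv, hv in product(variants(g), variants(h))]
print(len(composites))             # 18, i.e. (3 * 2) variants of g times 3 variants of h
print(composites[0](x=5, y=7))     # {'z': 12}   with a=1, b=1, c=1
print(composites[-1](x=5, y=7))    # {'z': 255}  with a=3, b=10, c=3
```

Sampling one element of `composites` at random corresponds to fixing all free parameters at once, which is what happens when the composite is finally applied to data.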

Concept

An ldict is like a regular Python dict, but lazy and with extra functionality. It is a mapping between string keys, called fields, and any serializable object. The ldict id (identifier) and the field ids are also part of the mapping.

The user can provide a unique identifier (hosh) for each function or value object. Otherwise, identifiers are calculated through blake3 hashing of the content of the data or the bytecode of the function. For this reason, such functions should be simple, i.e., with minimal external dependencies, to avoid the unfortunate situation where two functions with identical local code actually perform different calculations through calls to external code that implements different algorithms under the same name.
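The caveat about external dependencies can be reproduced in a few lines. This sketch uses `hashlib.blake2b` from the standard library as a stand-in for blake3, and the helpers `value_id` and `function_id` are hypothetical; the resulting digests are not ldict identifiers. It only shows why hashing the local bytecode cannot tell apart two functions that differ solely in the code they call.

```python
import hashlib
import pickle


def value_id(value):
    """Digest of the serialized content (blake2b used here instead of blake3)."""
    return hashlib.blake2b(pickle.dumps(value, protocol=4), digest_size=20).hexdigest()


def function_id(func):
    """Digest of the function's own bytecode only; called code is not inspected."""
    return hashlib.blake2b(func.__code__.co_code, digest_size=20).hexdigest()


def make(helper):
    # Both returned functions share the same local code, but call different helpers.
    def f(x):
        return helper(x)
    return f


f_abs = make(abs)
f_neg = make(lambda v: -v)

print(value_id(3) == value_id(3))                # True: same content, same digest
print(value_id(3) == value_id(5))                # False
print(function_id(f_abs) == function_id(f_neg))  # True: identical local bytecode
print(f_abs(3), f_neg(3))                        # 3 -3: yet different results
```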

Grants

This work was partially supported by Fapesp under supervision of Prof. André C. P. L. F. de Carvalho at CEPID-CeMEAI (Grants 2013/07375-0 – 2019/01735-0).
