PyComex - Python Computational Experiments

Microframework to improve the experience of running and managing records of computational experiments, such as machine learning and data science experiments, in Python.

Features

  • Automatically create a (nested) folder structure for the results of each run of an experiment

  • Simply attach metadata, such as performance metrics, to the experiment object and it is automatically stored as a JSON file

  • Easily attach file artifacts, such as matplotlib figures, to experiment records

  • Log messages to stdout and permanently store them in a log file

  • Ready-to-use, automatically generated boilerplate code for the analysis and post-processing of experiment data after the experiment terminates

Installation

Install the stable version with pip:

pip3 install pycomex

Or install the most recent development version from source:

git clone https://github.com/the16thpythonist/pycomex.git
cd pycomex ; pip3 install .
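
A quick way to verify the installation is to print the package version. This assumes pycomex exposes a __version__ attribute, as projects generated from cookiecutter-pypackage typically do:

python3 -c "import pycomex; print(pycomex.__version__)"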

Quickstart

Each computational experiment has to be bundled as a standalone Python module. Important experiment parameters are placed at the top as global variables, and the actual execution of the experiment is wrapped in the Experiment context manager.

Upon entering the context, a new archive folder for each run of the experiment is created.

Archiving of metadata and file artifacts, as well as error handling, is managed automatically on context exit.

# quickstart.py
"""
This docstring will be saved as the "description" metadata of the experiment record
"""
from pycomex.experiment import Experiment
from pycomex.util import Skippable

# Experiment parameters can simply be defined as uppercase global variables.
# These are automatically detected and can optionally be overwritten on
# command line invocation.
HELLO = "hello "
WORLD = "world!"

# Experiment context manager needs 3 positional arguments:
# - Path to an existing folder in which to store the results
# - A namespace name unique for each experiment
# - access to the local globals() dict
with Skippable(), (e := Experiment("/tmp", "example/quickstart", globals())):

    # Internally saved into automatically created nested dict
    # {'strings': {'hello_world': '...'}}
    e["strings/hello_world"] = HELLO + WORLD

    # Alternative to "print". Message is printed to stdout as well as
    # recorded to log file
    e.info("some debug message")

    # Automatically saves text file artifact to the experiment record folder
    file_name = "hello_world.txt"
    e.commit_raw(file_name, HELLO + WORLD)
    # e.commit_fig(file_name, fig)
    # e.commit_png(file_name, image)
    # ...

# All the code inside this context will be copied to the "analysis.py"
# file which will be created as an experiment artifact.
with Skippable(), e.analysis:
    # And we can access all the internal fields of the experiment object
    # and the experiment parameters here!
    print(HELLO, WORLD)
    print(e['strings/hello_world'])
    # logging will print to stdout but not modify the log file
    e.info('analysis done')
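
The commented commit_fig and commit_png lines hint at further commit methods for other artifact types. Below is a minimal sketch of the figure case, assuming the commit_fig(file_name, fig) call signature shown in the comment above; the namespace and file name are illustrative, not prescribed by pycomex:

# figure_example.py
# Hypothetical sketch: attaching a matplotlib figure as an experiment
# artifact. The commit_fig signature is assumed from the commented
# quickstart lines; namespace and file name are illustrative.
import matplotlib.pyplot as plt

from pycomex.experiment import Experiment
from pycomex.util import Skippable

with Skippable(), (e := Experiment("/tmp", "example/figure", globals())):
    fig, ax = plt.subplots()
    ax.plot([0, 1, 2], [0, 1, 4])
    ax.set_title("example plot")
    # The figure is stored in this run's archive folder
    e.commit_fig("plot.pdf", fig)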

Running the quickstart example would create the following folder structure:

tmp
|- results
   |- example
      |- quickstart
         |- 000
            |+ experiment_log.txt     # Contains all the log messages printed by the experiment
            |+ experiment_meta.txt    # Meta information about the experiment
            |+ experiment_data.json   # All the data that was added to the internal exp. dict
            |+ hello_world.txt        # Text artifact that was committed to the experiment
            |+ snapshot.py            # Copy of the original experiment python module
            |+ analysis.py            # Boilerplate code to get started with analysis of results

The analysis.py file is of special importance. It is created as a boilerplate starting point for additional code that performs analysis or post-processing on the results of the experiment. This can, for example, be used to transform data into a different format or to create plots for visualization.

Specifically note these two aspects:

  1. The analysis file contains all of the code which was defined in the e.analysis context of the original experiment file! This code snippet is automatically transferred at the end of the experiment.

  2. The analysis file actually imports the snapshot copy of the original experiment file. This does not trigger the experiment to be executed again! The Experiment instance automatically notices that it is being imported rather than explicitly executed. It also populates all of its internal attributes from the persistently saved data in experiment_data.json, which means it is still possible to access all the data of the experiment without having to execute it again!

# analysis.py

# [...] imports omitted
# Importing the experiment itself
from snapshot import *

PATH = pathlib.Path(__file__).parent.absolute()
DATA_PATH = os.path.join(PATH, 'experiment_data.json')
# Load all the raw data of the experiment
with open(DATA_PATH, mode='r') as json_file:
    DATA: Dict[str, Any] = json.load(json_file)


if __name__ == '__main__':
    print('RAW DATA KEYS:')
    pprint(list(DATA.keys()))

    # ~ The analysis template from the experiment file
    # And we can access all the internal fields of the experiment object
    # and the experiment parameters here!
    print(HELLO, WORLD)
    print(e['strings/hello_world'])
    # logging will print to stdout but not modify the log file
    e.info('analysis done')
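
Since the generated file already loads experiment_data.json into the DATA dict, analysis code can build directly on it. Below is a minimal, hypothetical extension of analysis.py that flattens the nested data and exports it to CSV; the flatten helper and the data_flat.csv file name are illustrative, not part of pycomex:

# Hypothetical extension of analysis.py: export the nested experiment
# data as a flat CSV file. DATA and PATH are defined in the boilerplate
# above; the helper and output file name are illustrative.
import csv
import os
from typing import Any, Dict

def flatten(data: Dict[str, Any], prefix: str = '') -> Dict[str, Any]:
    # Recursively joins nested dict keys into "parent/child" paths,
    # mirroring the "strings/hello_world" key syntax of the experiment
    flat = {}
    for key, value in data.items():
        path = f'{prefix}/{key}' if prefix else key
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat

with open(os.path.join(PATH, 'data_flat.csv'), mode='w', newline='') as csv_file:
    writer = csv.writer(csv_file)
    writer.writerow(['key', 'value'])
    for key, value in flatten(DATA).items():
        writer.writerow([key, value])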

For more information and more interesting examples, visit the documentation: https://pycomex.readthedocs.io

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
