pypet

A toolkit for numerical simulations to allow easy parameter exploration and storage of results.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering
- Utilities

Project description

pypet

The new python parameter exploration toolkit: pypet manages exploration of the parameter space and data storage into HDF5 files for you.

IMPORTANT!

The program is currently under development, please keep that in mind and use it very carefully.

Before publishing the official 0.1.0 release I will integrate pypet first in my own research project. Thus, I have a more profound testing environment than only using unittests. Accordingly, you still have to deal with the naming 0.1b.X for a little while. However, unless it is really, really, really necessary I do not plan to change the API anymore. So feel free to use this beta version and feel free to give feedback, suggestions, and report bugs. Use github (https://github.com/SmokinCaterpillar/pypet) issues or write to the pypet Google Group (https://groups.google.com/forum/?hl=de#!forum/pypet) :-)

Thanks!

Requirements

Python 2.6 or 2.7

tables >= 2.3.1
pandas >= 0.12.0
numpy >= 1.6.1
scipy >= 0.9.0

For git integration you additionally need

GitPython >= 0.3.1

To utilize the cap feature for multiprocessing you need

psutil >= 2.0.0

If you use Python 2.6 you also need

ordereddict >= 1.1

What is pypet all about?

Whenever you do numerical simulations in science, you come across two major challenges. First, you need some way to save your data. Secondly, you extensively explore the parameter space. In order to accomplish both you write some hacky I/O functionality to get it done the quick and dirty way. This means storing stuff into text files, as MATLAB m-files, or whatever comes in handy.

After a while and many simulations later, you want to look back at some of your very first results. But because of unforeseen circumstances, you changed a lot of your code. As a consequence, you can no longer use your old data, but you need to write a hacky converter to format your previous results to your new needs. The more complexity you add to your simulations, the worse it gets, and you spend way too much time formatting your data than doing science.

Indeed, this was a situation I was confronted with pretty soon at the beginning of my PhD. So this project was born. I wanted to tackle the I/O problems more generally and produce code that was not specific to my current simulations, but I could also use for future scientific projects right out of the box.

The python parameter exploration toolkit (pypet) provides a framework to define parameters that you need to run your simulations. You can actively explore these by following a trajectory through the space spanned by the parameters. And finally, you can get your results together and store everything appropriately to disk. The storage format of choice is HDF5 (http://www.hdfgroup.org/HDF5/) via PyTables (http://www.pytables.org/).

Package Organization

This project encompasses these core modules:

The pypet.environment module for handling the running of simulations
The pypet.trajectory module for managing the parameters and results, and providing a way to explore your parameter space. Somewhat related is also the pypet.naturalnaming module, that provides functionality to access and put data into the trajectory.
The pypet.parameters module including containers for parameters and results
The pypet.storageservice for saving your data to disk

Install

Simply install via pip install –pre pypet (–pre since the current version is still beta)

Package release can also be found on https://pypi.python.org/pypi/pypet. Download, unpack and python setup.py install it.

pypet has been tested for python 2.6 and python 2.7 for Linux using Travis-CI (https://www.travis-ci.org/). However, so far there was only limited testing under Windows.

In principle, pypet should work for Windows out of the box if you have installed all prerequisites (pytables, pandas, scipy, numpy). Yet, installing with pip is not possible. You have to download the tar file from https://pypi.python.org/pypi/pypet and unzip it (using WinRaR, 7zip, etc. You might need to unpack it twice, first the tar.gz file and then the remaining tar file in the subfolder). Next, open a windows terminal and navigate to your unpacked pypet files to the folder containing the setup.py file. As above run from the terminal python setup.py install.

By the way, the source code is available at https://github.com/SmokinCaterpillar/pypet/.

Documentation and Support

Documentation can be found on http://pypet.readthedocs.org/.

There is a Google Groups mailing list for support: https://groups.google.com/forum/?hl=de#!forum/pypet

If you have any further questions feel free to contact me at robert.meyer (at) ni.tu-berlin.de.

Main Features

Novel tree container Trajectory, for handling and managing of parameters and results of numerical simulations
Group your parameters and results into meaningful categories
Access data via natural naming, e.g. traj.parameters.traffic.ncars
Automatic storage of simulation data into HDF5 files via PyTables
Support for many different data formats
- python native data types: bool, int, long, float, str, complex
- list, tuple, dict
- Numpy arrays and matrices
- Scipy sparse matrices
- pandas DataFrames (http://pandas.pydata.org/)
- BRIAN quantities and monitors (http://briansimulator.org/)
Easily extendable to other data formats!
Exploration of the parameter space of your simulations
Merging of trajectories residing in the same space
Support for multiprocessing, pypet can run your simulations in parallel
Dynamic Loading, load only the parts of your data you currently need
Resume a crashed or halted simulation
Annotate your parameters, results and groups
Git Integration, let pypet make automatic commits of your codebase

Quick Working Example

The best way to show how stuff works is by giving examples. I will start right away with a very simple code snippet.

Well, what we have in mind is some sort of numerical simulation. For now we will keep it simple, let’s say we need to simulate the multiplication of 2 values, i.e. z=x*y. We have two objectives, a) we want to store results of this simulation z and b) we want to explore the parameter space and try different values of x and y.

Let’s take a look at the snippet at once:

from pypet.environment import Environment
from pypet.utils.explore import cartesian_product

def multiply(traj):
    """Example of a sophisticated simulation that involves multiplying two values.

    :param traj:

        Trajectory containing the parameters in a particular combination,
        it also serves as a container for results.

    """
    z=traj.x * traj.y
    traj.f_add_result('z',z, comment='I am the product of two values!')

# Create an environment that handles running our simulation
env = Environment(trajectory='Multiplication',filename='./HDF/example_01.hdf5',
                  file_title='Example_01', log_folder='./LOGS/')

# Get the trajectory from the environment
traj = env.v_trajectory

# Add both parameters
traj.f_add_parameter('x', 1.0, comment='Im the first dimension!')
traj.f_add_parameter('y', 1.0, comment='Im the second dimension!')

# Explore the parameters with a cartesian product
traj.f_explore(cartesian_product({'x':[1.0,2.0,3.0,4.0], 'y':[6.0,7.0,8.0]}))

# Run the simulation with all parameter combinations
env.f_run(multiply)

And now let’s go through it one by one. At first we have a job to do, that is multiplying two values:

def multiply(traj):
    """Example of a sophisticated simulation that involves multiplying two values.

    :param traj:

        Trajectory containing the parameters in a particular combination,
        it also serves as a container for results.

    """
    z=traj.x * traj.y
    traj.f_add_result('z',z, comment='I am the product of two values!')

This is our simulation function multiply. The function uses a so called trajectory container which manages our parameters. We can access the parameters simply by natural naming, as seen above via traj.x and traj.y. The value of z is simply added as a result to the traj object.

After the definition of the job that we want to simulate, we create an environment which will run the simulation.

# Create an environment that handles running our simulation
env = Environment(trajectory='Multiplication',filename='./HDF/example_01.hdf5',
                  file_title='Example_01', log_folder='./LOGS/',
                  comment = 'I am the first example!')

The environment uses some parameters here, that is the name of the new trajectory, a filename to store the trajectory into, the title of the file, a folder for the log files, and a comment that is added to the trajectory. There are more options available like the number of processors for multiprocessing or how verbose the final HDF5 file is supposed to be. Check out the documentation (http://pypet.readthedocs.org/) if you want to know more. The environment will automatically generate a trajectory for us which we can access via:

# Get the trajectory from the environment
traj = env.v_trajectory

Now we need to populate our trajectory with our parameters. They are added with the default values of x=y=1.0.

# Add both parameters
traj.f_add_parameter('x', 1.0, comment='Im the first dimension!')
traj.f_add_parameter('y', 1.0, comment='Im the second dimension!')

Well, calculating 1.0 * 1.0 is quite boring, we want to figure out more products, that is the results of the cartesian product set {1.0,2.0,3.0,4.0} x {6.0,7.0,8.0}. Therefore, we use f_explore in combination with the builder function cartesian_product.

# Explore the parameters with a cartesian product
traj.f_explore(cartesian_product({'x':[1.0,2.0,3.0,4.0], 'y':[6.0,7.0,8.0]}))

Finally, we need to tell the environment to run our job multiply with all parameter combinations.

# Run the simulation with all parameter combinations
env.f_run(multiply)

And that’s it. The environment will evoke the function multiply now 12 times with all parameter combinations. Every time it will pass a traj container with another one of these 12 combinations of different x and y values to calculate the value of z. Moreover, the environment and the storage service will have taken care about the storage of our trajectory - including the results we have computed - into an HDF5 file.

So have fun using this tool!

Cheers,: Robert

Miscellaneous

Acknowledgements

Thanks to Robert Pröpper and Philipp Meier for answering all my python questions

You might wanna check out their SpykeViewer (https://github.com/rproepp/spykeviewer) tool for visualization of MEA recordings and NEO (http://pythonhosted.org/neo) data
Thanks to Owen Mackwood for his SNEP toolbox which provided the initial ideas for this project
Thanks to the BCCN Berlin (http://www.bccn-berlin.de), the Research Training Group GRK 1589/1, and the Neural Information Processing Group ( http://www.ni.tu-berlin.de) for support

Tests

Tests can be found in pypet/tests. Note that they involve heavy file I/O and you need privileges to write files to a temporary folder. The tests suite will make use of the tempfile.gettempdir() function to create such a temporary folder.

You can run all tests with $ python all_tests.py which can also be found under pypet/tests. You can pass additional arguments as $ python all_tests.py -k –folder=myfolder/ with -k to keep the HDF5 files created by the tests (if you want to inspect them, otherwise they will be deleted after the completed tests), and –folder= to specify a folder where to store the HDF5 files instead of the temporary one. If the folder cannot be created the program defaults to tempfile.gettempdir().

Running all tests can take up to 15 minutes. The test suite encompasses more than 300 tests (including the BRIAN based tests) and has a code coverage of more than 90%!

License

BSD, please read LICENSE file.

Legal Notice

pypet was created by Robert Meyer at the Neural Information Processing Group (TU Berlin), supported by the Research Training Group GRK 1589/1.

Contact

robert.meyer (at) ni.tu-berlin.de

Marchstr. 23

MAR 5.046

D-10587 Berlin

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering
- Utilities

Release history Release notifications | RSS feed

0.6.1

Feb 14, 2023

0.6.0

Sep 11, 2021

0.5.2

May 25, 2021

0.5.1

Jun 2, 2020

0.5.0

Jun 2, 2020

0.4.3

Jun 24, 2018

0.4.2

Dec 3, 2017

0.4.1

Oct 23, 2017

0.4.0

Dec 26, 2016

0.3.0

Apr 25, 2016

0.2.0

Apr 8, 2016

0.2b1 pre-release

Mar 11, 2016

0.2b0 pre-release

Mar 11, 2016

0.1b12.post1 pre-release

Mar 11, 2016

0.1b.12 pre-release

Nov 24, 2015

0.1b.11 pre-release

May 25, 2015

0.1b.10 pre-release

Apr 16, 2015

0.1b.9 pre-release

Feb 10, 2015

0.1b.8 pre-release

Aug 11, 2014

0.1b.7 pre-release

Jul 17, 2014

0.1b.6 pre-release

Jun 3, 2014

0.1b.5 pre-release

Apr 16, 2014

This version

0.1b.4 pre-release

Mar 24, 2014

0.1b.3 pre-release

Jan 4, 2014

0.1b.2 pre-release

Nov 19, 2013

0.1b.1 pre-release

Oct 22, 2013

0.1b.0 pre-release

Oct 15, 2013

0.1a.6 pre-release

Sep 26, 2013

0.1a.5 pre-release

Sep 26, 2013

0.1a.4 pre-release

Sep 25, 2013

0.1a.3 pre-release

Sep 25, 2013

0.1a.2 pre-release

Sep 25, 2013

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypet-0.1b.4.tar.gz (178.1 kB view hashes)

Uploaded Mar 24, 2014 Source

Hashes for pypet-0.1b.4.tar.gz

Hashes for pypet-0.1b.4.tar.gz
Algorithm	Hash digest
SHA256	`a2bf7476cd56efa9957b1790e5bd63c2b81d2fe79e99b5d9bc09321561238622`
MD5	`1e3cc32a5e72648b164e410af0724ad6`
BLAKE2b-256	`6dc0b6175bcdc99e960ee0a81e68c639741ceec37e24de96893cbdbd902dd7a3`