parsecpy

Parsec Benchmark interface tool

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Python library to interface with PARSEC 2.1 and 3.0 Benchmark, controlling execution triggers and processing the output measures times data to calculate speedups. Further, the library can generate a mathematical model of speedup of a parallel application, based on “Particles Swarm Optimization” algorithm to discover the parameters to minimize a “objective function”.

Features

Run parsec application with repetitions e multiple input sizes and output data to file
Process a group of Parsec 2.1 logs files generates from a shell direct execution of parsec
Manipulate of data resulting from logs process or execution obtained by module run script itself
Calculate the speedups and efficency of applications, if it’ possible, using the measured times of execution
provide a “PSO” algorithm to model the speedup of a parallel application

Prerequisites

Parsec 2.1 or newer
Python3 or newer
Numpy
Pandas
Matplotlib with Mplot3D Toolkit (Optional, to plot 3D surface)

Site

https://github.com/alexfurtunatoifrn/parsecpy

Installation

$ pip3 install parsecpy

Usage

Class ParsecData

>>> from parsecpy import ParsecData
>>> d = ParsecData('path_to_datafile')
>>> print(d)        # Print summary informations
>>> d.times()       # Show a Dataframe with mesures times
>>> d.speedups()    # Show a Dataframe with speedups
>>> d.plot3D(d.speedups(), title='Speedup', zlabel='speedup')   # plot a 3D Plot : speedups x number of cores x input sizes
>>> d.plot3D(d.efficiency(), title='Efficiency', zlabel='efficiency')   # plot a 3D Plot : speedups x number of cores x input sizes

Class ParsecLogsData

>>> from parsecpy import ParsecLogsData
>>> l = ParsecLogsData('path_to_folder_with_logfiles')
>>> print(l)        # Print summary informations
>>> l.times()       # Show a Dataframe with mesures times
>>> l.speedups()    # Show a Dataframe with speedups
>>> l.plot3D()      # plot a 3D Plot : speedups x number of cores x input sizes

Class Swarm

>>> from mparsecpy import Swarm
>>> parsec_date = ParsecData("my_output_parsec_file.dat")
>>> out_measure = parsec_exec.speedups()
>>> inputsizes = [(col, int(col.split('_')[1])) for col in y_measure]
>>> cores = y_measure.index
>>> overhead = False
>>> argsswarm = (out_measure, overhead, cores, inputsizes)
>>> pso = Swarm([0,0,0,0], [2.0,1.0,1.0,2.0], args=argsswarm, threads=10,
                size=100, maxiter=1000, modelpath=/root/mymodelfunc.py)
>>> model = pso.run()
>>> print(model.params)

Requirements for model python module

The python module file provided by user should has the following requirements:

Should has, at least, two function as following:

def constraint_function(p, *args):
    # your code
    # arguments:
    # p - particle object
    # args - list of position arguments passed to function:
    #   args[0] - Pandas Dataframe object of measured speedups (PasecData speedups)
    #   args[1] - boolean value (if overhead should be considerable)
    #   args[2] - list of number of cores used on args[0] measured speedups
    #   args[3] - list of number of problems sizes used on args[0] measured speedups
    # analize the feasable of particles position (searched parameters)
    # return True or False, depend of requirements
    return boolean_value

def objective_function(p, *args):
    # your code
    # calculate the function with should be minimized
    # return the calculated value
    return float_value

Run Parsec

parsecpy_runprocess [-h] -p PACKAGE
                       [-c {gcc,gcc-serial,gcc-hooks,gcc-openmp,gcc-pthreads,gcc-tbb}]
                       [-i INPUT] [-r REPITITIONS]
                       c

Script to run parsec app with repetitions and multiples inputs sizes

positional arguments:
  c                     List of cores numbers to be used. Ex: 1,2,4

optional arguments:
  -h, --help            show this help message and exit
  -p PACKAGE, --package PACKAGE
                        Package Name to run
  -c {gcc,gcc-serial,gcc-hooks,gcc-openmp,gcc-pthreads,gcc-tbb}, --compiler {gcc,gcc-serial,gcc-hooks,gcc-openmp,gcc-pthreads,gcc-tbb}
                        Compiler name to be used on run. (Default: gcc-hooks).
  -i INPUT, --input INPUT
                        Input name to be used on run. (Default: native).
                        Syntax: inputsetname[<initialnumber>:<finalnumber>].
                        Ex: native or native_1:10
  -r REPITITIONS, --repititions REPITITIONS
                        Number of repititions for a specific run. (Default: 1)

Example:
    parsecpy_runprocess -p frqmine -c gcc-hooks -r 5 -i native 1,2,4,8

Logs process

parsecpy_processlogs [-h] foldername outputfilename

Script to parse a folder with parsec log files and save measures an output
file

positional arguments:
  foldername      Foldername with parsec log files.
  outputfilename  Filename to save the measures dictionary.

optional arguments:
  -h, --help      show this help message and exit

Example:
    parsecpy_processlogs logs_folder my-logs-folder-data.dat

Create split parts

parsecpy_createinputs [-h] -p {freqmine,fluidanimate} -n NUMBEROFPARTS
                           [-t {equal,diff}] -x EXTRAARG
                           inputfilename

Script to split a parsec input file on specific parts

positional arguments:
  inputfilename         Input filename from Parsec specificated package.

optional arguments:
  -h, --help            show this help message and exit
  -p {freqmine,fluidanimate}, --package {freqmine,fluidanimate}
                        Package name to be used on split.
  -n NUMBEROFPARTS, --numberofparts NUMBEROFPARTS
                        Number of split parts
  -t {equal,diff}, --typeofsplit {equal,diff}
                        Split on equal or diferent size partes parts
  -x EXTRAARG, --extraarg EXTRAARG
                        Specific argument: Freqmine=minimum support (11000),
                        Fluidanimate=Max number of frames

Example:
    parsec_createinputs -p fluidanimate -n 10 -t diff -x 500 fluidanimate_native.tar

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

1.0.6

Dec 5, 2019

1.0.5

May 4, 2019

1.0.4

May 3, 2019

1.0.3

May 2, 2019

1.0.1

Mar 14, 2019

1.0.0

Jun 5, 2018

0.9.7

Apr 17, 2018

0.9.6

Apr 11, 2018

0.9.5

Apr 9, 2018

0.9.3

Mar 3, 2018

0.9.2

Mar 3, 2018

0.9.0

Jan 22, 2018

This version

0.8.4

Nov 3, 2017

0.8.3

Nov 3, 2017

0.8.2

Sep 19, 2017

0.8.1

Sep 19, 2017

0.7

Sep 6, 2017

0.6

Sep 6, 2017

0.5

Aug 23, 2017

0.4

Aug 23, 2017

0.3

Aug 22, 2017

0.2

Aug 22, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parsecpy-0.8.4.tar.gz (18.1 kB view hashes)

Uploaded Nov 3, 2017 Source

Hashes for parsecpy-0.8.4.tar.gz

Hashes for parsecpy-0.8.4.tar.gz
Algorithm	Hash digest
SHA256	`3f8bdbbc2906eb0053f568ba30ff73c13b73b34dde3ecaabbde62dde798d041d`
MD5	`e3139567427f8d61526b9a5b546a1b2a`
BLAKE2b-256	`668467808a0ffbdf80bd5d1b0902dc7459b23d9470cbc5421c0903cde8f2534a`