Skip to main content

Extensible PROV-N visualizer and querier

Project description

Extensible PROV-N

This tool intends to provide a way to visualize and query PROV-N with custom extensions. It was extracted from the Versioned-PROV repository.

Installation

This tool supports Python 3.6 or greater. To install, please run:

$ pip install extensible_prov

Basic visualization usage

This project installs three CLI commands to convert PROV-N to Graphviz Dot files:

  • provn: converts Plain PROV-N to dot. It generates a graph similar to ProvToolbox
  • prov-dictionary: converts PROV-Dictionary files to dot
  • versioned-prov: converts Versioned-PROV files to dot

These three commands have the same interface. To convert a PROV-N file to dot, just run:

$ provn --infile input.prov --outfile output.dot

These commands also support input redirection:

echo "entity(e1)" | provn | dot -Tpng -o pipe.png

Resulting pipe.png

For other options, see the the help:

$ versioned-prov -h
usage: versioned-prov [-h] [-i INFILE] [-o OUTFILE] [-x WIDTH] [-y HEIGHT]
                      [-r RANKDIR] [-s STYLE]

Convert PROV-N to GraphViz Dot

optional arguments:
  -h, --help            show this help message and exit
  -i INFILE, --infile INFILE
                        Input PROV-N file
  -o OUTFILE, --outfile OUTFILE
                        Output dot file
  -x WIDTH, --width WIDTH
                        Graph width
  -y HEIGHT, --height HEIGHT
                        Graph height
  -r RANKDIR, --rankdir RANKDIR
                        Graph rankdir
  -s STYLE, --style STYLE
                        Graph style

Styling

These commands support custom styles. It has four main built-in styles:

  • default: Default style that highlights nodes and edges specific to Prov-Dictionary and Versioned-PROV
  • nohighlight: Style that works for plain PROV, PROV-Dictionary, and Versioned-PROV, but does not highlight anything
  • blackwhite: Black and white style based on nohighlight
  • provtoolbox: Style based on ProvToolbox. Labels are smaller than nohighlight style.

Usage:

echo "entity(e1)" | provn -s blackwhite | dot -Tpng -o blackwhite.png

Resulting blackwhite.png

Defining new Styles

Styles are defined as modules with an EXPORT variable with a Style class. This class uses the style dictionary attribute to define individual styles for every element type.

In this dictionary, the key indicate which PROV-N statements or elements are affect by the style. Using a * at the end of the key overrides the other styles. It supports the following keys:

  • label: applies to general labels
  • {statement}_label: applies to statement labels
  • {statement}{part}_label: applies to part of statement labels
  • arrow: applies to general arrows
  • node: applies to general nodes
  • {statement}: applies to statement arrows or nodes
  • {statement}{part}: applies to part of statement arrows
  • point: applies to general points
  • {statement}_point: applies to statement points
  • {statement}{part}_point: applies to part of statement points
  • attrs: applies to general optional attributes
  • {statement}_attrs: applies to statement optional attributes
  • attrs_arrow: applies to general optional attributes arrows
  • {statement}_attrs_arrow: applies to statement optional attributes arrows
  • after: applies to statements after other styles
  • arrow_after: applies to arrows after other styles
  • point_after: applies to points after other styles
  • node_after: applies to nodes after other styles
  • attrs_after: applies to optional attributes after other styles

In this dictionary, the value can be either a dictionary with dot attributes, or a function that receives the statement and the attributes as arguments and returns a dictionary with dot attributes.

The following module code presents a style example that outputs the opposite colors of the blackwhite style:

from extensible_provn.view.style.nohighlight import NoHighlightStyle

class WhiteBlackStyle(NoHighlightStyle):

    def __init__(self):
        super(WhiteBlackStyle, self).__init__()
        self.style = self.join(self.style, {
            "entity": {"fillcolor": "#000000", "fontcolor": "#FFFFFF", "style": "filled"},
            "activity": {
                "fillcolor": "#000000", "fontcolor": "#FFFFFF",
                "shape": "polygon", "sides": "4", "style": "filled"
            },
            "agent": {"fillcolor": "#000000", "fontcolor": "#FFFFFF", "shape": "house", "style": "filled"},
            "value": {"fillcolor": "#000000", "fontcolor": "#FFFFFF", "style": "filled"},
            "version": {"fillcolor": "#000000", "fontcolor": "#FFFFFF", "style": "filled"},
        })


EXPORT = WhiteBlackStyle

Usage:

echo "entity(e1)" | provn -s opposite_bw | dot -Tpng -o opposite_bw.png

Resulting opposite_bw.png

Other styles that can be used as reference are available at: https://github.com/JoaoFelipe/extensible_provn/tree/master/extensible_provn/view/style

Extending PROV-N with views

This project supports extending PROV-N by adding new statements or overriding existing ones to add more options. It can be performed by using the decorator view.dot.graph.prov as following:

"""
PROV-N with entlist(lid; e1, e2, ..., en) statement that is equivalent to:
entity(lid)
entity(e1)
hadMember(lid, e1)
entity(e2)
hadMember(lid, e2)
...
entity(en)
hadMember(lid, en)
"""

from extensible_provn.view import provn  # Use Plain PROV as base
from extensible_provn.view.dot import graph

@graph.prov("entlist")
def entlist(dot, *args, attrs=None, id_=None):
    lines = [dot.node(attrs, "entity", id_)]
    for entity_id in args:
        lines.append(dot.node(attrs, "entity", entity_id))
        lines.append(dot.arrow2(attrs, "hadMember", id_, entity_id))
    return "\n".join(lines)

if __name__ == "__main__":
    graph.main()

Note: This code is valid for Python 3. Some changes are required for Python 2.

The graph.main() adds the default CLI to this extension.

Usage:

echo "entlist(lid; e1, e2, e3)" | python prov_list.py | dot -Tpng -o prov_list.png

Resulting prov_list.png

Jupyter Integration

Besides the CLI for converting PROV-N files to GraphViz files, this project also provides a Cell Magic to visualize the Provenance in Jupyter Noteboks. To do so, you must load the IPython extension and import the desirable PROV-N extension as follows:

%load_ext extensible_provn.prov_magics
import extensible_provn.view.provn  # Use Plain PROV as base

And then, use the %%provn cell magic in a Cell that you want to display the Graph.

Jupyter Notebook Example

Note that it requires GraphViz to be installed. The option -e pdf may also benefit from an inkscape installation. However it uses GraphViz if inkscape is not installed.

Querying

Besides visualizing provenance, we also support querying PROV-N files by pattern matching. The built-in queries are rather simple and incomplete, but extending them is similar to creating a visualization extension.

"""Defines a pattern matching query for entlist"""
from extensible_provn.query.provn import *

@querier.prov("entlist", ["id", "elements", "text"])
def entlist(querier, eid, *args, attrs={}, id_=None):
    return [
        id_, args,
        querier.text("entlist", list(args), attrs, id_)
    ]

This code does not need to be in a separate python file. It can be in a notebook.

Then, to execute the query, you must load it into the querier:

querier.load_text("""
entity(lid1)
entlist(lid1; e1, e2, e3)
entlist(lid2; e4, e5)
""")

And execute the pattern matching queries:

Eid, Elements, Text = var("Eid Elements Text")
_ = BLANK
for __ in entity(Eid, _) & entlist(Eid, Elements, Text):
    print(Eid.bound, Elements.bound, Text.bound)

This query gets all entities that share the entity id with entlist, and prints their id, a tuple with the entlist elements, and the entlist text.

Output:

lid1 ('e2', 'e3') entlist(lid1; e2, e3)

Note in the query that we define variables (var) for elements that we want to extract and join across multiple rules. Additionally, we use BLANK for elements we want to ignore.

Note also that the query function signature is based on the second argument of the @querier.prov decorator. Thus, the entlist query accepts only 3 parameters, while the entlist statement may accept many more.

Contributing

Feel free to contribute with this project and submit pull requests, bug reports, or questions as issues.

To install the project for development, please run on the root of the repository.

$ pip install -e .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extensible_provn-0.2.1.tar.gz (30.3 kB view hashes)

Uploaded Source

Built Distribution

extensible_provn-0.2.1-py2.py3-none-any.whl (37.7 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page