Skip to main content

Python code to identify temporal K_{2,h} motifs in networks

Project description

tgm - Temporal Graph Motifs

Python code to look for temporal motifs, namely temporal K_{2,h} motifs, in a graph or network of events.

We assume that events are time indexed and grouped under some root event which happens first via a tree structure. Most styles of threaded conversation (e.g. tweets and retweets, reddit comments, email chains) can be described in this format.

Data Format

The data consists of temporal 4- or 5-tuples:

  • (event_id, actor_id, time, root_id)
  • (event_id, actor_id, time, root_id, parent_id)

Where:

  • event_id: is a unique id for each event
  • actor_id: is the id of the actor/user generating the event
  • time: in UTC format
  • root_id: is the id of the root event that this event_id is part of
  • parent_id: is the id of the parent of this event when events with same root_id have a tree structure

Assumption:

  • event_id == root_id for the root event; in this case, parent_id is meaningless
  • all events with same root_id can not happen before the root event itself (where event_id == root_id)
  • if "parent_id" are used, the parent event can not happen before one of its child event

Temporal K_{2,h} motifs

Such motifs are parametrized by 3 values:

  • dt: the reaction time, in seconds
  • dT: the repetition time, in seconds
  • h: the number of distinct root events forming the motif

We distinguish 2 families of motifs:

(1) root-based motifs

A temporal K_{2,h} motifs given (dt, dT) occurs when:

  • an actor A submits a root event and a different actor B sumbits an event under that root event (i.e. same root_id) within dt seconds (reaction time)
  • the above scenario happens h times, for h distinct root events, all within dT seconds (repetition time)

(2) hop-based motifs

A temporal K_{2,h} motifs given (dt, dT) occurs when:

  • an actor A submits an event under a root_event (the event needs not be the root event but it can be) with some event_id, and a different actor B sumbits another event such that its parent_id == event_id, within dt seconds (reaction time)
  • the above scenario happens h times, in h distinct root events, all within dT seconds (repetition time)

Finding K_{2,h} motifs

The function K2h is called to find temporal motifs. When called first, a DataFrame with the required columns needs to be passed (see details below). If the function is called several times for the same dataset (for example, trying different dt and dT parameters), then some speed-up can be achieved as we illustrate below.

Parameters:

  • mode: 'root' for root-based motifs, or 'hop' for hop-based motifs
  • df: pandas DataFrame with the required columns:
    • event_id: unique id for each event
    • actor_id: actor/user id
    • time: in seconds, typically UTC
    • root_id: id of the root event that this event_id is part of
    • parent_id: id of the parent event, only required if mode == 'hop'
  • dt: reaction time for the motifs, in seconds
  • dT: repetition time for the motifs, in seconds
  • h: the 'h' in K2h; the number of different root_id's in the motifs (>=2)
  • bg: bipartite graph obtained from a previous call of the 'K2h' function with the same dataframe 'df'; if supplied, 'df' is not required; this will speed-up the computation
  • verbose: if set, return the tuples of root_id's for each K_{2,h} motif found (default=False)
  • return_df: if set, also output the results as a pandas DataFrame (default=False)
  • return_bg: if set, output the bipartite graph that can be re-used for faster subsequent runs of 'K2h' (default=True)

Result are pairs of actors with the number of temporal motifs they share and if 'verbose' is set, the list of tuples of root_id's they share.

Output:

  • self.graph: igraph Graph with the K2h motifs (nodes=actors, edges=K_{2,h} motifs)
  • self.df: pandas DataFrame with K2h motifs (optional, default=False)
  • self.bg: igraph multi-bipartite Graph; can be used for faster subsequent runs of 'K2h' (optional, default=True)

For the DataFrame output, we also group the actors into weakly connected components, returning the component number for each one.

For the Graph output, the edges are directed from the non-root actor to the root actor. Edge weights indicate the number of motifs. There are no loops.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tgm-0.0.3.tar.gz (6.7 kB view hashes)

Uploaded Source

Built Distribution

tgm-0.0.3-py3-none-any.whl (6.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page