Python code to identify temporal K_{2,h} motifs in networks

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

tgm - Temporal Graph Motifs

Python code to look for temporal motifs, namely temporal K_{2,h} motifs, in a graph or network of events.

We assume that events are time indexed and grouped under some root event which happens first via a tree structure. Most styles of threaded conversation (e.g. tweets and retweets, reddit comments, email chains) can be described in this format.

Data Format

The data consists of temporal 4- or 5-tuples:

(event_id, actor_id, time, root_id)
(event_id, actor_id, time, root_id, parent_id)

Where:

event_id: is a unique id for each event
actor_id: is the id of the actor/user generating the event
time: in UTC format
root_id: is the id of the root event that this event_id is part of
parent_id: is the id of the parent of this event when events with same root_id have a tree structure

Assumption:

event_id == root_id for the root event; in this case, parent_id is meaningless
all events with same root_id can not happen before the root event itself (where event_id == root_id)
if "parent_id" are used, the parent event can not happen before one of its child event

Temporal K_{2,h} motifs

Such motifs are parametrized by 3 values:

dt: the reaction time, in seconds
dT: the repetition time, in seconds
h: the number of distinct root events forming the motif

We distinguish 2 families of motifs:

(1) root-based motifs

A temporal K_{2,h} motifs given (dt, dT) occurs when:

an actor A submits a root event and a different actor B sumbits an event under that root event (i.e. same root_id) within dt seconds (reaction time)
the above scenario happens h times, for h distinct root events, all within dT seconds (repetition time)

(2) hop-based motifs

A temporal K_{2,h} motifs given (dt, dT) occurs when:

an actor A submits an event under a root_event (the event needs not be the root event but it can be) with some event_id, and a different actor B sumbits another event such that its parent_id == event_id, within dt seconds (reaction time)
the above scenario happens h times, in h distinct root events, all within dT seconds (repetition time)

Finding K_{2,h} motifs

The function K2h is called to find temporal motifs. When called first, a DataFrame with the required columns needs to be passed (see details below). If the function is called several times for the same dataset (for example, trying different dt and dT parameters), then some speed-up can be achieved as we illustrate below.

Parameters:

mode: 'root' for root-based motifs, or 'hop' for hop-based motifs
df: pandas DataFrame with the required columns:
- event_id: unique id for each event
- actor_id: actor/user id
- time: in seconds, typically UTC
- root_id: id of the root event that this event_id is part of
- parent_id: id of the parent event, only required if mode == 'hop'
dt: reaction time for the motifs, in seconds
dT: repetition time for the motifs, in seconds
h: the 'h' in K2h; the number of different root_id's in the motifs (>=2)
bg: bipartite graph obtained from a previous call of the 'K2h' function with the same dataframe 'df'; if supplied, 'df' is not required; this will speed-up the computation
verbose: if set, return the tuples of root_id's for each K_{2,h} motif found (default=False)
return_df: if set, also output the results as a pandas DataFrame (default=False)
return_bg: if set, output the bipartite graph that can be re-used for faster subsequent runs of 'K2h' (default=True)

Result are pairs of actors with the number of temporal motifs they share and if 'verbose' is set, the list of tuples of root_id's they share.

Output:

self.graph: igraph Graph with the K2h motifs (nodes=actors, edges=K_{2,h} motifs)
self.df: pandas DataFrame with K2h motifs (optional, default=False)
self.bg: igraph multi-bipartite Graph; can be used for faster subsequent runs of 'K2h' (optional, default=True)

For the DataFrame output, we also group the actors into weakly connected components, returning the component number for each one.

For the Graph output, the edges are directed from the non-root actor to the root actor. Edge weights indicate the number of motifs. There are no loops.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.0.3

Dec 16, 2022

0.0.2

Oct 14, 2022

0.0.1

Oct 14, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tgm-0.0.3.tar.gz (6.7 kB view hashes)

Uploaded Dec 16, 2022 Source

Built Distribution

tgm-0.0.3-py3-none-any.whl (6.9 kB view hashes)

Uploaded Dec 16, 2022 Python 3

Hashes for tgm-0.0.3.tar.gz

Hashes for tgm-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`42b143fe68c4f4b23d7c005264cf79d79503916f08010a6d73b7cc0511b3a3cd`
MD5	`7b0d3e67222b87e3a090c034a67b6e5c`
BLAKE2b-256	`a3ee1445f8a9aee9a4517c63312b8218c5d7e152a775a3301eb08b86802fe343`

Hashes for tgm-0.0.3-py3-none-any.whl

Hashes for tgm-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a5504adde78c829f5212e4afa31bbffbfee824ba4c1d8727a7cd843c2c15fc30`
MD5	`84c3efdd257ed87f908839e148d8d5b5`
BLAKE2b-256	`617f5c131619922ba6fd3b7a3516e274bc0af704e9c198076904e453c802662b`