Project description

SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations

This repository contains the implementation of SNoRe algorithm from SNoRe paper found here:

@misc{meznar2020snore,
    title={SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations},
    author={Sebastian Me\v{z}nar and Nada Lavra\v{c} and Bla\v{z} \v{S}krlj},
    year={2020},
    eprint={2009.04535},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

An overview of the algorithm is presented in the image below:

algorithm overview

Installing SNoRe

python setup.py install

pip install snore-embedding

Using SNoRe

A simple use-case is shown below. First, we import the necessary libraries and load the dataset and its labels.

from snore import SNoRe
from scipy.io import loadmat
from sklearn.utils import shuffle
from catboost import CatBoost
import pandas as pd
from sklearn.metrics import f1_score
import numpy as np

# Load adjacency matrix and labels
dataset = loadmat("../data/cora.mat")
network_adj = dataset["network"]
labels = dataset["group"]

We then create the SNoRe model and embed the network. In code, the default parameters are shown.

# Create the model
model = SNoRe(dimension=256, num_walks=1024, max_walk_length=5,
              inclusion=0.005, fixed_dimension=False, metric="cosine",
              num_bins=256)

# Embed the network
embedding = model.embed(network_adj)

Finally, we train the classifier and test on the remaining data.

# Train the classifier
nodes = shuffle([i for i in range(network_adj.shape[0])])
train_mask = nodes[:int(network_adj.shape[0]*0.8)]
test_mask = nodes[int(network_adj.shape[0]*0.8):]
classifier = CatBoost(params={'loss_function': 'MultiRMSE', 'iterations': 500})
df = pd.DataFrame.sparse.from_spmatrix(embedding)
classifier.fit(df.iloc[train_mask], labels[train_mask])

# Test prediction
predictions = classifier.predict(df.iloc[test_mask])
print("Micro score:",
      f1_score(np.argmax(labels[test_mask], axis=1),
               np.argmax(predictions, axis=1),
               average='micro'))

Further examples of evaluation and embedding explainability can be found in the example folder.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.3.3

Nov 17, 2021

This version

0.3.2

Nov 5, 2020

0.3.1

Nov 4, 2020

0.2.1

Sep 9, 2020

0.1

Sep 8, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

snore_embedding-0.3.2-py3-none-any.whl (22.2 kB view hashes)

Uploaded Nov 5, 2020 Python 3

Hashes for snore_embedding-0.3.2-py3-none-any.whl

Hashes for snore_embedding-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`42bfe9ff7a80add37f19e3440d3356961edebd0dc5d7780a15ac0cab6899273a`
MD5	`645a2e24a667d9a0019251960728245a`
BLAKE2b-256	`81d52ce917a6f784d5ec7b4b7520299372d775f77e5f1b12249cc39556f98054`