Skip to main content

A hierarchical divisive clustering toolbox

Project description

PyPI PyPI - Python Version example workflow codecov Codacy Badge License: MIT

HiPart: Hierarchical divisive clustering toolbox

HiPart is a package created for the implementation of hierarchical divisive clustering algorithms. Even among this family of algorithms, its specialty is high-performance algorithms for high-dimensional big data. It is a package with similar execution principles as the scikit-learn package. It also provides two types of static visualizations for all the algorithms executed in the package, with the addition of linkage generation for the divisive hierarchical clustering structure. Finally, the package provides an interactive visualization for manipulating the PDDP-based algorithms' split-point for each of the splits those algorithms generated from the clustering process.

Installation

For the installation of the package, the only necessary actions and requirements are a version of Python higher or equal to 3.6 and the execution of the following command.

pip install HiPart

Simple Example Execution

The example bellow is the simplest form of the package's execution. Shortly, it shows the creation of synthetic clustering dataset containing 6 clusters. Afterwards it is clustered with the dePDDP algorithm and only the cluster labels are returned.

from HiPart.clustering import dePDDP
from sklearn.datasets import make_blobs

X, y = make_blobs(n_samples=1500, centers=6, random_state=0)

clustered_class = dePDDP(max_clusters_number=6).fit_predict(X)

Users can find complete execution examples for all the algorithms of the HiPart package in the clustering_example file of the repository. Also, the users can find a KernelPCA method usage example in the clustering_with_kpca_example file of the repository. Finally, the file interactive_visualization_example contains an example execution of the interactive visualization. The instructions for the interactive visualization GUI can be found with the execution of this visualization.

Documentation

The full documentation of the package can be found here.

Collaborators

Dimitris Tasoulis :email: Panagiotis Anagnostou :email: Sotiris Tasoulis :email:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

HiPart-0.1.17.tar.gz (32.7 kB view hashes)

Uploaded Source

Built Distribution

HiPart-0.1.17-py3-none-any.whl (35.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page