Skip to main content

A data analysis package for PI-ICR Mass Spectroscopy

Project description

GMMClusteringAlgorithms

A package for implementing Gaussian Mixture Models as a data analysis tool in PI-ICR Mass Spectroscopy experiments. It was first developed in the Fall of 2020 to be used in PI-ICR experiments at Argonne National Laboratory (Lemont, IL, U.S.). At its core is a modified version of the 'mixture' module from the package scikit-learn. The modified version retains all the same components as the original version. In addition, it contains two classes with restricted fitting algorithms: a GMM fit where the phase dimension of the component means is not a parameter, and a BGM fit where the number of components is not a parameter. The rest of the gmm_clustering_algorithms package facilitates quick, intuitive use of the GMM algorithms through the use of 4 classes.

  1. DataFrame
    • This class is responsible for processing the .lmf file and phase shifts. As attributes, it holds the processed data for easy access, as well as any data cuts.
  2. GaussianMixtureModel
    • This class fits Gaussian Mixture Models to the DataFrame object. As parameters, it takes:
      1. Cartesian/Polar coordinates
      2. Number of components to use
      3. Covariance matrix type
      4. Information criterion
    • Allows for 'strict' fits, i.e. fits where the number of components is specified.
  3. BayesianGaussianModel
    • Exact same as the GaussianMixtureModel class, but uses the BayesianGaussianModel class from scikit-learn instead of the GaussianMixtureModel class.
  4. PhaseFirstGaussianModel
    • Implements a fit where the phase dimension is fit to first, followed by a GMM fit to both spatial dimensions in which the phase dimension of the component means is fixed. This type of fit was found to work especially well with data sets in which there were many species, like the 168Ho data.
    • Only works with Polar coordinates

Each model class also includes the ability to visualize results in several ways (clustering results, One-dimensional histograms, Probability density function) and the ability to copy fit results to the clipboard for pasting into an Excel spreadsheet.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

GMMClusteringAlgorithms-0.0.2-py2.py3-none-any.whl (72.7 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page