Skip to main content

UNKNOWN

Project description

PySurvey is a Python package designed to perform interactive analysis of survey data, composed of counts of occurrence of different categories in a collection of samples. Specifically, PySurvey is developed in the context of genomic surveys, such as 16S surveys, where one studies the occurrence of OTUs across samples. Though much of PySurvey’s functionality is not unique to survey data, and equivalent features are implemented in many other packages, PySurvey is intended to serve as a ‘one-stop-shop’, and thus attempts to includes all the methods that are commonly used in the analysis of genomic survey data (often by wrapping around other packages), with a sensible choice of default parameters (e.g. distance metrics, etc’).

PySurvey is based on the powerful pandas package which offers rich data structures which are tailored and optimized for interactive analysis of large data tables.

PySurvey Resources

Key Features

  • General utility:
    • Metadata support.

    • Filtering of samples/components.

    • ML and Bayesian estimation of component fractions.

  • Exploratory analysis:
    • Dimension reduction: PCoA.

    • Clustering: hierarchical, gaussian mixture models GMM.

    • Compositional correlations via SparCC.

    • Plotting: sorted heatmaps, stacked plots, …

  • Ecological theory:
    • Sample diversities (alpha diversity).

    • Rarefaction.

    • Rank abundance plots.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PySurvey-0.1.2.tar.gz (3.0 MB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page