Skip to main content

Jaccard metric calculations for boolean vectors

Project description

boolean_jaccard

MIT License PyPI Version Python Versions CI/CD codecov Documentation Status Project Status: Active Codestyle: Black Imports: isort

Calculate Jaccard metrics for boolean values.

The source code lives on github.

The documentation lives at ReadTheDocs.

The project can be installed from PyPI.

Abstract

The code here represents a python implementation of the Jaccard package hosted here by N. Chung. Its citation follows.

Citation

Chung, N., Miasojedow, B., Startek, M., and Gambin, A. "Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data" BMC Bioinformatics (2019) 20(Suppl 15): 644. https://doi.org/10.1186/s12859-019-3118-5

Installation

It's a PyPI package, so the pocess is pretty straightforward:

pip install -U boolean_jaccard # for most recent version
pip install -U boolean_jaccard==0.0.1 # for a specific version

A list of all released versions can be found at our tags.

A Note on Version Numbers

boolean_jaccard uses strict automated semantic versioning. As such, we guarantee bugfixes in path releases, backwards compatible features in minor releases, and breaking changes in major releases. We will endeavour to avoid breaking changes where possible, but, should they occur, they will only be in major releases.

Installing from Source

Most users **will not need** these instructions.

If you need to customise the code in some manner, you'll need to install from source. To do that, either clone the repository from github, or download one of our releases. For full instructions, please see our guide on contributing.

Contributing

Open-source software is only open-source becaues of the excellent community, so we welcome any and all contributions! If you think you have found a bug, please log a report in our issues. If you think you can fix a bug, or have an idea for a new feature, please see our guide on contributing for more information on how to get started! While here, we request that you follow our code of conduct to help maintain a welcoming, respectful environment.

Future Developments

  • Fully vectorise to improve performance.

Citations

If you use boolean_jaccard in your work, please cite the following manuscripts:

  1. Chung, N., Miasojedow, B., Startek, M., and Gambin, A. "Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data" BMC Bioinformatics (2019) 20(Suppl 15): 644. https://doi.org/10.1186/s12859-019-3118-5

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

boolean_jaccard-0.1.1.tar.gz (6.5 kB view hashes)

Uploaded Source

Built Distribution

boolean_jaccard-0.1.1-py3-none-any.whl (5.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page