Python framework for interpretable protein prediction
Project description
Welcome to the AAanalysis documentation
AAanalysis (Amino Acid analysis) is a Python framework for interpretable sequence-based protein prediction, providing the following algorithms:
AAclust: k-optimized clustering wrapper framework to select redundancy-reduced sets of numerical scales (e.g., amino acid scales)
CPP: Comparative Physicochemical Profiling, a feature engineering algorithm comparing two sets of protein sequences to identify the set of most distinctive features.
dPULearn: deterministic Positive-Unlabeled (PU) Learning algorithm to enable training on unbalanced and small datasets.
Moreover, AAanalysis provides functions for loading protein benchmark datasets (load_data), amino acid scale sets (load_scales), and their in-depth two-level classification (AAontology).
If you are looking to make publication-ready plots with a view lines of code, see our Plotting Prelude.
Install
AAanalysis can be installed either from PyPi or conda-forge:
pip install -u aaanalysis
or
conda install -c conda-forge aaanalysis
Contributing
We appreciate bug reports, feature requests, or updates on documentation and code. For details, please refer to Contributing Guidelines. These include specifics about AAanalysis and also notes on Test Guided Development (TGD) using ChatGPT. For further questions or suggestions, please email stephanbreimann@gmail.com.
Citations
If you use AAanalysis in your work, please cite the respective publication as follows:
- AAclust:
[Citation details and link if available]
- AAontology:
Breimann, Kamp, Steiner, Frishman (2023), AAontology: An ontology of amino acid scales for interpretable machine learning, bioRxiv.
- CPP:
[Citation details and link if available]
- dPULearn:
[Citation details and link if available]
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for aaanalysis-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 41688637280cd0df4b8358de70608f1baf1b27a48cbd30d95e6e1f5af0848c8d |
|
MD5 | 9136be96a52012603ad1d2d12f703182 |
|
BLAKE2b-256 | 9be0c71c4ab3b8c1ddc10b7374bc43d323ae4f56c6bc92f71822925aacf2d47e |