biotite

A comprehensive library for computational molecular biology

Development Status
- 4 - Beta
Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: BSD License
Natural Language
- English
Operating System
Programming Language
- Python :: 3
- Python :: Implementation :: CPython
Topic
- Scientific/Engineering :: Bio-Informatics

Project description

Biotite project

Biotite is your Swiss army knife for bioinformatics. Whether you want to identify homologous sequence regions in a protein family or find disulfide bonds in a protein structure, Biotite has the right tool for you. This package bundles popular tasks in computational molecular biology into a uniform Python library. It can handle a major part of the typical workflow for sequence and biomolecular structure data:

Searching and fetching data from biological databases

Reading and writing popular sequence/structure file formats

Analyzing and editing sequence/structure data

Visualizing sequence/structure data

Interfacing external applications for further analysis

Biotite internally stores most of the data as NumPy ndarray objects, enabling

fast C-accelerated analysis,

intuitive usability through NumPy-like indexing syntax,

extensibility through direct access of the internal NumPy arrays.
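
To illustrate why an array-backed representation gives these benefits, here is a minimal sketch in plain NumPy. It is not Biotite's actual implementation: the four-letter alphabet and the `encode`/`decode` helpers are assumptions made up for this example. It only shows the general idea of storing symbols as integer codes so that slicing, masking and counting become ordinary array operations.

```python
import numpy as np

# Hypothetical alphabet for illustration only
SYMBOLS = "ACGT"
ALPHABET = np.array(list(SYMBOLS))

def encode(symbols):
    """Map each symbol to its index in the alphabet (a uint8 code array)."""
    lookup = {symbol: code for code, symbol in enumerate(SYMBOLS)}
    return np.array([lookup[s] for s in symbols], dtype=np.uint8)

def decode(codes):
    """Map a code array back to a string of symbols."""
    return "".join(ALPHABET[codes])

codes = encode("ACGTTGCA")

# NumPy-style slicing and boolean masks work directly on the codes
first_half = codes[:4]
only_g_or_t = codes[codes >= 2]

# Vectorized analysis: count the occurrences of every symbol in one call
counts = np.bincount(codes, minlength=len(SYMBOLS))

print(decode(first_half))
print(decode(only_g_or_t))
print(dict(zip(SYMBOLS, counts.tolist())))
```

Because the data is a plain array, any NumPy function can be applied to it without conversion, which is the extensibility point made above.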

As a result, the user can skip writing code for basic functionality (like file parsers) and can focus on what makes their code unique - from small analysis scripts to entire bioinformatics software packages.

If you use Biotite in a scientific publication, please cite:

Kunzmann, P. & Hamacher, K. BMC Bioinformatics (2018) 19:346.

https://doi.org/10.1186/s12859-018-2367-z

Installation

Biotite requires the following packages:

numpy

requests

msgpack

networkx

Some functions require some extra packages:

mdtraj - Required for trajectory file I/O operations.

matplotlib - Required for plotting purposes.

Biotite can be installed via Conda…

$ conda install -c conda-forge biotite

… or pip

$ pip install biotite
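
If an import fails after installation, it can help to check which of the packages listed above are importable in the current environment. The following is a small sketch using only the standard library. The helper name `missing_packages` is made up for this example, and it assumes that each package's import name equals the name given in the list.

```python
from importlib.util import find_spec

# Package names as listed in the installation notes above
REQUIRED = ["numpy", "requests", "msgpack", "networkx"]
OPTIONAL = ["mdtraj", "matplotlib"]

def missing_packages(names):
    """Return the names from `names` that cannot be found for import."""
    return [name for name in names if find_spec(name) is None]

missing_required = missing_packages(REQUIRED)
missing_optional = missing_packages(OPTIONAL)

if missing_required:
    print("Missing required packages:", ", ".join(missing_required))
else:
    print("All required packages found")

if missing_optional:
    print("Missing optional packages:", ", ".join(missing_optional))
```

The check only tests whether a package can be located, not whether its version is compatible with Biotite.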

Usage

Here is a small example that downloads two protein sequences from the NCBI Entrez database and aligns them:

import biotite.sequence.align as align
import biotite.sequence.io.fasta as fasta
import biotite.database.entrez as entrez

# Download FASTA file for the sequences of avidin and streptavidin
file_name = entrez.fetch_single_file(
    uids=["CAC34569", "ACL82594"], file_name="sequences.fasta",
    db_name="protein", ret_type="fasta"
)

# Parse the downloaded FASTA file
# and create 'ProteinSequence' objects from it
fasta_file = fasta.FastaFile.read(file_name)
avidin_seq, streptavidin_seq = fasta.get_sequences(fasta_file).values()

# Align sequences using the BLOSUM62 matrix with affine gap penalty
matrix = align.SubstitutionMatrix.std_protein_matrix()
alignments = align.align_optimal(
    avidin_seq, streptavidin_seq, matrix,
    gap_penalty=(-10, -1), terminal_penalty=False
)
print(alignments[0])

MVHATSPLLLLLLLSLALVAPGLSAR------KCSLTGKWDNDLGSNMTIGAVNSKGEFTGTYTTAV-TA
-------------------DPSKESKAQAAVAEAGITGTWYNQLGSTFIVTA-NPDGSLTGTYESAVGNA

TSNEIKESPLHGTQNTINKRTQPTFGFTVNWKFS----ESTTVFTGQCFIDRNGKEV-LKTMWLLRSSVN
ESRYVLTGRYDSTPATDGSGT--ALGWTVAWKNNYRNAHSATTWSGQYV---GGAEARINTQWLLTSGTT

DIGDDWKATRVGINIFTRLRTQKE---------------------
-AANAWKSTLVGHDTFTKVKPSAASIDAAKKAGVNNGNPLDAVQQ
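
The `gap_penalty=(-10, -1)` and `terminal_penalty=False` arguments in the example determine how gaps are scored. The following pure-Python sketch shows one common reading of such an affine scheme: the first position of a gap costs the opening penalty, every further position costs the extension penalty, and gaps at either end of a sequence are free when terminal penalties are disabled. This is an illustration under those assumptions, not Biotite's code; the function `gap_penalty_total` is hypothetical, and whether Biotite charges the opening penalty in exactly this way should be checked against its API reference. Substitution scores are left out on purpose.

```python
def gap_penalty_total(row_a, row_b, gap_open=-10, gap_ext=-1,
                      terminal_penalty=False):
    """Sum the gap penalties of a pairwise alignment given as two gapped rows."""
    if len(row_a) != len(row_b):
        raise ValueError("Alignment rows must have equal length")
    total = 0
    for row in (row_a, row_b):
        # Gap positions outside [first, last) are leading or trailing gaps
        first = len(row) - len(row.lstrip("-"))
        last = len(row.rstrip("-"))
        in_gap = False
        for i, symbol in enumerate(row):
            if symbol != "-":
                in_gap = False
                continue
            is_terminal = i < first or i >= last
            if is_terminal and not terminal_penalty:
                continue
            # Assumed convention: opening penalty for the first gap
            # position, extension penalty for each following one
            total += gap_ext if in_gap else gap_open
            in_gap = True
    return total

# Toy rows made up for this example (not taken from the output above)
print(gap_penalty_total("AC--GT", "ACTTGT"))
print(gap_penalty_total("--ACGT", "TTAC-T"))
print(gap_penalty_total("--ACGT", "TTAC-T", terminal_penalty=True))
```

Under this scheme one long gap is cheaper than several short ones of the same total length, which is why affine penalties tend to produce alignments with fewer, longer gaps.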

More documentation, including a tutorial, an example gallery and the API reference, is available at https://www.biotite-python.org/.

Contribution

Interested in improving Biotite? Have a look at the contribution guidelines. Feel free to join our community chat on Discord.

Release history

0.40.0

Apr 2, 2024

0.39.0

Jan 7, 2024

0.38.0

Sep 9, 2023

0.37.0

May 13, 2023

0.36.1

Feb 6, 2023

0.36.0

Feb 2, 2023

0.35.0

Oct 25, 2022

0.34.1

Aug 19, 2022

0.34.0

Jul 20, 2022

0.33.0

Apr 27, 2022

This version

0.32.0

Jan 25, 2022

0.31.0

Nov 12, 2021

0.30.0

Sep 1, 2021

0.29.0

Jul 21, 2021

0.28.0

Jun 8, 2021

0.27.0

Mar 24, 2021

0.26.0

Mar 4, 2021

0.25.0

Dec 10, 2020

0.24.0

Oct 5, 2020

0.23.0

Aug 4, 2020

0.22.0

Jun 4, 2020

0.21.0

Apr 29, 2020

0.20.1

Feb 28, 2020

0.20.0

Feb 27, 2020

0.19.2

Dec 26, 2019

0.19.1

Dec 24, 2019

0.18.0

Nov 21, 2019

0.17.0

Sep 20, 2019

0.16.0

Aug 16, 2019

0.15.1

Jul 22, 2019

0.15.0

Jul 16, 2019

0.14.0

May 29, 2019

0.13.1

Apr 1, 2019

0.13.0

Mar 19, 2019

0.12.0

Feb 14, 2019

0.11.1

Jan 17, 2019

0.11.0

Jan 14, 2019

0.10.1

Dec 3, 2018

0.10.0

Nov 28, 2018

0.9.0

Oct 31, 2018

0.8.1

Oct 10, 2018

0.8.0

Sep 24, 2018

0.7.0

Jul 12, 2018

0.6.0

May 31, 2018

0.5.3

Mar 8, 2018

0.5.2

Mar 4, 2018

0.5.1

Feb 26, 2018

0.5.0

Feb 8, 2018

0.4.1

Dec 13, 2017

0.4.0

Dec 2, 2017

0.3.3

Nov 27, 2017

0.3.2

Nov 23, 2017

0.3.1

Nov 14, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biotite-0.32.0.tar.gz (31.8 MB)

Uploaded Jan 25, 2022 Source

Built Distributions

biotite-0.32.0-cp310-cp310-win_amd64.whl (30.9 MB)

Uploaded Jan 25, 2022 CPython 3.10 Windows x86-64

biotite-0.32.0-cp310-cp310-manylinux1_x86_64.whl (44.3 MB)

Uploaded Jan 25, 2022 CPython 3.10

biotite-0.32.0-cp310-cp310-macosx_10_9_x86_64.whl (31.5 MB)

Uploaded Jan 25, 2022 CPython 3.10 macOS 10.9+ x86-64

biotite-0.32.0-cp39-cp39-win_amd64.whl (30.9 MB)

Uploaded Jan 25, 2022 CPython 3.9 Windows x86-64

biotite-0.32.0-cp39-cp39-manylinux1_x86_64.whl (44.2 MB)

Uploaded Jan 25, 2022 CPython 3.9

biotite-0.32.0-cp39-cp39-macosx_10_9_x86_64.whl (31.5 MB)

Uploaded Jan 25, 2022 CPython 3.9 macOS 10.9+ x86-64

biotite-0.32.0-cp38-cp38-win_amd64.whl (30.9 MB)

Uploaded Jan 25, 2022 CPython 3.8 Windows x86-64

biotite-0.32.0-cp38-cp38-manylinux1_x86_64.whl (45.9 MB)

Uploaded Jan 25, 2022 CPython 3.8

biotite-0.32.0-cp38-cp38-macosx_10_9_x86_64.whl (31.6 MB)

Uploaded Jan 25, 2022 CPython 3.8 macOS 10.9+ x86-64

Hashes for biotite-0.32.0.tar.gz
Algorithm	Hash digest
SHA256	`6dee8dca3d61193c4dda936d16d4c388934b1c1f5c981ebe343bbcf72c7332dc`
MD5	`1ebb6e8fe9374f1e327ff0d3ef21e75e`
BLAKE2b-256	`63b513e76c76ed045d1006a323ad7bd99e122688ee9930bcc66081311498922a`

Hashes for biotite-0.32.0-cp310-cp310-win_amd64.whl
Algorithm	Hash digest
SHA256	`3df13f309de5cf94b6f2c86afbc74670dc0020f826b42ce5f14115aa0d4feba5`
MD5	`92d093bff27b5590267d0be1527ef6a9`
BLAKE2b-256	`68859a14bc5d8e4675223757d65025830b11447146a57498021aee5079a5cb27`

Hashes for biotite-0.32.0-cp310-cp310-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`cc0ef92a7ee2a1e5f36affacb752323adf91e254fed0cbb1e9c48966adbd8b25`
MD5	`18e64daee02ddf445fdd23cbead2943c`
BLAKE2b-256	`c68fe7805cc2dfff797ab7b28f0ba1f82d34d31213f94037d220b61ac7ef7f74`

Hashes for biotite-0.32.0-cp310-cp310-macosx_10_9_x86_64.whl
Algorithm	Hash digest
SHA256	`f670d5757376138b0b1425f43d0ccc7752b896b4c43013003f45a7d44d9596d4`
MD5	`36fa4fff51ba0bab56b137f71d67362b`
BLAKE2b-256	`337055526a5fb3a683a1f255ce630a2ebd4d6ad555e3d61bb49c207e870d87b2`

Hashes for biotite-0.32.0-cp39-cp39-win_amd64.whl
Algorithm	Hash digest
SHA256	`8f01b9b24674e609b01ca084cbf06a5989e2fed698b0f134c6c8abfcaf70a86f`
MD5	`8afb497edc81bc72af01a833cb5cf7cf`
BLAKE2b-256	`7fb4b1c641f689f0188dd5d904b702e16f90dd3c9a39668909c459be37b000b6`

Hashes for biotite-0.32.0-cp39-cp39-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`806609f8ec9dabd0f0067aa0734e117598cc329975f1a7596ddb159450304bb8`
MD5	`cdcac6b95685abe1fb6c8002cf106bca`
BLAKE2b-256	`0bb6bf80aed57c386dc2a61033f7bfe0cc0155fbcf7fd2cbacfde8a61029766e`

Hashes for biotite-0.32.0-cp39-cp39-macosx_10_9_x86_64.whl
Algorithm	Hash digest
SHA256	`6fffc7982dd0043047c9a23123e8ceee7c478e35bdedf2481e4150cdf8f1499e`
MD5	`c2375fcaad65656d7e88ef9c2933fa8e`
BLAKE2b-256	`27cd690f18885402e7ed69bf524fcdd9be03f3e8190f4fc78ade7bd6351db68b`

Hashes for biotite-0.32.0-cp38-cp38-win_amd64.whl
Algorithm	Hash digest
SHA256	`5f982761d7f05dc5874f22067632a10cce638ca4c31a162e3f85a1d2e77e237c`
MD5	`bac2cc1fc1508aceff1ca45b2ee32bf7`
BLAKE2b-256	`28b395936c7f38015373371f9e112f55471d34426f72858e411e616092a2976e`

Hashes for biotite-0.32.0-cp38-cp38-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`69fa9901b0c26a4dd49c51d48a5918fcebdc5e95d813160daa9e57aeea2b9c3e`
MD5	`6d617806f8c20425e9a6a9e9d0cb4757`
BLAKE2b-256	`f45bcea5cd633b13947a2e43ac21a9c6de7441464685807599760f2d1c24ad95`

Hashes for biotite-0.32.0-cp38-cp38-macosx_10_9_x86_64.whl
Algorithm	Hash digest
SHA256	`2ac329571102d47f86091fb0d8902c795db22e77d806b3bd3a5e8d8133556f43`
MD5	`c4b4309e5a2039a5df9a490406c5dcce`
BLAKE2b-256	`6a4c5baeb14edbea9973bafb0a7e1d67b45d318981fe4a9da9d4205429d10c93`
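
The digests listed above can be used to check that a downloaded file is intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `sha256_of` and `matches` are made up for this example, and the file location (the source distribution in the current working directory) is an assumption.

```python
import hashlib
import os

def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as file:
        for chunk in iter(lambda: file.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def matches(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of(path) == expected_hex.strip().lower()

# Assumed location of the downloaded source distribution
archive = "biotite-0.32.0.tar.gz"
# SHA256 digest as listed for biotite-0.32.0.tar.gz
expected = "6dee8dca3d61193c4dda936d16d4c388934b1c1f5c981ebe343bbcf72c7332dc"

if os.path.exists(archive):
    print("OK" if matches(archive, expected) else "MISMATCH")
else:
    print(f"{archive} not found in the current directory")
```

Reading the file in chunks keeps memory use low, which matters little for archives of this size but makes the helper safe for larger files.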