cobrababel

CobraBabel: COBRA Model Translator

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: BSD License
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering
- Scientific/Engineering :: Bio-Informatics

Project description

Metabolic models are available from many sources, and different source systems use different ID namespaces, which makes it difficult to combine and compare models from different sources. CobraBabel supports the following source systems:

- Virtual Metabolic Human (VMH)
- Biochemical, Genetic and Genomic knowledge base (BiGG)
- MetaNetX
- Kyoto Encyclopedia of Genes and Genomes (KEGG)

Features include:

- Create a universal model from VMH, BiGG, MetaNetX
- Get a list of available models from BiGG
- Create an organism model from VMH, BiGG
- Get a specific reaction or metabolite from BiGG, KEGG
- Get a specific enzyme, DNA sequence, or amino acid sequence from KEGG

Installation

Use pip to install CobraBabel from PyPI (we recommend doing this inside a virtual environment):

pip install cobrababel

Source systems

CobraBabel uses the data access methods provided by the source systems. The version of each source system that CobraBabel supports is given in the notes below. If a source system changes its interface, CobraBabel may return an error or incorrect data.

CobraBabel relies on web services provided by other organizations. A service can go offline, change its interface, or move to a different URL. CobraBabel uses these default URLs:

- VMH download service at https://webdav-r3lab.uni.lu/public/msp
- BiGG web service at http://bigg.ucsd.edu/api/v2
- MetaNetX download service at http://www.metanetx.org/cgi-bin/mnxget/mnxref
- KEGG web service at http://rest.kegg.jp
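
As a rough illustration of how the default URLs above translate into individual requests, the sketch below builds request URLs for BiGG and KEGG without contacting either service. The `DEFAULT_URLS` dictionary and the helper functions are hypothetical and are not part of the CobraBabel API. The path layouts are assumptions based on the public BiGG API version 2 and KEGG REST conventions and should be checked against the current service documentation.

```python
# Hypothetical helpers, not part of CobraBabel: build request URLs from the
# default base URLs listed above. No network access is performed.

DEFAULT_URLS = {
    "vmh": "https://webdav-r3lab.uni.lu/public/msp",
    "bigg": "http://bigg.ucsd.edu/api/v2",
    "metanetx": "http://www.metanetx.org/cgi-bin/mnxget/mnxref",
    "kegg": "http://rest.kegg.jp",
}


def join_url(base, *parts):
    """Join path segments onto a base URL using single slashes."""
    segments = [base.rstrip("/")] + [str(part).strip("/") for part in parts]
    return "/".join(segments)


def bigg_universal_metabolite_url(metabolite_id, base=DEFAULT_URLS["bigg"]):
    # Assumed BiGG API v2 layout: <base>/universal/metabolites/<id>
    return join_url(base, "universal", "metabolites", metabolite_id)


def kegg_get_url(entry_id, base=DEFAULT_URLS["kegg"]):
    # Assumed KEGG REST layout: <base>/get/<entry id>
    return join_url(base, "get", entry_id)


print(bigg_universal_metabolite_url("g3p"))
print(kegg_get_url("R00200"))
```

Keeping the base URLs in one place makes it easier to point at a different address if a service moves.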

VMH Notes

CobraBabel uses the most recent versions of Recon2 and AGORA.

VMH provides a SEED2VMH_translation.csv file, which is not the same as the translation table in Supplementary Table 15 of the AGORA paper. Supplementary Table 15 is provided as an Excel workbook, which was parsed into two files, (1) vmh_metabolite_xref.tsv and (2) vmh_reaction_xref.tsv, both of which are provided in the “data” folder.

BiGG Notes

CobraBabel uses BiGG API version 2.

Universal metabolites can have more than one formula if different models use different formulae for a metabolite with the same ID (the same situation can occur with charge). Currently, CobraBabel picks the first formula in the list of formulae for a universal metabolite without checking the others.
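
The behavior described above can be sketched as follows. This is a minimal illustration, not CobraBabel's actual code: the function name `pick_first_formula` is hypothetical, and the warning is an addition made here to show where information is discarded.

```python
import warnings


def pick_first_formula(metabolite_id, formulae):
    """Return the first formula in the list, or None if the list is empty.

    A warning is issued when the list holds more than one distinct formula,
    because every formula after the first is ignored.
    """
    if not formulae:
        return None
    distinct = set(formulae)
    if len(distinct) > 1:
        warnings.warn(
            "Metabolite {0} has {1} distinct formulae; using {2}".format(
                metabolite_id, len(distinct), formulae[0]))
    return formulae[0]


# Made-up example values; real BiGG records may differ.
print(pick_first_formula("example_met", ["C6H12O6", "C6H11O6"]))
```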

Universal metabolites have a list of all the compartments where the metabolite is used, but a compartment name is not provided. Since different models may use different names for the same compartment ID, the compartment name is not set when a universal metabolite is added to a model.

There is no bulk download of universal metabolites and reactions, so creating a universal model is very slow because each metabolite and reaction is downloaded separately.

MetaNetX Notes

CobraBabel uses MNXref Version 2015/09/03.

Some reactions are defined with unspecified stoichiometry coefficients, for example: (2n) MNXM1471 + 1 MNXM3341 = (2n) MNXM1 + 1 MNXM4074 + (2n) MNXM537. Currently, CobraBabel does not include these reactions when creating a universal model.

Some reactions have a BIOMASS metabolite in the reaction definition, but the BIOMASS metabolite itself is not defined, so any reactions with the BIOMASS metabolite are not included in the universal model.
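
One way to detect such reactions is to try converting every coefficient to a number. The sketch below is an illustration under the assumption that terms are separated by " + " and sides by " = ", as in the example above. The function names are hypothetical and this is not necessarily how CobraBabel performs the check.

```python
def parse_side(side):
    """Parse one side of an equation into (coefficient, metabolite ID) pairs.

    Raises ValueError when a coefficient is not a plain number.
    """
    terms = []
    for term in side.split(" + "):
        coefficient, metabolite_id = term.strip().split(" ", 1)
        terms.append((float(coefficient), metabolite_id.strip()))
    return terms


def has_numeric_stoichiometry(equation):
    """Return True only if every coefficient in the equation is numeric."""
    try:
        left, right = equation.split(" = ")
        parse_side(left)
        parse_side(right)
    except ValueError:
        return False
    return True


unspecified = ("(2n) MNXM1471 + 1 MNXM3341 = "
               "(2n) MNXM1 + 1 MNXM4074 + (2n) MNXM537")
print(has_numeric_stoichiometry(unspecified))
```

A reaction for which the function returns False would be skipped when building the model.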

If a reaction has a value in the Source field, there is only one source in the format: source:id. Sources include Rhea (rhea), KEGG (kegg), MetaCyc (metacyc), UniPathway (upa), The Seed (seed), BiGG (bigg), BioPath (biopath), and Reactome (reactome). Set the verbose parameter when calling create_metanetx_universal_model() to show a warning for reactions with an invalid format in the Source field.
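
The source:id format can be checked with a small parser such as the one sketched below. The function `parse_source` and its `verbose` argument are hypothetical stand-ins used for illustration. They are not the CobraBabel implementation, and treating an unknown source prefix as invalid is an assumption made here.

```python
import warnings

# Source prefixes listed in the note above.
KNOWN_SOURCES = {
    "rhea", "kegg", "metacyc", "upa", "seed", "bigg", "biopath", "reactome",
}


def parse_source(field, verbose=False):
    """Split a 'source:id' value into a (source, id) tuple.

    Returns None for an empty field or an invalid format. A warning is
    issued for an invalid format only when verbose is True.
    """
    if not field:
        return None
    source, separator, identifier = field.partition(":")
    if not separator or not identifier or source not in KNOWN_SOURCES:
        if verbose:
            warnings.warn("Invalid Source field: {0!r}".format(field))
        return None
    return source, identifier


# Made-up example values for illustration.
print(parse_source("kegg:R00200"))
print(parse_source("unknown-format"))
```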

There are no compartments, so metabolites are defined without a compartment.

There are no names for reactions, so the reaction name is set to the reaction ID.

All reactions are defined as bi-directional, so the lower and upper bounds are set to the default values (-1000.0, 1000.0) for all reactions.
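
The reaction defaults described in these notes can be summarized in a small record type. This sketch uses a plain named tuple rather than a COBRA reaction object. `ReactionRecord` and `make_reaction_record` are hypothetical names introduced only for illustration.

```python
from collections import namedtuple

# Hypothetical record type mirroring the defaults described above.
ReactionRecord = namedtuple(
    "ReactionRecord", ["id", "name", "lower_bound", "upper_bound"])

DEFAULT_BOUNDS = (-1000.0, 1000.0)


def make_reaction_record(reaction_id):
    """Build a record whose name equals its ID and whose bounds are the
    bi-directional defaults."""
    lower, upper = DEFAULT_BOUNDS
    return ReactionRecord(
        id=reaction_id, name=reaction_id,
        lower_bound=lower, upper_bound=upper)


# "MNXR_example" is a made-up reaction ID.
print(make_reaction_record("MNXR_example"))
```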

KEGG Notes

CobraBabel uses the current version of the KEGG API.

Note that there are some limitations on using the KEGG API, most importantly:

Restrictions: KEGG API is provided for academic use by academic users belonging to academic institutions. This service should not be used for bulk data downloads. Please obtain KEGG FTP academic subscription for downloading KEGG data.

—www.kegg.jp

When setting OTU representatives in the organism database, the organism that is selected can vary when there are multiple equally good matches. For example, in the KEGG database there are two Mycobacterium tuberculosis H37Rv organisms, one with ID “T00015” and one with ID “T02178”. Which of the two is picked as the OTU representative is random.

A sample OTU representative source file is provided in the “data/otu-reps.tsv” file, which was retrieved from the KBase Central Data Model using the all_entities_OTU command.
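
The following sketch illustrates why the choice varies and shows one possible way to make it reproducible. The scoring scheme, the function names, and the tie-breaking rule are all assumptions made for illustration. They do not describe how CobraBabel matches organisms.

```python
import random


def tied_best(candidates):
    """Return the organism IDs that share the highest score.

    candidates is a list of (organism_id, score) pairs; a higher score
    means a better match.
    """
    best = max(score for _, score in candidates)
    return [organism_id for organism_id, score in candidates if score == best]


def pick_representative(candidates, rng=random):
    """Pick one of the equally good matches at random."""
    return rng.choice(tied_best(candidates))


def pick_representative_stable(candidates):
    """Deterministic alternative: break ties by the smallest organism ID."""
    return min(tied_best(candidates))


# Hypothetical scores; only the two organism IDs come from the text above.
matches = [("T00015", 1.0), ("T02178", 1.0)]
print(pick_representative(matches))         # either ID may be printed
print(pick_representative_stable(matches))
```

Breaking ties by ID is only one option. Any fixed rule would give the same representative on every run.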

Release history

- 0.1.3: Aug 4, 2017
- 0.1.2: Feb 15, 2017
- 0.1.1: Jun 7, 2017 (this version)
- 0.1.0: May 30, 2017


Source Distribution

cobrababel-0.1.1.tar.gz (29.1 kB), uploaded Jun 7, 2017

Hashes for cobrababel-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`0a5086bba7edeec0c4a948f2df699b0b23390dd70af93bd10c6cae7d91a83528`
MD5	`e1449c287a02a75f751b8ca5f6b8a8fa`
BLAKE2b-256	`890913554a3390fce886ac5d1f07450ccb9840ae5ceef2205ad8630454272171`