Skip to main content

PROtein Feature Aggregation and Variants.

Project description

Protein Feature Aggregation and Variants

Pypi Build Status Documentation Status Python: versions License

ProteoFAV is a Python module that address the challenge of cross-mapping protein structures and protein sequences, allowing for protein structures to be annotated with sequence features. It implements methods for working with protein structures (via mmCIF, PDB, PDB Validation, DSSP and SIFTS files), sequence Features (via UniProt GFF annotations) and genetic variants (via UniProt/EBI Proteins API and Ensembl REST API). Cross-mapping of structure and sequence is performed with the aid of SIFTS.

ProteFAV relies heavily in the Pandas library to quickly load data into DataFrames for fast data exploration and analysis. Structure and sequence data are parsed/fetched onto Pandas DataFrames that are then merged-together (collapsed) onto a single DataFrame.

Getting Started

Dependencies

ProteoFAV was developed to support Python 3.5+ and Pandas 0.20+.

Check requirements for specific requirements.

Installation

To install the stable release, run this command in your terminal:

$ pip install proteofav

If you don’t have pip installed, this Python installation guide can guide you through the process.

Installing from source in a virtual environment

Getting ProteoFAV:

$ wget https://github.com/bartongroup/ProteoFAV/archive/master.zip -O ProteoFAV.zip
$ unzip ProteoFAV.zip

# alternatively, cloning the git repository
$ git clone https://github.com/bartongroup/ProteoFAV.git

Installing With Conda:

$ conda-env create -n proteofav -f path/to/ProteoFAV/requirements.txt
$ source activate proteofav
$ cd path/to/ProteoFAV
$ pip install .

Installing with Virtualenv:

$ virtualenv --python `which python` env
$ source env/bin/activate
$ pip install -r requirements.txt
$ python path/to/ProteoFAV/setup.py install

Testing the installation

Test dependencies should be resolved with:

$ python path/to/ProteoFAV/setup.py develop --user

Run the Tests with:

$ python path/to/ProteoFAV/setup.py test
# or
$ cd path/to/ProteoFAV/tests
$ python -m unittest discover

ProteoFAV Configuration

ProteoFAV uses a configuration file config.ini where the user can specify the directory paths, as well as urls for commonly used data sources.

After installing run:

$ proteofav-setup

Example Usage

Example usage is currently provided as a Jupyter Notebook, which can be viewed with the GitHub’s file viewer or with the Jupyter nbviewer.

You can download the Jupyter notebook from GitHub and test it with your ProteoFAV’s installation.

Contributing and Bug tracking

Feel free to fork, clone, share and distribute. If you find any bugs or issues please log them in the issue tracker.

Before you submit your Pull-requests read the Contributing Guide.

Credits

See the Credits

Changelog

See the Changelog

Licensing

The MIT License (MIT). See license for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

proteofav-0.2.3.tar.gz (89.5 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page