Python package for running custom protein inference algorithms on tab-formatted tandem mass spectrometry (MS/MS) search results.

Project description

Py Protein Inference

Requirements

Python 3.7 or greater. This package was created using Python 3.7.
Python packages: numpy, pyteomics, pulp, PyYAML, matplotlib. These should be installed automatically when the package is installed.
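
As a quick sanity check of the requirements above, a minimal sketch like the following reports whether the interpreter is new enough and which dependencies cannot be found. The helper names `python_is_supported` and `missing_dependencies` are hypothetical, and the import names (for example `yaml` for PyYAML) are assumptions about how those distributions are usually imported, not something this README states.

```python
import importlib.util
import sys

# Import names assumed for the distributions listed above
# (PyYAML is imported as "yaml"); adjust if your environment differs.
REQUIRED_MODULES = ["numpy", "pyteomics", "pulp", "yaml", "matplotlib"]


def python_is_supported(version_info=sys.version_info):
    """Return True if the interpreter is Python 3.7 or greater."""
    return tuple(version_info[:2]) >= (3, 7)


def missing_dependencies(modules=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python supported:", python_is_supported())
    print("Missing modules:", missing_dependencies())
```

An empty list from `missing_dependencies()` only shows that the modules can be located; it does not check their versions.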

Quick Start Guide

Install the package using pip:

pip install pyproteininference

Run the heuristic method on tab-delimited results taken directly from Percolator to generate results for the peptide-centric, parsimony, inclusion, and exclusion methods:

protein_inference_heuristic_cli.py \
-t /path/to/target/file1.txt \
-d /path/to/decoy/file1.txt \
-db /path/to/database/file.fasta

Run the standard command-line tool on tab-delimited results taken directly from Percolator to apply a single inference method. If no parameter file is specified, peptide-centric inference is used by default:

protein_inference_cli.py \
-t /path/to/target/file.txt \
-d /path/to/decoy/file.txt \
-db /path/to/database/file.fasta

Specifying parameters. The two parameters most commonly changed are the inference type and the decoy symbol (used to distinguish decoy proteins from target proteins). To change them, create a file called params.yaml as follows:

parameters:
  inference:
    inference_type: parsimony
  identifiers:
    decoy_symbol: "decoy_"

The inference type can be one of: parsimony, peptide_centric, inclusion, exclusion, or first_protein. All parameters are optional, so you only need to define the ones you want to alter. Parameters that are not defined are set to default values. See the package documentation for the default parameters.
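
If the parameter file is generated from a script, a minimal sketch along these lines can guard against a mistyped inference type before the tool is run. The helpers `build_params_yaml` and `write_params` are hypothetical and not part of the package; their default arguments simply mirror the example above and are not claimed to be the package defaults.

```python
from pathlib import Path

# Inference types listed in this README.
VALID_INFERENCE_TYPES = {
    "parsimony", "peptide_centric", "inclusion", "exclusion", "first_protein",
}


def build_params_yaml(inference_type="parsimony", decoy_symbol="decoy_"):
    """Return YAML text that sets only the two parameters shown above."""
    if inference_type not in VALID_INFERENCE_TYPES:
        raise ValueError(f"Unknown inference type: {inference_type!r}")
    # Keep the quoting trivial: reject characters that would need escaping.
    if '"' in decoy_symbol or "\\" in decoy_symbol:
        raise ValueError("decoy_symbol must not contain quotes or backslashes")
    return (
        "parameters:\n"
        "  inference:\n"
        f"    inference_type: {inference_type}\n"
        "  identifiers:\n"
        f'    decoy_symbol: "{decoy_symbol}"\n'
    )


def write_params(path, **kwargs):
    """Write the generated YAML to `path` and return it as a Path."""
    path = Path(path)
    path.write_text(build_params_yaml(**kwargs), encoding="utf-8")
    return path
```

For example, `write_params("params.yaml", inference_type="inclusion")` writes a file that overrides the inference type and leaves every other parameter undefined.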

Run the standard commandline tool again, this time specifying the parameters as above:

protein_inference_cli.py \
-t /path/to/target/file.txt \
-d /path/to/decoy/file.txt \
-db /path/to/database/file.fasta \
-y /path/to/params.yaml
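
When the tool is driven from a script, the same invocation can be assembled programmatically. The following is a minimal sketch, assuming `protein_inference_cli.py` is on the PATH after installation; it uses only the flags shown in this README (`-t`, `-d`, `-db`, `-y`), and the function names `build_cli_command` and `run_inference` are hypothetical.

```python
import shlex
import subprocess


def build_cli_command(target, decoy, database, params=None):
    """Build the argument list for protein_inference_cli.py.

    Only the flags shown in this README (-t, -d, -db, -y) are used.
    """
    command = [
        "protein_inference_cli.py",
        "-t", str(target),
        "-d", str(decoy),
        "-db", str(database),
    ]
    # The parameter file is optional, as in the examples above.
    if params is not None:
        command += ["-y", str(params)]
    return command


def run_inference(target, decoy, database, params=None):
    """Run the CLI and raise CalledProcessError if it exits with an error."""
    command = build_cli_command(target, decoy, database, params)
    print("Running:", " ".join(shlex.quote(arg) for arg in command))
    return subprocess.run(command, check=True)
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with paths that contain spaces.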

Running with Docker
- Either pull the image from Docker Hub:
  - docker pull hinklet/pyproteininference:latest
- Or build the image with the following commands, cloning the repository first:
  - git clone REPOSITORY_URL
  - cd pyproteininference
  - docker build -t pyproteininference:latest .
- Run the tool, making sure to volume mount the directory that contains your input data and parameters. In the example below, the local directory is /path/to/local/directory and the corresponding path in the container is /data:
```
docker run -v /path/to/local/directory/:/data \
  -it hinklet/pyproteininference:latest \
  python /usr/local/bin/protein_inference_cli.py \
  -f /data/input_file.txt \
  -db /data/database.fasta \
  -y /data/parameters.yaml \
  -o /data/
```
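
Because the container only sees files under the mounted directory, every path passed to the tool has to be expressed relative to /data. The sketch below illustrates that mapping and rebuilds the command above as an argument list. It assumes POSIX-style paths, and the helper names `to_container_path` and `build_docker_command` are hypothetical.

```python
from pathlib import PurePosixPath


def to_container_path(local_path, local_dir, container_dir="/data"):
    """Translate a path under the mounted local directory to its container path.

    Raises ValueError if `local_path` is not inside `local_dir`, since such a
    file would not be visible inside the container.
    """
    relative = PurePosixPath(local_path).relative_to(PurePosixPath(local_dir))
    return str(PurePosixPath(container_dir) / relative)


def build_docker_command(local_dir, input_file, database, params,
                         image="hinklet/pyproteininference:latest",
                         container_dir="/data"):
    """Build a `docker run` argument list mirroring the example above."""
    def inside(path):
        return to_container_path(path, local_dir, container_dir)

    return [
        "docker", "run",
        "-v", f"{local_dir}:{container_dir}",
        "-it", image,
        "python", "/usr/local/bin/protein_inference_cli.py",
        "-f", inside(input_file),
        "-db", inside(database),
        "-y", inside(params),
        "-o", container_dir + "/",
    ]
```

The resulting list can be passed to `subprocess.run`. If the image was built locally with the build command above, pass `image="pyproteininference:latest"` instead of the default.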

Documentation

For more information please see the full package documentation (https://thinkle12.github.io/pyproteininference/).

Project details

This version: 1.0.0 (Jun 21, 2022)

Download files

Source Distribution

pyproteininference-1.0.0.tar.gz (638.8 kB)

Uploaded Jun 21, 2022 Source

Built Distribution

pyproteininference-1.0.0-py3-none-any.whl (67.4 kB)

Uploaded Jun 21, 2022 Python 3

Hashes for pyproteininference-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`055b3347742e70d0b4a04519de9cdab8b265558f9ffc00c68a0e10cf68cd0064`
MD5	`c0f19843f3e245277b3019408167a3bb`
BLAKE2b-256	`6d11523ca0a68ad06b847f6fb66c42a439ecd831258656c04e8dc06700ae55dc`

Hashes for pyproteininference-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`680c5198d4dae3df67d1991c16034c2d3ab69d8a60c0a4e61928410fa9de7baa`
MD5	`391f4a086d45bd090b249d85d012cd86`
BLAKE2b-256	`19015d899bf68f0b29403e1253ca3e3cec74f883e8c2be705dd568c9986d5d60`