Skip to main content

find housekeeping genes in prokaryotic (meta)genomic data

Project description

hkgfinder

PyPI Wheel Language Pyver Downloads License: MIT

Find housekeeping genes in prokaryotic (meta)genomes

Introduction

hkgfinder is a fast and accurate housekeeping gene finder and classifier. Hkgfinder can run on raw sequences, genomes and metagenomes. The novel value of this method lies is in its ability to directly predict and classify gene sequences into housekeeping gene families at a high specificity and sensitivity, while being also faster than genome and metagenome annotator on genome and metagenome data.

How hkgfinder works

Installation

You will have first to install Prodigal and HMMER 3 to be able to run hkgfinder.

Install from Pip

pip install hkgfinder

Install from source

# Download hkgfinder development version
git clone https://github.com/Ebedthan/hkgfinder.git hkgfinder

# Navigate to directory
cd hkgfinder

# Install with poetry: see https://python-poetry.org
poetry install --no-dev

# Enter the Python virtual environment with
poetry shell

# Test hkgfinder is correctly installed
hkgfinder -h

If you do not want to go into the virtual environment just do:

poetry run hkgfinder -h

Test

  • Type hkgfinder -h and it should output something like:
usage: hkgfinder [options] [<FILE>]

optional arguments:
  -o [FILE]      output result to FILE [stdout]
  -g             activate genome mode [false]
  -m             activate metagenome mode [false]
  --faa FILE     output matched proteins sequences to FILE
  --fna FILE     output matched DNA sequences to FILE
  -t INT         number of threads [1]
  -q             decrease program verbosity
  -v, --version  show program's version number and exit
  -h, --help     show this help message and exit

Invoking hkgfinder

hkgfinder --faa housekeeping.faa --fna housekeeping.fna file.fa.gz
  • hkgfinder supports gzip, lzma, bz2 and zstd compressed files.

Bugs

Submit problems or requests to the Issue Tracker.

Dependencies

Mandatory

  • Prodigal
    Used for protein-coding gene prediction.
    Hyatt, D., Chen, GL., LoCascio, P.F. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010). https://doi.org/10.1186/1471-2105-11-119

  • HMMER 3
    Used for HMM profile prediction.
    Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195

Licence

MIT.

Author

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hkgfinder-0.1.0.tar.gz (853.2 kB view hashes)

Uploaded Source

Built Distribution

hkgfinder-0.1.0-py3-none-any.whl (868.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page