find housekeeping genes in prokaryotic (meta)genomic data
Project description
hkgfinder
Find housekeeping genes in prokaryotic (meta)genomes
Introduction
hkgfinder is a fast and accurate housekeeping gene finder and classifier. Hkgfinder can run on raw sequences, genomes and metagenomes. The novel value of this method lies is in its ability to directly predict and classify gene sequences into housekeeping gene families at a high specificity and sensitivity, while being also faster than genome and metagenome annotator on genome and metagenome data.
How hkgfinder works
Installation
You will have first to install Prodigal and HMMER 3 to be able to run hkgfinder.
Install from Pip
pip install hkgfinder
Install from source
# Download hkgfinder development version
git clone https://github.com/Ebedthan/hkgfinder.git hkgfinder
# Navigate to directory
cd hkgfinder
# Install with poetry: see https://python-poetry.org
poetry install --no-dev
# Enter the Python virtual environment with
poetry shell
# Test hkgfinder is correctly installed
hkgfinder -h
If you do not want to go into the virtual environment just do:
poetry run hkgfinder -h
Test
- Type
hkgfinder -h
and it should output something like:
usage: hkgfinder [options] [<FILE>]
optional arguments:
-o [FILE] output result to FILE [stdout]
-g activate genome mode [false]
-m activate metagenome mode [false]
--faa FILE output matched proteins sequences to FILE
--fna FILE output matched DNA sequences to FILE
-t INT number of threads [1]
-q decrease program verbosity
-v, --version show program's version number and exit
-h, --help show this help message and exit
Invoking hkgfinder
hkgfinder --faa housekeeping.faa --fna housekeeping.fna file.fa.gz
- hkgfinder supports gzip, lzma, bz2 and zstd compressed files.
Bugs
Submit problems or requests to the Issue Tracker.
Dependencies
Mandatory
-
Prodigal
Used for protein-coding gene prediction.
Hyatt, D., Chen, GL., LoCascio, P.F. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010). https://doi.org/10.1186/1471-2105-11-119 -
HMMER 3
Used for HMM profile prediction.
Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195
Licence
MIT.
Author
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for hkgfinder-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7e2bb67103f849ad268827837520ea6c70256990a135ee5fba695ccc1dfc8eb2 |
|
MD5 | 8cbee5996244edb49d30b80755fdf69c |
|
BLAKE2b-256 | 6486719761ae36ab79afa3dc69e75a5e3a512cb7e182e59aa8bb20909ef8e448 |