A pipeline for binning metagenomic datasets from 3C data.

# metaTOR

[![PyPI version](https://badge.fury.io/py/metator.svg)](https://badge.fury.io/py/metator)
[![Read the docs](https://readthedocs.org/projects/metator/badge)](https://metator.readthedocs.io)
[![License: GPL v3](https://img.shields.io/badge/License-GPL%203-0298c3.svg)](https://opensource.org/licenses/GPL-3.0)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)

Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.

## Installation

```sh
pip3 install metator
```

or, to install the latest development version from GitHub:

```sh
pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator
```

Python 3.4 or later is required. A [standalone
version](https://github.com/koszullab/metaTOR/tree/python3-standalone) (no
installation, just download/unzip/run) is also available, as well as a [Python
2 version](https://github.com/koszullab/metaTOR/tree/python2), but keep
in mind that development will focus on the current branch.
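
To confirm the Python requirement before installing, a minimal check could look like the sketch below. The function name ```python_ok``` is hypothetical and not part of metaTOR; it only asks the interpreter for its own ```sys.version_info```.

```sh
# Hypothetical helper (not part of metaTOR): succeed only if the given
# interpreter (default: python3) is Python 3.4 or later.
python_ok() {
    "${1:-python3}" -c 'import sys; sys.exit(0 if sys.version_info >= (3, 4) else 1)'
}

# Example: report the result without aborting the shell.
# python_ok && echo "Python version OK" || echo "Python 3.4 or later is required"
```

Once the package is installed, ```metator version``` (listed under Usage) should display the installed version number.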

## Usage

    metator {align|partition|annotation|binning} [parameters]

A metaTOR command takes the form ```metator action --param1 arg1 --param2 arg2```, and so on for any further parameters.

There are four actions/steps in the metaTOR pipeline, which must be run in the following order:

* ```align``` : map paired-end reads onto a preliminary assembly, then generate a network from
the contacts detected between DNA chunks.
* ```partition``` : run the Louvain community detection algorithm many times to isolate
chunks that consistently cluster together, for binning purposes.
* ```annotation``` : run standard annotation software on the assembly (namely gene prediction
and database comparison) so that the results can be matched with the bins.
* ```binning``` : match annotations to bins, extract bin genomes and subnetworks, and build bin-local
and global contact maps.

After the last step is completed there should be a set of bins, their relative
enrichments in various gene categories, and the contact map of each bin.
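
The required ordering of the four steps can be sketched as a small shell function that stops at the first failure. This is only an illustration: the function name ```run_metator_steps``` is hypothetical, and the per-action parameters are deliberately left out because they are documented in the manual rather than here, so each call would need its real arguments added before use.

```sh
# Hypothetical wrapper (not part of metaTOR): run the four actions in the
# documented order and stop as soon as one of them fails.
# Parameters are omitted on purpose; add the ones each action needs.
run_metator_steps() {
    for action in align partition annotation binning; do
        echo "Running: metator $action" >&2
        metator "$action" || {
            echo "Step '$action' failed; stopping." >&2
            return 1
        }
    done
}
```

Stopping early matters here because each step consumes the output of the previous one, so continuing after a failure would only produce misleading errors further down.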

There are a number of other, optional, miscellaneous actions:

* ```pipeline``` : check that the environment is set up correctly, then run all four of the above actions sequentially.
This can take a while.
* ```dependencies``` : download third party dependencies that are not
available in most package managers.
* ```deploy``` : set up the environment and all dependencies for Ubuntu 14.04
and higher (run as root).
* ```version``` : display current version number.
* ```help``` : display this help message.

Please refer to the [metaTOR manual](https://github.com/koszullab/metaTOR/meta3c_manual.pdf) for detailed explanations on the parameters.

## Requirements

* Python 3 with ```numpy```, ```scipy```, ```matplotlib```, ```biopython``` and ```pysam``` libraries.
* [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
* [samtools](http://www.htslib.org/)
* [hmmer](http://hmmer.org/) and some HMM databases (such as [these](http://dl.pasteur.fr/fop/LItxiFe9/hmm_databases.tgz))
* [prodigal](https://github.com/hyattpd/Prodigal)
* [louvain](https://sourceforge.net/projects/louvain/) (original
implementation)
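
To see which of the external tools are already available, a quick check of the ```PATH``` might look like the sketch below. The helper name ```check_tools``` is hypothetical, and the executable names in the example call are assumptions (for instance, ```hmmsearch``` as the program provided by hmmer); the Louvain binaries are left out because their names depend on how that package was built.

```sh
# Hypothetical helper (not part of metaTOR): print each tool that is not
# found on PATH, and return non-zero if at least one is missing.
check_tools() {
    missing=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            missing=1
        fi
    done
    return "$missing"
}

# Example call (executable names are assumptions):
# check_tools bowtie2 samtools hmmsearch prodigal
```

The Python libraries can be checked in a similar spirit with ```python3 -c "import numpy, scipy, matplotlib, Bio, pysam"```, where ```Bio``` is the import name of biopython.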

Most requirements can be installed with the operating system's package manager. Those
that cannot (namely ```prodigal```, ```louvain``` and the HMM databases) can be
fetched with the following command (it may need to be run as root,
depending on where the package is installed):

```sh
metator dependencies
```

A Dockerfile is also available.

## References

* [Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381813/), Martial Marbouty, Axel Cournac, Jean-François Flot, Hervé Marie-Nelly, Julien Mozziconacci, and Romain Koszul, eLife, 2014
* [Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315449/), Martial Marbouty, Lyam Baudry, Axel Cournac, and Romain Koszul, Science Advances, 2017

## Contact

### Authors

* lyam.baudry@pasteur.fr
* thfoutel@pasteur.fr
* martial.marbouty@pasteur.fr
* romain.koszul@pasteur.fr

### Research lab

[Spatial Regulation of Genomes](https://research.pasteur.fr/en/team/spatial-regulation-of-genomes/) (Institut Pasteur, Paris)
