Skip to main content

Analysis of nanopore sequencing data.

Project description

Summary

This package provides tools for the analysis of raw nanopore sequencing data, including correction of basecalls and visualization.

Full Documentation

Full documentation avaialable at Read the Docs: https://nanoraw.readthedocs.io

Installation

Install nanoraw via pip

pip install nanoraw

Install bleeding edge via github

pip install git+https://github.com/marcus1487/nanoraw.git

Usage

nanoraw -h
nanoraw [command] [options]

Main Comand (Must be run before any other commands):

  • genome_resquiggle: Re-annotate raw signal with genomic aignement of existing basecalls.

Genome Anchored Plotting Commands:

  • plot_max_coverage: Plot signal in regions with the maximum coverage.

  • plot_genome_location: Plot signal at defined genomic locations.

  • plot_motif_centered: Plot signal at regions centered on a specific motif.

  • plot_max_difference: Plot signal where signal differs the most between two groups.

  • plot_most_significant: Plot signal where signal differs the most significantly between two groups.

  • plot_motif_with_stats: Plot signal from several regions and test statistics centered on a k-mer of interst.

Sequencing Time Anchored Plotting Command:

  • plot_correction: Plot segmentation before and after correction.

  • plot_multi_correction: Plot multiple raw signals anchored by genomic location.

Other Plotting Command:

  • plot_kmer: Plot signal quantiles acorss kmers.

  • cluster_most_significant: Clustering traces at bases with significant differences.

Auxiliary Command:

  • write_most_significant_fasta: Write sequence where signal differs the most significantly between two groups.

  • write_wiggles : Write wiggle files for various attribute of reads.

    Get additional help for subcommands with nanoraw [command] -h

Requirements

python Requirements:

  • numpy

  • scipy

  • h5py

Optional plotting packages (install R packages with install.packages([package_name]) from an R prompt):

  • rpy2 (python package; with R installation)

  • changepoint (for using R’s changepoint package for re-segmentation)

  • ggplot2 (required for all plotting subcommands)

  • cowplot (required for plot_motif_with_stats subcommand)

Citation

Stoiber, M.H. et al. De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing. bioRxiv (2016).

http://biorxiv.org/content/early/2016/12/15/094672

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nanoraw-0.4.1.tar.gz (45.6 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page