Skip to main content

SMAP is a software package that analyzes next-generation DNA sequencing read mapping distributions and performs haplotype calling to create multi-allelic molecular markers.

Project description

SMAP - Stack Mapping Anchor Points

pipepeline status badge coverage report

  • SMAP delineate analyses read mapping distributions for GBS read mapping QC, defines read mapping polymorphisms within loci and across samples, and selects high quality loci across the sample set for downstream analyses.
  • SMAP sliding-frames defines loci covering SNPs and/or structural variants to run SMAP haplotype-sites.
  • SMAP compare identifies the overlap between two sets of loci (e.g. common loci across two runs of SMAP delineate).
  • SMAP haplotype-sites performs read-backed haplotyping using a priori known polymorphic SNP sites, and creates "ShortHaps". As a special case, SMAP haplotype-sites also captures GBS read mapping polymorphisms (here called "SMAPs") as a novel genetic diversity marker type, and integrates those with SNPs for ShortHap haplotyping.
  • SMAP target-selection creates input files for SMAP design.
  • SMAP design creates highly multiplex amplicon sequencing (HiPlex) primers and/or gRNA panels for genotyping CRISPR/Cas-induced or natural variation in a genepool.
  • SMAP haplotype-window works independent of prior knowledge of polymorphisms, groups reads by locus, defines a window enclosed between two custom border sequences, and retains the entire corresponding DNA sequence as haplotype.
  • SMAP effect-prediction is designed to provide biological interpretation of the haplotype call tables created by SMAP haplotype-window.
  • SMAP grm creates a similarity/distance matrix by converting a SMAP haplotype-site genotype call table based on GBS or amplicon sequencing data.

Documentation

An extensive manual of the SMAP package can be found on Read the Docs including detailed explanations and illustrations.

Citation

If you use SMAP, please cite "Schaumont et al., (2022). Stack Mapping Anchor Points (SMAP): a versatile suite of tools for read-backed haplotyping. https://doi.org/10.1101/2022.03.10.483555". Source code is available online at https://gitlab.ilvo.be/genomics/smap-package/smap.

License

SMAP is available under the GNU Affero General Public License v3.0 (GNU AGPLv3.

Building and installing

SMAP is being developed and tested on Linux. Additionally, some dependencies are only developed on Linux.

https://ngs-smap.readthedocs.io/en/latest/quickstart/quickstart.html#smapinstallationquickstart describes the installation guidelines.

Contributions

  • The Ghent University 2019 and 2021 Computational Biology class under supervision of prof. Dr. Peter Dawyndt and Felix Van der Jeugt has made contributions to reduce memory usage and to speed up haplotype calculations.

Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ngs-smap-5.0.1.tar.gz (14.0 MB view hashes)

Uploaded Source

Built Distribution

ngs_smap-5.0.1-py3-none-any.whl (14.1 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page