phase genomic variants using DNA sequencing reads
Project description
WhatsHap
WhatsHap is a software for phasing genomic variants using DNA sequencing reads, also called haplotype assembly. It is especially suitable for long reads, but works also well with short reads.
If you use WhatsHap, please cite:
Murray Patterson, Tobias Marschall, Nadia Pisanti, Leo van Iersel, Leen Stougie, Gunnar W. Klau, Alexander Schönhuth. WhatsHap: Weighted Haplotype Assembly for Future-Generation Sequencing Reads. Journal of Computational Biology, 22(6), pp. 498-509, 2015. (Get a self-archived version)
The version of WhatsHap you find here is the result of further development focused on making the software easy and straightforward to use. WhatsHap is now Open Source software under the MIT license and we welcome contributions.
Pedigree phasing
WhatsHap is capable of using pedigree information about individuals to further improve phasing results, and to drastically reduce the needed coverage. A preprint is available on bioRxiv:
Read-Based Phasing of Related Individuals. Shilpa Garg, Marcel Martin, Tobias Marschall. doi:10.1101/037101
Parallel Version: pWhatsHap
A parallelization of the core dynamic programming algorithm has been described in
M. Aldinucci, A. Bracciali, T. Marschall, M. Patterson, N. Pisanti, M. Torquati. High-Performance Haplotype Assembly. Proceedings of the 11th International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB), 245-258, 2015.
The current implementation can be found in branch parallel.
Documentation
Read the documentation online. Offline documentation is available in the doc/ subdirectory in the repository and in the downloaded tar distribution.
Mailing list
We run a public mailing list. Please don’t hesitate to post questions and comments.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.