capsid

CaPSID: Computational Pathogen Sequence Identification

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Environment
- Console
Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: GNU General Public License (GPL)
Operating System
- Unix
Programming Language
- Python :: 2.7
Topic
- Scientific/Engineering :: Bio-Informatics

Project description

CaPSID

CaPSID (Computational Pathogen Sequence IDentification) is a comprehensive open source platform which integrates a high-performance computational pipeline for pathogen sequence identification and characterization in human genomes and transcriptomes together with a scalable results database and a user-friendly web-based software application for managing, querying and visualizing results.

Project Leader Vincent Ferretti Development Team Ivan Borozan, Philippe Laflamme, Shane Wilson, Stuart Watt

Downloading and using CaPSID is free, if you use CaPSID or its code in your work please acknowledge CaPSID by referring to its github homepage https://github.com/capsid. This is important for us since obtaining grants is one significant way to fund planning and implementation for our project. Also if you find CaPSID useful in your research feel free to let us know.

Getting Started

You will need a MongoDB database, a Python 2.7 installation and a Java Web Server. For more details, read the wiki:

https://github.com/capsid/capsid/wiki/

License and Copyright

Licensed under the GNU General Public License, Version 3.0. See LICENSE for more details.

Acknowledgement

This project is supported by the Ontario Institute for Cancer Research (OICR) through funding provided by the government of Ontario, Canada.

News

1.6.2

Release date: 16 July 2014

enforce the use of pymongo version==2.8

1.6.1

Release date: 16 September 2014

added taxon tables for viruses
added gra (genome relative abundance) calculations
metrics are now calculated for i) reads aligning to pathogen only (excluding those aligning simultaneously to the human ref (i.e digital subtraction)) and ii) reads aligning simultaneously to both pathogen and human refs
paired reads from the human ref pair-end BAM file are considered valid only if aligned as proper pairs
changed the filtering approach from collapsing of FASTQ and filtering to filtering
fixed a bug to do with the RAM usage when reading a xeno file during subtraction

1.4.3

Release date: 2 October 2012

Fixes Error message when running subtraction and alignment is not found
More work on fixing long running queries

1.4.2

Release date: 2 September 2012

Removes MongoKit dependency

1.4.1

Release date: 4 July 2012

Fixes cursor timeout for long running statistics queries

1.4.0

Release date: 1 June 2012

Adds statistics while they are being run rather than all at the end
Saves unique id for genes as uid

1.3.0

Release date: 28 Feb 2012

Adds a new field in mapped for AS tag
Links mapped genes to NCBI page
Uses max sample coverage for project max coverage
Adds timer to wait for file collapsing to finish when intersecting
Using a path/to/file input no longer breaks intersecting

1.2.6

Release date: 6 Feb 2012

Subtraction filters out unmapped when building mapped reads

1.2

Release date: 20 Jan 2012

Utility to create FastQ files from unmapped reads
Utility to return intersection of FastQ files
Utility to return filter FastQ files
Added mapq threshold flag to subtraction
Gbloader can now load bacteria and fungal genomes
Saves genome sequences to database using GridFS
Reduced memory footprint when calculating statistics
use subprocesses instead of os.system
mapq scores filter needs to include 0 <= mapq <=3 due to multiple mapping reads

1.1

Release date: 25 Nov 2011

Adds support for pair-end reads

1.0.1

Release date: 19 Oct 2011

Added support for MongoDB authentication

1.0

Release date: 12 Oct 2011

Initial release

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Environment
- Console
Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: GNU General Public License (GPL)
Operating System
- Unix
Programming Language
- Python :: 2.7
Topic
- Scientific/Engineering :: Bio-Informatics

Release history Release notifications | RSS feed

This version

1.6.2

Jul 16, 2015

1.6.1

Jul 7, 2015

1.4.3

Oct 2, 2012

1.4.2

Sep 2, 2012

1.4.1

Jul 4, 2012

1.4.0

Jun 1, 2012

1.2.7

Mar 8, 2012

1.2.6

Feb 6, 2012

1.2

Jan 20, 2012

1.1

Nov 25, 2011

1.0.1

Oct 19, 2011

1.0

Oct 12, 2011

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

capsid-1.6.2.tar.gz (28.8 kB view hashes)

Uploaded Jul 16, 2015 Source

Hashes for capsid-1.6.2.tar.gz

Hashes for capsid-1.6.2.tar.gz
Algorithm	Hash digest
SHA256	`33cfdf54ea737ad41c8902c2f6ccb0f9ac182a29414279236ae9cd45235fb41e`
MD5	`a6cdf7f7316e8b1a2534d4255c4b6ff3`
BLAKE2b-256	`bbbc94b0662d93cd6b1f9976e9c23530995731d14955c4c2c129a9a66e204850`