Skip to main content

Pipelines for genomic region file to produce bed files, and it's statistics

Project description

bedboss


Run pytests docs-badge pypi-badge Code style: black

bedboss is a command-line pipeline that standardizes and calculates statistics for genomic interval data, and enters the results into a BEDbase database. It has 3 components: 1) bedmaker (bedboss make); 2) bedqc (bedboss qc); and 3) bedstat bedboss stat. You may run all 3 pipelines separately, together (bedbase all).

1) bedmaker

Converts supported file types into BED and bigBed format. Currently supported formats:

  • bedGraph
  • bigBed
  • bigWig
  • wig

2) bedqc

Assess QC of BED files and flag potential problems for further evaluation so you can determine whether they should be included in downstream analysis. Currently, it flags BED files that are larger than 2 GB, have over 5 milliom regions, or have mean region width less than 10 bp. These thresholds can be changed with pipeline arguments.

bedstat

Calculates statistics about BED files.

Documentation

Detailed information about each pipeline can be found in the bedboss Readme.

For the specific bedbase.org instance, see instructions in the bedbase.org repo.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bedboss-0.1.0a3.tar.gz (23.6 kB view hashes)

Uploaded Source

Built Distribution

bedboss-0.1.0a3-py3-none-any.whl (24.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page