hybkit

Toolkit for analysis of .hyb format genomic sequence data from ribonomics experiments.

Welcome to hybkit, a toolkit for analysis of “.hyb” format genomic sequence data
generated from ribonomics techniques such as CLASH and qCLASH.
This software is available on GitHub at http://www.github.com/RenneLab/hybkit.
Full project documentation is available at hybkit’s ReadTheDocs.

This project contains multiple components:

(ToDo) The hybkit toolkit of command-line utilities for manipulating, analyzing, and plotting data contained within hyb-format files.
Analysis pipelines that use the toolkit to analyze qCLASH hybrid sequence data.
The hybkit Python API, an extensible, documented codebase for creating custom analyses of hyb-format data.

Hybkit Toolkit:

hybkit will include command-line utilities for manipulating “.hyb” format data:

Utility	Description
hyb_check.py	Read a “.hyb” file and check for errors
hyb_filter.py	Filter a “.hyb” file to a specific subset of sequences
hyb_analyze.py	Analyze and set details for hyb records, such as segtypes
hyb_type_analysis.py	Perform a type analysis on a prepared “hyb” file
hyb_mirna_count_analysis.py	Perform a miRNA_count analysis on a prepared “hyb” file
hyb_summary_analysis.py	Perform a summary analysis on a prepared “hyb” file
hyb_mirna_target_analysis.py	Perform a mirna_target analysis on a prepared “hyb” file
hyb_fold_analysis.py	Perform a fold analysis on a prepared “hyb” file

These scripts are used on the command line with hyb-format files. For example, to filter a hyb file to contain only sequences with identifiers containing the string “KSHV”:

$ hyb_filter.py ....[command_example]

Further detail on the usage of each script is provided in the hybkit Toolkit section of hybkit’s ReadTheDocs.
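
The filtering idea described above can be sketched in plain Python, without the toolkit. This is only an illustration and not the hyb_filter.py implementation: it assumes tab-separated hyb lines in which the fourth and tenth columns hold the reference names of the two hybrid segments. The function name `filter_hyb_lines` and the sample records are hypothetical.

```python
def filter_hyb_lines(lines, text, name_columns=(3, 9)):
    """Yield hyb-format lines whose segment names contain `text`.

    Assumption: fields are tab-separated, and the zero-based columns in
    `name_columns` hold the reference names of the two segments.
    """
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        names = [fields[i] for i in name_columns if i < len(fields)]
        if any(text in name for name in names):
            yield line


# Made-up records for illustration only (not real data).
example_lines = [
    "read_1\tACGTACGTAC\t-10.0\tmiR-K12-5p_KSHV_miRNA\t1\t5\t1\t5\t0.1"
    "\tGENE_A_mRNA\t6\t10\t100\t104\t0.1\n",
    "read_2\tTTGCATTGCA\t-8.5\tmiR-1_human_miRNA\t1\t5\t1\t5\t0.1"
    "\tGENE_B_mRNA\t6\t10\t200\t204\t0.1\n",
]

kept = list(filter_hyb_lines(example_lines, "KSHV"))
print(len(kept), "of", len(example_lines), "records kept")
```

The match is case-sensitive and checks only the segment name columns, so a read identifier that happens to contain the search text would not be selected.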

Pipelines:

Hybkit provides several example pipelines for analysis of “hyb” data using the utilities provided in the toolkit. These include:

Pipeline	Description
Summary Analysis	Summarize the sequence and miRNA types in a hyb file
Target Analysis	Analyze targets of a set of miRNA
Grouped Target Analysis	Analyze targets of a set of miRNA with grouped replicates
Fold Analysis	Analyze fold patterns of miRNA-containing hybrids
Fold Target Region Analysis	Perform fold analysis separated by targeted mRNA region

These pipelines provide analysis results in both tabular and graph form. For example, the summary analysis returns the hybrid sequence types found in a file as both a CSV table and a pie chart.

Further detail on each provided pipeline can be found in the Example Pipelines section of hybkit’s ReadTheDocs.
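
As a rough illustration of the tabular half of such a summary, the sketch below tallies a list of hybrid type labels and renders the counts as CSV text. It uses only the Python standard library and is not the pipeline's own code. The function name `type_counts_csv`, the column headers, and the type labels are assumptions made up for this example.

```python
import csv
import io
from collections import Counter


def type_counts_csv(hybrid_types):
    """Return CSV text with one row per hybrid type and its count."""
    counts = Counter(hybrid_types)
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["hybrid_type", "count"])
    # most_common() orders the rows by descending count.
    for hybrid_type, count in counts.most_common():
        writer.writerow([hybrid_type, count])
    return buffer.getvalue()


# Hypothetical type labels, made up for illustration only.
labels = ["miRNA-mRNA", "miRNA-mRNA", "miRNA-lncRNA", "rRNA-rRNA", "miRNA-mRNA"]
csv_text = type_counts_csv(labels)
print(csv_text)
```

The same counts could be passed to matplotlib's `pyplot.pie` to draw the corresponding chart.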

Hybkit API:

Hybkit provides a Python 3 module with a documented API for interacting with records in “.hyb” files. This capability was inspired by the object interactions in the BioPython Project. The primary utility is provided by objects that represent the hyb records within hyb files. These records carry accessible attributes and can be analyzed using built-in methods. For example, the following workflow prints the identifiers of only those sequences in a “.hyb” file that contain a miRNA:

import hybkit
in_file = '/path/to/my_hyb_file.hyb'

# Open a hyb file as a HybFile Object:
with hybkit.HybFile.open(in_file, 'r') as hyb_file:

    # Return each line in a hyb file as a HybRecord object
    for hyb_record in hyb_file:

        # Analyze each record to assign segment types
        hyb_record.find_types()

        # If the record contains an miRNA type, print the record identifier.
        if hyb_record.has_property('segtype_contains', 'miRNA'):
            print(hyb_record.id)

Further documentation on the hybkit API can be found in the hybkit API section of hybkit’s ReadTheDocs.
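
The same workflow can be wrapped in a reusable helper. The sketch below relies only on the calls shown in the example above (`HybFile.open`, `find_types`, `has_property`, and the `id` attribute). The function names `mirna_record_ids` and `mirna_ids_from_file` are hypothetical, and other hybkit releases may name these calls differently.

```python
def mirna_record_ids(records):
    """Yield the id of every record that contains a miRNA segment.

    `records` is any iterable of objects offering find_types(),
    has_property() and id, as the HybRecord objects above do.
    """
    for record in records:
        # Assign segment types before querying them.
        record.find_types()
        if record.has_property('segtype_contains', 'miRNA'):
            yield record.id


def mirna_ids_from_file(path):
    """Collect miRNA-containing record ids from a hyb file on disk."""
    import hybkit  # imported here so the helper above has no dependency

    with hybkit.HybFile.open(path, 'r') as hyb_file:
        return list(mirna_record_ids(hyb_file))
```

Keeping the record logic separate from the file handling lets the generator be exercised with any iterable of record-like objects, for example in unit tests.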

Hybkit is still in beta testing. Feedback and comments are welcome at ds@ufl.edu.

Installation

Hybkit requires Python 3.6 or later and the matplotlib package.

The recommended installation method is via pip, which automatically handles versioning and dependency installation:

$ pip install hybkit

The package can also be obtained by cloning the project’s GitHub repository:

$ git clone https://github.com/RenneLab/hybkit

Or by downloading the zipped package:

$ curl -OL https://github.com/dstrib/hybkit/archive/master.zip
$ unzip master.zip

Then install the package using Python’s setuptools:

$ python setup.py install
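
After installation, the requirements stated above can be checked from Python. This is a minimal sketch using only the standard library. The function name `check_environment` is hypothetical, and the check confirms only that the packages can be found, not that their versions are compatible.

```python
import importlib.util
import sys


def check_environment(min_python=(3, 6), packages=("matplotlib", "hybkit")):
    """Report whether the interpreter and required packages are present."""
    report = {"python_ok": sys.version_info >= min_python}
    for name in packages:
        # find_spec returns None when a top-level package cannot be found.
        report[name] = importlib.util.find_spec(name) is not None
    return report


for item, ok in check_environment().items():
    print(item, "OK" if ok else "missing")
```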

Release history

Version	Released
0.3.3	Sep 5, 2023
0.3.0	Apr 16, 2023
0.2.0	Mar 11, 2020
0.1.10	Mar 11, 2020
0.1.9 (this version)	Mar 11, 2020
0.1.8	Mar 10, 2020

Download files

Source Distribution

hybkit-0.1.9.tar.gz (37.1 MB), uploaded Mar 11, 2020

Built Distribution

hybkit-0.1.9-py3-none-any.whl (37.2 MB), uploaded Mar 11, 2020, Python 3

Hashes for hybkit-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`40924323847fc52fe25dc441c796dff1e60c776de5f7b0cfc0b29ed457a84730`
MD5	`95b9979e4ef7a4bc57dec0ff27da5471`
BLAKE2b-256	`3b314cafb2ff7c1cefb0e15971f33899d0a5800c8b771168053f88b3bc02662e`

Hashes for hybkit-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7c2fe8419182bfe0e3a7188d2b4b5dd98b6e00dc996b2f76b4661de8f533725a`
MD5	`5ecc84782d5f68f17dfd1191ba3d56c9`
BLAKE2b-256	`c0d076a465619d93c744aa20b774600e2ad92af2b26e43aa32e46a2cf470643d`