My python library of classes and functions that help me work

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

lhc-python

This is my personal library of python classes and functions, many of them have bioinformatics applications. The library changes constantly and at a whim. If you want to use it, approach with caution. Over time however, parts appear to be settling on a stable configuration.

lhc.binf

lhc.binf.alignment

A pure Python implementation of the Smith-Waterman local alignment algorithm.

lhc.binf.digen

A C++ and pure Python implementation of sequence generation algorithm. The generated sequence will have a specified dinucleotide frequency.

lhc.binf.genomic_coordinate

An implementation of intervals and points for genomic coordinates. Useful for representing gene models.

lhc.binf.genetic_code

A class to read genetic codes and translate DNA sequences into protein sequences

lhc.binf.iupac

A class to convert protein names between the one and three letter codes and the full name.

lhc.binf.kmer

A class that calculates k-mers for a given sequence. The class behaves likea dict, but calculates new k-mers on the fly.

lhc.binf.skew

A class that calculates skews for a given sequence. The class behaves like a dict, but calculates new skews on the fly.

lhc.collections

Several collections mostly for holding intervals. If only intervals need to be held, use the IntervalTree, otherwise the MultiDimensionMap may be more appropriate.

lhc.filetools

Classes for working with files

lhc.graph

A pure Python implementation of graphs

lhc.indices

Intended to be my own code for indexing files but is still very unstable an immature

lhc.interval

A class for intervals and interval operations

lhc.io

Classes for parsing and working with several file formats

lhc.itertools

Classes for working with iterators

lhc.tools

Various classes, mostly unused and out-of-date

lhc.random

lhc.random.reservoir

An implementation of the reservoir sampling algorithm. Can also be run from the command line to sample lines from files. To sample 50 lines from a file called input_file.txt, run:

python -m lhc.random.reservoir input_file.txt 50

lhc.stats

Really old code. Probably the NIPALS and PCA algorithms are of most use.

lhc.test

Unit tests! These should be mostly up-to-date now.

lhc.tools

lhc.tools.sorter

A sorter for very large iterators. The iterator will be split into chunks which are then sorted individually and then merged into a single file.

lhc.tools.tokeniser

A basic tokeniser. Users define which characters belong to which classes and the tokeniser will split strings into substrings where all characters have the same type.

>>> tokeniser = Tokeniser({'word': 'abcdefghijklmnopqrstuvwxyz',
                       'number': '0123456789',
                       'space': ' \t'})
>>> tokens = tokeniser.tokenise('there were 1000 bottles on the wall')
>>> tokeniser.next()
Token(type='word', value='there')
>>> tokeniser.next()
Token(type='space', value=' ')
>>> tokeniser.next()
Token(type='word', value='were')
>>> tokeniser.next()
Token(type='space', value=' ')
>>> tokeniser.next()
Token(type='number', value='1000')

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

2.5.0

Dec 16, 2021

2.3.1

Aug 10, 2020

2.3.0

Mar 24, 2020

2.2.0

Mar 22, 2020

2.1.8

Mar 9, 2020

2.1.7

Feb 19, 2020

2.1.6

Feb 19, 2020

2.1.5

Feb 19, 2020

2.1.4

Feb 17, 2020

2.1.3

Feb 10, 2020

2.1.2

Feb 10, 2020

2.0.3

Jul 19, 2017

2.0.2

Jul 19, 2017

2.0.1

Jul 19, 2017

2.0.0

Jul 19, 2017

1.3.8

Dec 7, 2016

1.3.7

Dec 6, 2016

1.3.6

Oct 15, 2016

1.3.5

Oct 14, 2016

1.3.4

Oct 13, 2016

1.3.3

Oct 6, 2016

1.3.2

Sep 30, 2016

1.3.1

Sep 29, 2016

1.3.0

Sep 29, 2016

1.2.6

Sep 28, 2016

1.2.5

Sep 28, 2016

1.2.4

Sep 28, 2016

1.2.3

Sep 27, 2016

1.2.2

Sep 26, 2016

1.2.1

Sep 19, 2016

1.1.6

Jul 19, 2016

1.1.5

May 9, 2016

1.1.4

May 4, 2016

1.1.3

May 4, 2016

1.1.2

May 3, 2016

1.1.1

May 3, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lhc-python-2.5.0.tar.gz (95.7 kB view hashes)

Uploaded Dec 16, 2021 Source

Hashes for lhc-python-2.5.0.tar.gz

Hashes for lhc-python-2.5.0.tar.gz
Algorithm	Hash digest
SHA256	`d947abe90ecc5e67aa13bb42ea1a4cb584eb4f4def2a4a983e1b5ac8a78359ea`
MD5	`c168bfcbb919544809796a13f84a2ff4`
BLAKE2b-256	`4b24689dfb21592cdbb2be8bf2875ff9bd6e1415977c72c59ba9aa31a2361b1a`