skip to navigation
skip to content

ucscgenome 0.1

Simple access to the reference genomes at UCSC

The package allows to easily download and use genomic data at UCSC. This is essentially a thin “caching” wrapper around the twobitreader library.


The simplest way to install the package is via easy_install or pip:

$ easy_install ucscgenome


  • twobitreader


The primary usage example is the following:

from ucscgenome import Genome
g = Genome('sacCer2')
print str(g['chrI'][0:100])

On the second line of the above example the following steps are performed:

  • The local cache directory is searched for the pre-downloaded genome data. If the data is readily available, it is opened for reading.
  • If there is not cached version of the sacCer2 genome, it is downloaded from the UCSC site to the cache directory.
  • The local data is downloaded and stored using the compact 2bit format.

You can configure the details of the procedure by providing additional options to the Genome constructor:

g = Genome('hg19', cache_dir='my_genomes', use_web=False)

which means that the genome data is to be searched for in the ./my_genomes directory and in no case should a download be attempted, or:

g = Genome('hg19', source_url_pattern='')

which means that the genomic data is to be downloaded from your own server rather than UCSD’s.

See also

File Type Py Version Uploaded on Size (md5) Source 2013-06-12 8KB