Skip to main content

Multimodal Datasets in MuData format

Project description

Multimodal Datasets

mudatasets provides some public datasets with multimodal data, primarily focusing on multimodal omics datasets.

MuData library | MuData documentation

Installation

PyPi version

# Stable, with muon
pip install "mudatasets[muon]"
# Dev
pip install git+https://github.com/gtca/mudatasets

Getting started

import mudatasets as mds

Find available datasets

mds.list_datasets()

Load a dataset

mdata = mds.load("pbmc3k_multiome")
print(mdata)

Some common attributes for .load() are:

  • data_dir= for location to save the dataset (~/mudatasets/ by default)
  • with_info=True for also returning the second argument with dataset description as a dictionary (False by default)
  • backed=True for reading data in a backed format, only for .h5mu and .h5ad files (True by default)
  • files= for downloading specific files from the dataset
  • full=True for downloading all the files defined for the dataset (False by default)

Get dataset info

mds.info("pbmc3k_multiome")

List dataset file names

mds.list_files("pbmc3k_multiome")

Webpage with all the files

mds.serve_webpage(port=8000)

This command will launch a server providing a simple (temporarily created) HTML page at http://localhost:8000 with files across all of the datasets listed.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mudatasets-0.0.2.tar.gz (9.5 kB view hashes)

Uploaded Source

Built Distribution

mudatasets-0.0.2-py3-none-any.whl (12.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page