Skip to main content

An python client for communicating with the transmart rest api.

Project description

The tranSMART API Training given on [2015 tranSMART Foundation Annual Meeting in Amsterdam](http://transmartfoundation.org/2015-annual-meeting/)

The python client created for that project has now been recovered to work with version 2 of the tranSMART API.

This is meant to work with Python 3.x only.

#tranSMART RESTful API (v1) documentation

### Available calls for the tranSMART RESTful API

The following is a list of all HTTP requests that can be made to a tranSMART RESTful API. All URI’s are relative to your tranSMART server’s URI (eg. http://some.transmart.server/transmart).

The return message’s body will contain JSON with additional HAL format, with the exception of the request for high dimensional data (see section below).

Call | HTTP request | Description |
— | — | — |
get all studies | GET /studies | returns information on all available studies |
get single study | GET /studies/{studyId} | returns information on a single study |
get all concepts | GET /studies/{studyId}/concepts | returns information on all concepts for a study |
get single concept | GET /studies/{studyId}/concepts/{conceptPath} | returns information on one concept for a study |
get all subjects | GET /studies/{studyId}/subjects | returns information on all subjects for a study |
get single subject | GET /studies/{studyId}/subjects/{subjectId} | returns information on one subject for a study |
get subjects with concept | GET /studies/{studyId}/concepts/{conceptPath}/subjects | returns the subjects which have data for this concept |
get all observations | GET /studies/{studyId}/observations | returns all clinical observation values for a study |
get observations subj to certain criteria | GET /observations | returns all clinical observation according to some filters. These are patient_sets (list of result instance ids or single patient set id (integer); cannot be combined with patients), patients (list of patient ids or single patient id (integer); cannot be combined with patient_sets; avoid specifying a large list for performance reasons), concept_paths (list of concepts paths or single concept path (string); mandatory). Multiple values are specified in the usual fashion for url parameters. There is one extra parameter: variable_type – this specifies the type of ClinicalVariable to create (defaults to normalized_leafs_variable) |
get observations for single concept | GET /studies/{studyId}/concepts/{conceptPath}/observations | returns clinical observation values for one concept for a study |
create patient set | POST /patient_sets | body should be query definition in a subset of i2b2’s XML schema. Response is the same as for GET /patient_sets/{result_instance_id} |
get single patient set | GET /patient_sets/{resultInstanceId} | returns information a previously created patient set |
get index of highdim data for single concept| GET /studies/{studyId}/concepts/{conceptPath}/highdim | returns index with the available datatype and projections, assay constraints and data constraints for this highdim concept for a study |
get highdim data for single concept| GET /studies/{studyId}/concepts/{conceptPath}/highdim?dataType={dataType}&projection={projectionType}&assayConstraints={assayConstraints}&dataConstraints={dataConstraints} | returns highdim data of a specific dataType and projection for one concept of a study |

#### Explanation of URI variables | variable | explanation | | — | — | | {studyId} | The id of the study, as returned by the /studies call. | | {conceptPath} | A path that defines the concept within a study. This is similar to the concept path as defined within tranSMART, but without the initial part that defines the study path (and with necessary character conversion to make it compatible with URI syntax). The safest and most robust method of obtaining this value is by making use of the embedded links in the /studies/{studyId}/concepts result. | | {subjectId} | A unique subject identifier, as returned by /studies/{studyId}/subjects call. | | {dataType} | High dimensional concepts can be of several types, depending on what your tranSMART version supports. Possible data type options are contained in the highdim index returned by the /studies/{studyId}/concepts/{conceptPath}/highdim call. | | {projectionType} | High dimensional data can have values stored in a variety of projections. Possible projection options are contained in the highdim index returned by the /studies/{studyId}/concepts/{conceptPath}/highdim call. If not specified, it defaults to ‘default_real_projection’ or, if not supported ‘all_data’. It is strongly recommended that the user include a value for this parameter and do not rely on the defaults. | | {assayConstraints} | Assay constraints limit the assays included in the response. This is a urlencoded JSON object. The keys are the names of the constraints and each value is either 1) an object where the keys are the corresponding constraint’s parameters and the values are the parameters’ values or 2) an array of such objects. The parameters for the constraints are not described in the API. Look at core-api’s documentation; for the standard constraints, see the [AssayConstraint](https://github.com/transmart/transmart-core-api/blob/master/src/main/groovy/org/transmartproject/core/dataquery/highdim/assayconstraints/AssayConstraint.groovy) class. | | {dataConstraints} | Analogous to {assayConstraints}, except it limits the rows in the high dimensional result. For the standard data constraints, see [DataConstraint](https://github.com/transmart/transmart-core-api/blob/master/src/main/groovy/org/transmartproject/core/dataquery/highdim/dataconstraints/DataConstraint.groovy) |

#### HTTP exchange details Each of the above GET requests needs two header fields set:

  1. Authentication: if your tranSMART server requires OAuth authentication (see section on OAuth below), then this header needs to have a value set to Bearer {accessToken}.

  2. accept: the return message will contain a body that contains JSON with HAL format. This needs to be specifically requested by setting this field to application/hal+json. Otherwise, plain JSON will be returned, and you will miss out on all the HAL explorable goodness.

Here is an example HTTP request for the /studies call:

GET /transmart/studies HTTP/1.1 Host: some.transmart.server Authorization: Bearer 12345-abcde accept: application/hal+json

### High dimensional protobuf data To facilitate the many variants of high dimensional data formats to be returned by tranSMART, we have chosen to use Google’s protobuf solution. Whereas all other calls return a body containing JSON with additional HAL format, the request for high dimensional data returns a binary protobuf stream.

This binary stream can be parsed using the protobuf library, the implementation of which depends on the libraries available for your client application. The exact structure of the incoming data is defined in the highdim.proto file, which is available in the src/protobuf/highdim folder of the transmart-rest-api repository. This proto file must be included in your client application’s resources and loaded by your protobuf instance, for it allows the correct parsing of the messages contained in the binary stream.

There are two types of messages contained in the binary stream, which need to be parsed differently. The first message contains header information, and all subsequent messages will each contain information on one row of your dataset.

The header message primarily contains the definition of all assays in your dataset, for which the subsequent row messages will contain the values. An assay is usually equivalent to a single subject, but please see the highdim.proto file for what else is definable per assay (under message Assay). In addition to the assay information, the header message also defines the name and type for each of the returned values in the row messages (under message ColumnSpec)

Each row message contains one ColumnValue item per assay, and the order is identical to the order of the assay definitions in the header. The content of this item is defined in the highdim.proto file under message ColumnValue, and it can contain multiple values (eg. z-score and log-value), for which the name and type is defined in the header (under message ColumnSpec).

NOTE: If there is no measurement taken for a combination of assay and probe then NaN would be used as the value. This workaround was made later to avoid changing the protobuf declaration, so we could still use compressed arrays of primitive data types, but at the same time mark absence of a value.

Example code for how to parse the protobuf binary stream into something sensible, please see the [transmartRClient](https://github.com/thehyve/RInterface/blob/transmartRClient/transmartRClient/R/getHighdimData.R#L50).

### Authentication with OAuth

This plugin does not implement any authentication scheme. However, it is usually deployed together with transmartApp with configuration that secures this plugin’s resources with the [OAuth2 Spring Security provider](https://grails.org/plugin/spring-security-oauth2-provider). The best workflow to use will depend on the type of client.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transmart-0.1.1.tar.gz (13.1 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page