Skip to main content

Liability Estimation in Case Control Studies

Project description

# Dependencies: #

LEAP requires Python 2.7, and is dependent on the FastLMM Python package. Please make sure these are installed prior to using LEAP. The easiest way to install LEAP is via pip, by typing the command: “pip install –user leap_gwas”. This should also install all dependencies. Typically, the LEAP scripts will be installed at ~/.local/lib/python2.7/site-packages/LEAP/

# Usage instructions: #

LEAP is invoked though a series of Python scripts, as detailed below. The script leapUtils.sh runs the full LEAP pipeline on a small example dataset, and can be used for reference.

Generally, LEAP uses the same file formats as FastLMM. Namely, input files are in binary Plink format (http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#bed). When there is a contradiction between file formats used by Plink and by FastLMM, LEAP uses the convention adopted by FastLMM. Explanations about the parameters used by all the scripts can be seen by typing “python <script_name> –help”.

The LEAP pipeline includes the following stages:

1. (optional): Find related individuals to be removed: python findRelated.py –bfilesim <Plink base file> –out <output file>

This script creates a file marking the individuals that need to be removed to eliminate relatedness

2. Compute heritability using the method of Golan and Rosset: python calc_h2.py –bfilesim <Plink base file> –extractSim <SNPs used for heritability estimation> –prev <prevalence> –pheno <phenotype file> –related <relatedness file>

This script outputs the heritability estimate

3. Estimate liabilities: python probit.py –bfilesim <Plink base file> –pheno <phenotype file> –prev <prevalence> –extractSim <SNPs used in the heritability estimation> –out <output base file> –related <relatedness file> –h2 <heritability>

This script creates a file called <output base file>.liabs, with estimated liabilities for every individual. The estimated liabilities can be used directly for GWAS by using them as a standard phenotype file.

4. Compute GWAS: python leap_gwas.py –bfilesim <Plink base file> –pheno <estimated liabilities file> –extractSim <SNPs used in the LMM kinship matrix> –out <output file> –h2 <heritability> –bfile <Plink file with tested SNPs> –extract <SNPs to test>

This script performs GWAS with a prespecified h2. The syntax largely follows that of FaSTLMM C++ version.

General comments and tips

1. Fixed effects can be included in the thresholds, liability estimation and GWAS stages. Please type “python probit.py –help” and “leap_gwas.py” for instructions

  1. As described in the main text, it is recommended to perform a different liability estimation for every excluded chromosome, and then testing the SNPs on the excluded chromosome for association with the estimated liabilities. The “-extractSim” flag is useful for this. Please see the example file “leap_pipeline.sh” for a usage example.

3. A complete end-to-end usage example is provided with the LEAP source files, and can be invoked via the script “leap_pipeline.sh”. This example estimates liabilities for a small balanced case-control dataset. The dataset was simulated with 50% heritability and 0.1% prevalence. It included 500 cases, 500 controls, 499 causal SNPs, 100 unusually differentiated SNPs and 10000 SNPs differentiated with FST=0.01. Causal SNPs are called “csnp<i>”, and unusually differentiated SNPs are called “dsnp<i>”. The original liabilities for this file are available in the file dataset1.phe.liab (but this file is not used by LEAP).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

leap_gwas-0.1.tar.gz (3.0 MB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page