Skip to main content

Automation of computations in quantum chemistry

Project description

https://img.shields.io/github/license/SCM-NV/qmflows.svg?maxAge=2592000 https://travis-ci.org/SCM-NV/qmflows.svg?branch=master https://img.shields.io/badge/python-3.5-blue.svg https://api.codacy.com/project/badge/Grade/7e95b6d4d873458daf0de5e4b3110885 https://api.codacy.com/project/badge/Coverage/7e95b6d4d873458daf0de5e4b3110885

QMFlows

See http://scm-nv.github.io/qmflows/ for tutorials and documentation.

Motivation

Research on modern computational quantum chemistry relies on a set of computational tools to carry out calculations. The complexity of the calculations usually requires intercommunication between the aforementioned tools, such communication is usually done through shell scripts that try to automate input/output actions like: launching the computations in a cluster, reading the resulting output and feeding the relevant numerical result to another program. Such scripts are difficult to maintain and extend, requiring a significant programming expertise to work with them. Being then desirable a set of automatic and extensible tools that allows to perform complex simulations in heterogeneous hardware platforms.

This library tackles the construction and efficient execution of computational chemistry workflows. This allows computational chemists to use the emerging massively parallel compute environments in an easy manner and focus on interpretation of scientific data rather than on tedious job submission procedures and manual data processing.

Description

This library consists of a set of modules written in Python 3.5 to automate the following tasks:

  1. Input generation.

  2. Handle tasks dependencies (Noodles).

  3. Advanced molecular manipulation capabilities with (rdkit).

  4. Numerical data storage and manipulation (HDF5).

  5. Jobs failure detection and recovery.

  6. Distribution in heterogeneous hardware platforms.

Tutorial and Examples

A tutorial written as a jupyter-notebook is available from: tutorial-qmflows. You can also access direclty more advanced examples.

Installation

  • Download miniconda for python >= 3.5: miniconda (also you can install the complete anaconda version).

  • Install according to: installConda.

  • Reopen terminal (or type source ~/.bashrc).

  • Create a new virtual environment using the following commands:

    • conda create -n qmflows python=3.5

  • Activate the new virtual environment

    • source activate qmflows

To exit the virtual environment type source deactivate.

Dependencies installation

Using the conda environment the following packages should be installed:

  • install rdkit using the following command:

    • conda install -y -q --name qmflows -c https://conda.anaconda.org/rdkit rdkit

  • install HDF5 using conda:

    • conda install -y -q --name qmflows -c anaconda h5py

Package installation

  • Type in your terminal,

    source activate qmflows

  • Then

    pip install qmflows --upgrade

Now you are ready to use qmflows.

Latest stable version

If you want to install the latest stable version directly from github you can use the following command:

pip install https://github.com/SCM-NV/qmflows/tarball/master#egg=qmflows

Notes:

  • Once the libraries and the virtual environment are installed, you only need to type source activate qmflows each time that you want to use the software.

Remote/Xenon setup

Qmflows supports running jobs over a variety of cluster computing schedulers like Slurm and Torque. You program and run your workflows from your laptop, but the jobs are run at the remote site. For this to work you need to setup Qmflows both locally and remotely. In addition you need to add a Bash script that loads the VirtualEnv and starts the Noodles remote worker. This remote worker acts as a pilot job, reading job descriptions from input and returning the results. If you defined the remote VirtualEnv with the name qmflows, the following Bash script gives an idea of what you need:

#!/bin/bash
# comment/uncomment lines that you need

# If you need ADF, and it is available in a module
module load adf/2016.102
# or if you installed it yourself
# ADFHOME=${HOME}/.local/opt/adf
# source ${ADFHOME}/bin/adfrc.sh

# Point PLAMS to its place
export PLAMSDEFAULTS="${HOME}/.local/src/plams/utils/plams_defaults.py"

# Go to the directory that contains this script
cd "$(dirname "${BASH_SOURCE[0]}")"

# Activate the VirtualEnv
source activate qmflows

# Start the remote worker
python -m noodles.worker ${@:2}

# Bye!
source deactivate

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qmflows-0.2.1.tar.gz (52.8 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page