A scheduler for resource-aware parallel computing on clusters.

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
Framework
- Trio
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3 :: Only
- Python :: 3.6
Topic
- Scientific/Engineering
- System :: Distributed Computing

Project description

Grain

A scheduler built with trio for resource-aware parallel computing on clusters.

TL;DR

Three core functions for you to run async jobs in an arbitary mix of parallel and sequential manner.

# Jobs/subtasks inside a waitgroup run parallelly
async with grain.open_waitgroup() as wg:

    # Put a job onto the waitgroup to be executed
    wg.submit(resource, fn, *args, **kwargs)

    # Put a subtask onto the waitgroup. Submit jobs / 
    # start other subtasks inside the subtask.
    wg.start_subtask(vfn, *args, **kwargs)

# Waitgroup blocks here until all of its jobs are done,
# so outside a waitgroup is essentially sequencial.

results = wg.results # sorted in the order of submission


# Execute one job sequentially
result = await grain.exec1(resource, fn, *args, **kwargs)

Entrypoint:

async def main(): # top-level subtask
    # Submit jobs / start subtasks here
grain.run_combine(main, [worker1_addr, worker2_addr, ...], resource_per_worker)
# ... Or for top-level parallelism, ...
#grain.run_combine([main1, main2, ...], ...)

Check out example for complete demos / more patterns and configuration sample.

Resource-awareness

Every job in the job queue has a resource request infomation along with the job to run. Before the executor run each job, it queries each worker for resource availability. If resource is insufficient, the job queue is suspended until completed jobs return resources. Resources can be CPU cores, virtual memory, both, (or anything user defined following interface grain.resource.Resource).

Every time a job function runs, it has access to grain.GVAR.res, a context-local variable giving the information of specific resource dedicated to the job. (e.g. if a job is submitted with CPU(3), asking for 3 cores, it might receive allocation like CPU([6,7,9]).)

Executor, Workers and communication

The top-level APIs (i.e. "combine") are built upon an executor-like backend called grain.GrainExecutor. It schedules and dispatches jobs to workers, and it maintains a single job queue and a result queue. The executor usually runs on the head node in a cluster.

Workers, one per node, simply receive async functions (i.e. jobs) from the executor and run them. Executor and workers use socket for communication, and dill serializes the functions to byte payloads.

Acknowledgement

The API of Grain is largely insipred by structured concurrency, a major design principle behind Trio, and it is specifically inspired by the API of Trio. And of course, Grain uses Trio internally.

Caveat

Relative import (import not on Python package path) should be within the job function. Global reference fails in this case.

Project details

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
Framework
- Trio
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3 :: Only
- Python :: 3.6
Topic
- Scientific/Engineering
- System :: Distributed Computing

Release history Release notifications | RSS feed

0.17.0

Jul 18, 2023

0.17.0b3 pre-release

Nov 9, 2022

0.17.0b2 pre-release

Oct 19, 2022

0.17.0b1 pre-release

Jun 14, 2022

0.16.2

May 30, 2022

0.16.1

Apr 30, 2022

0.16.0

Jan 17, 2022

0.15.2

Sep 6, 2021

0.15.1

Jun 18, 2021

0.15.0

Jun 7, 2021

0.14.0

Nov 23, 2020

0.13.1

Jun 25, 2020

0.13.0

Jun 6, 2020

This version

0.12.1

May 19, 2020

0.12.0

Apr 21, 2020

0.11.0

Apr 1, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grain-scheduler-0.12.1.tar.gz (26.5 kB view hashes)

Uploaded May 19, 2020 Source

Built Distribution

grain_scheduler-0.12.1-py3-none-any.whl (32.8 kB view hashes)

Uploaded May 19, 2020 Python 3

Hashes for grain-scheduler-0.12.1.tar.gz

Hashes for grain-scheduler-0.12.1.tar.gz
Algorithm	Hash digest
SHA256	`0fbe579f9f8cf89bc7d4c0dcb883992d92059702fd0621ea772760387a89b784`
MD5	`364d3162757e3d9a21323cd5f67caa70`
BLAKE2b-256	`3fad93ac420ecdeed21f790f43119e724990214cbcc1e377168675c5712dea91`

Hashes for grain_scheduler-0.12.1-py3-none-any.whl

Hashes for grain_scheduler-0.12.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7319fb4d7df9509a14981aea0de6f66505d4b64e71eaa20f542aab4c5b32597e`
MD5	`4b0618c057485688f831a569385daa92`
BLAKE2b-256	`76e5130e663a8df993157ef91f24fde2b0ed2ddb6fd4998075f22e689e5ad866`