Python gRPC functions for the Rainbow Scheduler

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: Apache Software License
Operating System
- Unix
Programming Language
- C
- Python
- Python :: 3.7
Topic
- Scientific/Engineering
- Software Development

Project description

rainbow (python)

🌈️ Where keebler elves and schedulers live, somewhere in the clouds, and with marshmallows

This is the rainbow scheduler prototype, specifically Python bindings for a gRPC client. To learn more about rainbow, visit https://github.com/converged-computing/rainbow.

Example

Assuming that you can run the server with Go, let's first do that (e.g., from the root of the repository linked above, and soon we will provide a container):

Register

make server

go run cmd/server/server.go
2024/02/12 19:38:58 creating 🌈️ server...
2024/02/12 19:38:58 ✨️ creating rainbow.db...
2024/02/12 19:38:58    rainbow.db file created
2024/02/12 19:38:58    create cluster table...
2024/02/12 19:38:58    cluster table created
2024/02/12 19:38:58    create jobs table...
2024/02/12 19:38:58    jobs table created
2024/02/12 19:38:58 starting scheduler server: rainbow v0.1.0-draft
2024/02/12 19:38:58 server listening: [::]:50051

And then let's do a registration, but this time from the Python bindings (client) here! We will use the core bindings in rainbow/client.py but run a custom command from examples. Assuming you've installed everything into a venv:

python -m venv env
source env/bin/activate
pip install -e .

The command below will register and save the secret to a new configuration file. Note that if you provide an existing one, it will use or update it.

python ./examples/flux/register.py keebler --config-path ./rainbow-config.yaml

Saving rainbow config to ./rainbow-config.yaml
🤫️ The token you will need to submit jobs to this cluster is rainbow
🔐️ The secret you will need to accept jobs is 649598a9-e77b-4aa3-ab46-bfbbc5e2d606

Try running it again - you can't register a cluster twice. But of course other cluster names you can register. A "cluster" can actually be a cluster, or a flux instance, or any entity that can accept jobs. The script also accepts arguments (see register.py --help)

python ./examples/flux/register.py --help

🌈️ Rainbow scheduler register

options:
  -h, --help            show this help message and exit
  --cluster CLUSTER     cluster name to register
  --host HOST           host of rainbow cluster
  --secret SECRET       Rainbow cluster registration secret
  --config-path CONFIG_PATH
                        Path to rainbow configuration file to write or use
  --cluster-nodes CLUSTER_NODES
                        Nodes to provide for registration

Register Subsystem

Let's now register the subsystem. Akin to register, this has the path to the subsystem nodes set as a default, and the name --subsystem set to "io." This assumes you've registered your cluster and have the cluster secret in your ./rainbow-config.yaml

python ./examples/flux/register-subsystem.py keebler --config-path ./rainbow-config.yaml

status: REGISTER_SUCCESS

In the server window you'll see the subsystem added:

...
2024/03/09 14:21:50 📝️ received subsystem register: keebler
2024/03/09 14:21:50 Preparing to load 6 nodes and 30 edges
2024/03/09 14:21:50 We have made an in memory graph (subsystem io) with 7 vertices, with 15 connections to the dominant!
{
 "keebler": {
  "Name": "keebler",
  "Counts": {
   "io": 1,
   "mtl1unit": 1,
   "mtl2unit": 1,
   "mtl3unit": 1,
   "nvme": 1,
   "shm": 1
  }
 }
}

Submit Job (Simple)

Now let's submit a job to our faux cluster. We need to provide the token we received above. Remember that this is a two stage process:

Query the graph database for one or more cluster matches.
Send that request to rainbow.

The client handles both, so you (as the user) only are exposed to the single submit. We will be providing basic arguments for the job, but note you can provide other arguments too:

python ./examples/flux/submit-job.py --help

🌈️ Rainbow scheduler submit

positional arguments:
  command               Command to submit

options:
  -h, --help            show this help message and exit
  --config-path CONFIG_PATH
                        config path with cluster names
  --host HOST           host of rainbow cluster
  --token TOKEN         Cluster token for permission to submit jobs
  --nodes NODES         Nodes for job (defaults to 1)

And then submit! Remember that you need to have registered first. Note that we need to provide our cluster config path.

$ python examples/flux/submit-job.py --config-path ./rainbow-config.yaml --nodes 1 echo hello world
```bash
```console
{
    "version": 1,
    "resources": [
        {
            "type": "node",
            "count": 1,
            "with": [
                {
                    "type": "slot",
                    "count": 1,
                    "label": "echo",
                    "with": [
                        {
                            "type": "core",
                            "count": 1
                        }
                    ]
                }
            ]
        }
    ],
    "tasks": [
        {
            "command": [
                "echo",
                "hello",
                "world"
            ],
            "slot": "echo",
            "count": {
                "per_slot": 1
            }
        }
    ],
    "attributes": {}
}
clusters: "keebler"
status: RESULT_TYPE_SUCCESS

status: SUBMIT_SUCCESS

Submit Jobspec

We can also submit a jobspec directly, which is an advanced use case. It works predominantly the same, except we load in the Jobspec from the yaml directly.

python examples/flux/submit-jobspec.py --config-path ./rainbow-config.yaml ../../docs/examples/scheduler/jobspec-io.yaml

🌈️ Rainbow scheduler submit

positional arguments:
  jobspec               Jobspec path to submit

options:
  -h, --help            show this help message and exit
  --config-path CONFIG_PATH
                        config path with cluster metadata

It largely looks the same - I'll cut most of it out. It's just a different entry point for the job definition.

clusters: "keebler"
status: RESULT_TYPE_SUCCESS

status: SUBMIT_SUCCESS

Receive Jobs

After we submit jobs, rainbow assigns them to a cluster. For this dummy example we are assigning to the same cluster (keebler) so we can also use our host "keebler" to receive the job. Here is what that looks like.

python ./examples/flux/receive-jobs.py --help

🌈️ Rainbow scheduler receive jobs

options:
  -h, --help            show this help message and exit
  --max-jobs MAX_JOBS   Maximum jobs to request (unset defaults to all)
  --config-path CONFIG_PATH
                        config path with cluster metadata

And then request and accept jobs:

 python examples/flux/receive-jobs.py --config-path ./rainbow-config.yaml
Status: REQUEST_JOBS_SUCCESS
Received 1 jobs to accept...

If this were running in Flux, we would be able to run it, and the response above has told rainbow that you've accepted it (and rainbow deletes the record of it).

License

HPCIC DevTools is distributed under the terms of the MIT license. All new contributions must be made under this license.

See LICENSE, COPYRIGHT, and NOTICE for details.

SPDX-License-Identifier: (MIT)

LLNL-CODE- 842614

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: Apache Software License
Operating System
- Unix
Programming Language
- C
- Python
- Python :: 3.7
Topic
- Scientific/Engineering
- Software Development

Release history Release notifications | RSS feed

0.0.14rc1 pre-release

Mar 21, 2024

0.0.14rc0 pre-release

Mar 17, 2024

This version

0.0.13

Mar 15, 2024

0.0.12

Mar 9, 2024

0.0.1

Feb 14, 2024

0.0.0

Feb 13, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rainbow-scheduler-0.0.13.tar.gz (19.4 kB view hashes)

Uploaded Mar 15, 2024 Source

Built Distribution

rainbow_scheduler-0.0.13-py3-none-any.whl (26.0 kB view hashes)

Uploaded Mar 15, 2024 Python 3

Hashes for rainbow-scheduler-0.0.13.tar.gz

Hashes for rainbow-scheduler-0.0.13.tar.gz
Algorithm	Hash digest
SHA256	`9264220cd7098d4d9efff081a87391a83653bd079796be01abe3f8210cb47b34`
MD5	`6b9405ec781a7782079a1ca081acc423`
BLAKE2b-256	`bd45b0cfb2a91390e8a46f07fa0c164557157c9b1df2741a4f37635814460ae0`

Hashes for rainbow_scheduler-0.0.13-py3-none-any.whl

Hashes for rainbow_scheduler-0.0.13-py3-none-any.whl
Algorithm	Hash digest
SHA256	`30cf51bbd8747b0c348207f1381b626a1185ecde961db65bb1ec604f55882c8d`
MD5	`cdb2d68422df274f3cfc8a457773871b`
BLAKE2b-256	`a8b1648087c5ffe9c2ddafc371277c0e1ddc5e4053ba3c6e6a6a5acfbf4fe657`