Project description

optuna-distributed

An extension to Optuna which makes distributed hyperparameter optimization easy, and keeps all of the original Optuna semantics. Optuna-distributed can run locally, by default utilising all CPU cores, or can easily scale to many machines in Dask cluster.

Note

Optuna-distributed is still in the early stages of development. While core Optuna functionality is supported, few missing APIs (especially around Optuna integrations) might prevent this extension from being entirely plug-and-play for some users. Bug reports, feature requests and PRs are more than welcome.

Features

Asynchronous optimization by default. Scales from single machine to many machines in cluster.
Distributed study walks and quacks just like regular Optuna study, making it plug-and-play.
Compatible with all standard Optuna storages, samplers and pruners.
No need to modify existing objective functions.

Installation

pip install optuna-distributed

Optuna-distributed requires Python 3.7 or newer.

Basic example

Optuna-distributed wraps standard Optuna study. The resulting object behaves just like regular study, but optimization process is asynchronous. Depending on setup of Dask client, each trial is scheduled to run on available CPU core on local machine, or physical worker in cluster.

Note

Running distributed optimization requires a Dask cluster with environment closely matching one on the client machine. For more information on cluster setup and configuration, please refer to https://docs.dask.org/en/stable/deploying.html.

import random
import time

import optuna
import optuna_distributed
from dask.distributed import Client


def objective(trial):
    x = trial.suggest_float("x", -100, 100)
    y = trial.suggest_categorical("y", [-1, 0, 1])
    # Some expensive model fit happens here...
    time.sleep(random.uniform(1.0, 2.0))
    return x**2 + y


if __name__ == "__main__":
    # client = Client("<your.cluster.scheduler.address>")  # For distributed optimization.
    client = Client()  # For local asynchronous optimization.
    study = optuna_distributed.from_study(optuna.create_study(), client=client)
    study.optimize(objective, n_trials=10)
    print(study.best_value)

But there's more! All of the core Optuna APIs, including storages, samplers and pruners are supported!

What's missing?

Support for callbacks and Optuna integration modules.
Study APIs such as study.stop can't be called from trial at the moment.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.6.1

Jul 28, 2023

0.6.0

Jul 26, 2023

0.5.0

May 1, 2023

0.4.0

Jan 25, 2023

0.3.0

Jan 17, 2023

0.2.0

Nov 11, 2022

This version

0.1.1

Oct 26, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

optuna-distributed-0.1.1.tar.gz (30.3 kB view hashes)

Uploaded Oct 26, 2022 Source

Built Distribution

optuna_distributed-0.1.1-py3-none-any.whl (29.9 kB view hashes)

Uploaded Oct 26, 2022 Python 3

Hashes for optuna-distributed-0.1.1.tar.gz

Hashes for optuna-distributed-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`ba8806855d234542440d64d1e140cc54d7a1731691f02c9bafd0c8cc79699791`
MD5	`8836c1d14458cd080cf0aac57b973cd3`
BLAKE2b-256	`7cacc9a0630c7a6687e23400fbb079d4cc537eb86d28763fb1355accd6fcce43`

Hashes for optuna_distributed-0.1.1-py3-none-any.whl

Hashes for optuna_distributed-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bcaf0255dbb7bf7ed1fe476974bec4045c78195842514abce2e73df06d391476`
MD5	`06da8167437cab2c7337726ee75ab42c`
BLAKE2b-256	`381517d4512246b178f96d20ac9ece8d734a13799f502ce3ac90b487b288b205`