evostra

Evolution Strategy Solver in Python

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Evostra: Evolution Strategy for Python
--------

Evolutio Strategy (ES) is an optimization technique based on ideas of adaptation and evolution.
You can learn more about it at https://blog.openai.com/evolution-strategies/

Installation
--------
It's compatible with both python2 and python3.

Install from source:

.. code-block:: bash

$ sudo python setup.py install

Install from PyPI:

.. code-block:: bash

$ sudo pip install evostra

(You may need to use python3 or pip3 for python3)

Usage
--------

The input weights of the EvolutionStrategy module is a list of arrays (one array with any shape for each layer of the neural network), so we can use any framework for builing the model and just pass the weights to ES.

Here we use Keras to build the model and we pass its weights to ES.

.. code:: python

from evostra import EvolutionStrategy
from keras.models import Model, Input
from keras.layers import Dense
from keras.optimizers import Adam # not important as there's no training here.
import numpy as np

input_layer = Input(shape=(5,1))
layer = Dense(8)(input_layer)
output_layer = Dense(3)(layer)
model = Model(input_layer, output_layer)
model.compile(Adam(), 'mse')

Now we define our get_reward function:

.. code:: python

solution = np.array([0.1, -0.4, 0.5])
inp = np.asarray([[1,2,3,4,5]])
inp = np.expand_dims(inp, -1)

def get_reward(weights):
global solution, model, inp
model.set_weights(weights)
prediction = model.predict(inp)[0]
# here our best reward is zero
reward = -np.sum(np.square(solution - prediction))
return reward

Now we can build the EvolutionStrategy object and run it for some iterations:

.. code:: python

es = EvolutionStrategy(model.get_weights(), get_reward, population_size=50, sigma=0.1, learning_rate=0.001)
es.run(1000, print_step=100)

Here's the output:

.. code::

iter 0. reward: -68.819312
iter 100. reward: -0.218466
iter 200. reward: -0.110204
iter 300. reward: -0.089003
iter 400. reward: -0.078224
iter 500. reward: -0.063891
iter 600. reward: -0.049090
iter 700. reward: -0.027701
iter 800. reward: -0.013094
iter 900. reward: -0.009140

Now we have the optimized weights and we can update our model:

.. code:: python

optimized_weights = es.get_weights()
model.set_weights(optimized_weights)

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2.5.2

May 31, 2018

2.5.1

May 11, 2018

2.5.0

May 11, 2018

2.0

May 10, 2018

This version

1.0.1

Jun 16, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

evostra-1.0.1-py2.py3-none-any.whl (5.6 kB view hashes)

Uploaded Jun 16, 2017 Python 2 Python 3

Hashes for evostra-1.0.1-py2.py3-none-any.whl

Hashes for evostra-1.0.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`865c40fe1ea16644676abc0d76af7fb20a5f601271f3f086bbb7bfc8f1d8ac31`
MD5	`7d476603f452debc84327fcc75f2e043`
BLAKE2b-256	`568457d641e603d6e5e824c9c2bb6b9d91dd9044aad7cd157ec8f37b8f4b46f6`