tfrddlsim

RDDL2TensorFlow parser, compiler, and simulator.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: GNU General Public License v3 (GPLv3)
Natural Language
- English
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Project description

# tf-rddlsim [![Build Status](https://travis-ci.org/thiagopbueno/tf-rddlsim.svg?branch=master)](https://travis-ci.org/thiagopbueno/tf-rddlsim) [![Documentation Status](https://readthedocs.org/projects/tf-rddlsim/badge/?version=latest)](https://tf-rddlsim.readthedocs.io/en/latest/?badge=latest) [![License](https://img.shields.io/aur/license/yaourt.svg)](https://github.com/thiagopbueno/tf-rddlsim/blob/master/LICENSE)

RDDL2TensorFlow compiler and trajectory simulator in Python3.

# Quickstart

```text
$ pip3 install tfrddlsim
```

# Usage

tf-rddlsim can be used as a standalone script or programmatically.

## Script mode

```text
$ usage: tfrddlsim [-h] (--file FILE | --rddl RDDL) [--policy {default,random}]
[--viz {generic,navigation}] [-hr HORIZON] [-b BATCH_SIZE]
[-v]

RDDL2TensorFlow compiler and simulator

optional arguments:
-h, --help show this help message and exit
--file FILE RDDL filepath
--rddl RDDL RDDL domain id
--policy {default,random}
type of policy (default=random)
--viz {generic,navigation}
type of visualizer (default=generic)
-hr HORIZON, --horizon HORIZON
number of timesteps of each trajectory (default=40)
-b BATCH_SIZE, --batch_size BATCH_SIZE
number of trajectories in a batch (default=75)
-v, --verbose verbosity mode
```

## Programmatic mode

```python
import rddlgym

from tfrddlsim.policy import RandomPolicy
from tfrddlsim.simulation.policy_simulator import PolicySimulator
from tfrddlsim.viz import GenericVisualizer

# parse and compile RDDL
rddl2tf = rddlgym.make('Reservoir-8')
rddl2tf.batch_mode_on()

# run simulations
horizon = 40
batch_size = 75
policy = RandomPolicy(rddl2tf, batch_size)
simulator = Simulator(rddl2tf, policy, batch_size)
trajectories = simulator.run(horizon)

# visualize trajectories
viz = GenericVisualizer(rddl2tf, verbose=True)
viz.render(trajectories)
```

# Simulator

The ``tfrddlsim.Simulator`` implements a stochastic Recurrent Neural Net (RNN) in order to sample state-action trajectories. Each RNN cell encapsulates a ``tfrddlsim.Policy`` module generating actions for current states and comprehends the transition (specified by the CPFs) and reward functions. Sampling is done through dynamic unrolling of the RNN model with the embedded ``tfrddlsim.Policy``.

Note that the ``tfrddlsim`` package only provides a ``tfrddlsim.RandomPolicy`` and a ``tfrddlsim.DefaultPolicy`` (constant policy with all action fluents with default values).

# Documentation

Please refer to [https://tf-rddlsim.readthedocs.io/](https://tf-rddlsim.readthedocs.io/en/latest/) for the code documentation.

# Support

If you are having issues with ``tf-rddlsim``, please let me know at: [thiago.pbueno@gmail.com](mailto://thiago.pbueno@gmail.com).

# License

Copyright (c) 2018 Thiago Pereira Bueno All Rights Reserved.

tf-rddlsim is free software: you can redistribute it and/or modify it
under the terms of the GNU Lesser General Public License as published by
the Free Software Foundation, either version 3 of the License, or (at
your option) any later version.

tf-rddlsim is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser
General Public License for more details.

You should have received a copy of the GNU Lesser General Public License
along with tf-rddlsim. If not, see http://www.gnu.org/licenses/.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: GNU General Public License v3 (GPLv3)
Natural Language
- English
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

0.8.1

Sep 15, 2020

0.8.0

Sep 14, 2020

0.7.1

Sep 7, 2020

0.7.0

Apr 2, 2019

This version

0.6.11

Nov 25, 2018

0.6.10

Nov 9, 2018

0.6.9

Nov 4, 2018

0.6.8

Sep 11, 2018

0.6.7

Sep 11, 2018

0.6.6

Sep 8, 2018

0.6.5

Sep 8, 2018

0.6.3

Aug 31, 2018

0.6.2

Aug 30, 2018

0.6.1

Aug 19, 2018

0.6.0

Aug 15, 2018

0.5.0

Aug 13, 2018

0.4.9

Aug 12, 2018

0.4.8

Aug 12, 2018

0.4.7

Aug 12, 2018

0.4.5

Aug 12, 2018

0.4.4

Aug 12, 2018

0.4.3

Aug 12, 2018

0.4.2

Aug 12, 2018

0.4.1

Aug 10, 2018

0.3.0

Aug 6, 2018

0.2.1

Aug 6, 2018

0.2.0

Aug 6, 2018

0.1.0

Aug 5, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tfrddlsim-0.6.11.tar.gz (16.8 kB view hashes)

Uploaded Nov 25, 2018 Source

Hashes for tfrddlsim-0.6.11.tar.gz

Hashes for tfrddlsim-0.6.11.tar.gz
Algorithm	Hash digest
SHA256	`2ad21b0ea1d4431104c6ec4869c2d805e45d772102b3504bebe25ac94e67838f`
MD5	`9ef76216c8ea20e633f8af821f7e8b48`
BLAKE2b-256	`c3ea7d089d78e35104bf9ce44ad1695b254636fc2eb648ebddaf10ada27e21d8`