rllib
Here are 39 public repositories matching this topic...
-
Updated
Dec 2, 2021 - Python
-
Updated
Oct 23, 2020 - Python
-
Updated
Sep 8, 2021 - Jupyter Notebook
-
Updated
Aug 25, 2021 - Jupyter Notebook
-
Updated
Dec 10, 2021 - Jupyter Notebook
-
Updated
Dec 6, 2021 - C++
-
Updated
May 29, 2021 - Jupyter Notebook
Currently we use a very ad-hoc procedure for scaling the quadratic component of NAF when used for exploration:
https://github.com/angelolovatto/raylab/blob/9820275b17ee085e1955a6d845c0bdf61333f8da/raylab/algorithms/naf/naf_policy.py#L150-L155
A possibly better alternative would be to scale it based on the desired average action stddev. Something like:
scale_tril * (1.0 / average_st-
Updated
Nov 17, 2020 - Python
-
Updated
May 22, 2021 - Python
-
Updated
Jul 7, 2021 - Python
-
Updated
Oct 8, 2021 - Python
-
Updated
Nov 17, 2021 - Python
-
Updated
Dec 8, 2020 - Python
-
Updated
Aug 25, 2020 - Jupyter Notebook
-
Updated
Dec 15, 2020 - Jupyter Notebook
-
Updated
Jun 12, 2019 - Python
-
Updated
Sep 6, 2019 - Python
-
Updated
Nov 27, 2020 - Jupyter Notebook
-
Updated
Oct 12, 2021 - Jupyter Notebook
-
Updated
Aug 11, 2020 - Python
-
Updated
Sep 10, 2021 - Python
-
Updated
Nov 10, 2021 - Jupyter Notebook
-
Updated
Dec 13, 2021 - Python
Implement parameterized linear and non-linear dynamics model with bimodal Gaussian noise.
-
Updated
May 26, 2019 - Python
-
Updated
Sep 30, 2019 - Python
Improve this page
Add a description, image, and links to the rllib topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rllib topic, visit your repo's landing page and select "manage topics."


When users starts Serve cluster with a set of options (http options, checkpoint path) and then connects to it with a different set of options, we should either update it, or error out.