reinforcement-learning
Here are 8,266 public repositories matching this topic...
-
Updated
May 26, 2022
-
Updated
May 20, 2022 - C#
-
Updated
May 26, 2022 - Python
-
Updated
May 10, 2022 - Python
-
Updated
Jan 8, 2022 - Python
-
Updated
Apr 10, 2022 - HTML
-
Updated
Jun 2, 2022 - C++
-
Updated
Jun 9, 2022 - Jupyter Notebook
-
Updated
Jan 15, 2021 - Jupyter Notebook
-
Updated
Jan 29, 2022 - Python
-
Updated
Nov 1, 2020 - Python
-
Updated
Apr 23, 2022 - Python
-
Updated
Jun 9, 2022 - Jupyter Notebook
Bidirectional RNN
Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?
-
Updated
May 16, 2022 - Jupyter Notebook
-
Updated
Feb 3, 2022
-
Updated
Jun 9, 2022 - Python
-
Updated
May 4, 2022 - Python
-
Updated
May 23, 2022 - Python
-
Updated
Apr 17, 2022 - Jupyter Notebook
-
Updated
Jun 10, 2022
-
Updated
Jun 10, 2022 - Python
-
Updated
Dec 14, 2019 - Jupyter Notebook
-
Updated
May 7, 2021 - JavaScript
-
Updated
Feb 15, 2022 - Jupyter Notebook
-
Updated
Jun 10, 2022 - Python
-
Updated
Mar 18, 2022
🐛 Bug
The documentation of DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) specifies that log_interval parameter is "The number of timesteps before logging". However, when set to 1 (or any other value) the logging is not made at that pace but is instead made every log_interval episode (and not timesteps). In the example below this is made every 200 timesteps.
Improve this page
Add a description, image, and links to the reinforcement-learning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the reinforcement-learning topic, visit your repo's landing page and select "manage topics."
What happened + What you expected to happen
When initializing a Ray Trainer, we provide a
logdir
argument, and the__init__
method of the Trainer stores it as alogdir
class variable.Then, when creating a Trainable with
Trainer.to_tune_trainable()
, it in-turn calls_create_tune_trainable()
, which does not useself.logdir
. So whentune_function
is defined inside `_create_tu