Skip to content
#

deep-reinforcement-learning

Here are 1,945 public repositories matching this topic...

scaro86
scaro86 commented Mar 21, 2022

Hi,
I am currently using your FinRL_PortfolioAllocation_NeurIPS_2020 code and I have some strange behavior at the beginning of training. Sometimes the first episode reward mean value is super high and then drops during the training as shown on the tensorboard plot. This high value is never reached again. Any idea why this is happening ?

Edit: I'm training PPO agent from stablebaselines3 wit

bug good first issue

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • Updated Dec 14, 2021
  • Python

Improve this page

Add a description, image, and links to the deep-reinforcement-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deep-reinforcement-learning topic, visit your repo's landing page and select "manage topics."

Learn more