eriklindernoren / ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

data-science machine-learning data-mining deep-learning genetic-algorithm deep-reinforcement-learning machine-learning-from-scratch

Updated Jun 28, 2021
Python

microsoft / AirSim

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

simulator research ai computer-vision cross-platform deep-reinforcement-learning artificial-intelligence pixhawk self-driving-car unreal-engine drones deeplearning control-systems platform-independent autonomous-quadcoptor autonomous-vehicles airsim

Updated Mar 23, 2022
C++

Unity-Technologies / ml-agents

Unity Machine Learning Agents Toolkit

reinforcement-learning deep-learning unity unity3d deep-reinforcement-learning neural-networks

Updated Mar 15, 2022
C#

kmario23 / deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

machine-learning natural-language-processing deep-neural-networks reinforcement-learning computer-vision deep-learning optimization machine-translation deep-reinforcement-learning medical-imaging speech-recognition artificial-neural-networks pattern-recognition probabilistic-graphical-models bayesian-statistics artificial-intelligence-algorithms visual-recognition graph-neural-networks

Updated Feb 1, 2022
HTML

lexfridman / mit-deep-learning

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

data-science machine-learning mit deep-learning tensorflow deep-reinforcement-learning artificial-intelligence neural-networks segmentation tensorflow-tutorials deeplearning jupyter-notebooks self-driving-cars deep-rl

Updated May 17, 2021
Jupyter Notebook

carla-simulator / carla

Open-source simulator for autonomous driving research.

simulator research ai computer-vision deep-learning cross-platform deep-reinforcement-learning artificial-intelligence ros self-driving-car ue4 autonomous-driving autonomous-vehicles imitation-learning unreal-engine-4 carla carla-simulator

Updated Mar 23, 2022
C++

google / trax

Open issue: Bidirectional RNN

jonatasgrosman commented Oct 7, 2020

Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?

yenchenlin / DeepLearningFlappyBird

Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).

game deep-learning deep-reinforcement-learning

Updated Dec 9, 2021
Python

aamini / introtodeeplearning

Lab Materials for MIT 6.S191: Introduction to Deep Learning

mit computer-vision deep-learning tensorflow deep-reinforcement-learning neural-networks tensorflow-tutorials deeplearning jupyter-notebooks music-generation algorithmic-bias

Updated Jan 28, 2022
Jupyter Notebook

yandexdataschool / Practical_RL

A course in reinforcement learning in the wild

reinforcement-learning deep-learning course-materials mooc tensorflow keras deep-reinforcement-learning pytorch hacktoberfest git-course pytorch-tutorials

Updated Mar 23, 2022
Jupyter Notebook

evilsocket / pwnagotchi

(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.

ai deep-learning deep-reinforcement-learning wpa-psk bettercap deep-neural-network handshakes

Updated Mar 12, 2022
JavaScript

udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

reinforcement-learning deep-reinforcement-learning openai-gym pytorch dqn neural-networks reinforcement-learning-algorithms dynamic-programming hill-climbing ddpg cross-entropy openai-gym-solutions pytorch-rl ppo ml-agents rl-algorithms

Updated Feb 15, 2022
Jupyter Notebook

datawhalechina / easy-rl

Chinese-language reinforcement learning tutorial (the "Mushroom Book"). Read online at https://datawhalechina.github.io/easy-rl/

reinforcement-learning deep-reinforcement-learning q-learning dqn policy-gradient sarsa a3c ddpg imitation-learning ppo easy-rl

Updated Mar 19, 2022
Jupyter Notebook

AI4Finance-Foundation / FinRL

Open issue: reward decreasing during training

scaro86 commented Mar 21, 2022

Hi,
I am currently using your FinRL_PortfolioAllocation_NeurIPS_2020 code and see some strange behavior at the beginning of training. Sometimes the mean reward of the first episode is very high and then drops during training, as shown in the TensorBoard plot. This high value is never reached again. Any idea why this is happening?

Edit: I'm training PPO agent from stablebaselines3 wit

Alpaca data source URL endpoints v1 has been deprecated as of March 17

Open issue: Does anyone have a learning chart based on episodes?

andri27-ts / Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

machine-learning reinforcement-learning qlearning deep-learning deep-reinforcement-learning artificial-intelligence dqn deepmind evolution-strategies ppo a2c policy-gradients

Updated Jun 30, 2020
Jupyter Notebook

simoninithomas / Deep_reinforcement_learning_Course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

qlearning deep-learning unity tensorflow deep-reinforcement-learning pytorch tensorflow-tutorials deep-q-network actor-critic deep-q-learning ppo a2c

Updated Oct 20, 2020
Jupyter Notebook

tensorforce / tensorforce

Tensorforce: a TensorFlow library for applied reinforcement learning

control reinforcement-learning tensorflow deep-reinforcement-learning tensorflow-library system-control tensorforce

Updated Feb 10, 2022
Python

rlcode / reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

machine-learning reinforcement-learning deep-learning deep-reinforcement-learning dqn policy-gradient a3c deep-q-network actor-critic

Updated Mar 11, 2022
Python

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).