Deep Reinforcement Learning Agents

This repository contains a collection of reinforcement learning algorithms written in Tensorflow. The ipython notebook here were written to go along with a still-underway tutorial series I have been publishing on Medium. If you are new to reinforcement learning, I recommend reading the accompanying post for each algorithm.

The repository currently contains the following algorithms:

Q-Table - An implementation of Q-learning using tables to solve a stochastic environment problem.
Q-Network - A neural network implementation of Q-Learning to solve the same environment as in Q-Table.
Simple-Policy - An implementation of policy gradient method for stateless environments such as n-armed bandit problems.
Contextual-Policy - An implementation of policy gradient method for stateful environments such as contextual bandit problems.
Policy-Network - An implementation of a neural network policy-gradient agent that solves full RL problems with states and delayed rewards, and two opposite actions (ie. CartPole or Pong).
Vanilla-Policy - An implementation of a neural network vanilla-policy-gradient agent that solves full RL problems with states, delayed rewards, and an arbitrary number of actions.
Model-Network - An addition to the Policy-Network algorithm which includes a separate network which models the environment dynamics.
Double-Dueling-DQN - An implementation of a Deep-Q Network with the Double DQN and Dueling DQN additions to improve stability and performance.
Deep-Recurrent-Q-Network - An implementation of a Deep Recurrent Q-Network which can solve reinforcement learning problems involving partial observability.
Q-Exploration - An implementation of DQN containing multiple action-selection strategies for exploration. Strategies include: greedy, random, e-greedy, Boltzmann, and Bayesian Dropout.
A3C-Doom - An implementation of Asynchronous Advantage Actor-Critic (A3C) algorithm. It utilizes multiple agents to collectively improve a policy. This implementation can solve RL problems in 3D environments such as VizDoom challenges.

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

Related tags

Overview

Deep Reinforcement Learning Agents

Owner

Arthur Juliani

DWIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.

Pynomial - a lightweight python library for implementing the many confidence intervals for the risk parameter of a binomial model

An easy-to-use app to visualise attentions of various VQA models.

DeepMoCap: Deep Optical Motion Capture using multiple Depth Sensors and Retro-reflectors

PrimitiveNet: Primitive Instance Segmentation with Local Primitive Embedding under Adversarial Metric (ICCV 2021)

Car Price Predictor App used to predict the price of the car based on certain input parameters created using python's scikit-learn, fastapi, numpy and joblib packages.

Hydra Lightning Template for Structured Configs

TensorFlow implementation of Barlow Twins (Barlow Twins: Self-Supervised Learning via Redundancy Reduction)

VoxHRNet - Whole Brain Segmentation with Full Volume Neural Network

Pytorch implementation for Patient Knowledge Distillation for BERT Model Compression

A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently develop and compare their own methods.

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Not Suitable for Work (NSFW) classification using deep neural network Caffe models.

Relaxed-machines - explorations in neuro-symbolic differentiable interpreters

An executor that performs image segmentation on fashion items

PyTorch module to use OpenFace's nn4.small2.v1.t7 model

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

TensorFlow ROCm port

End-to-End Object Detection with Fully Convolutional Network

Normalizing Flows with a resampled base distribution