PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces

Last update: Mar 10, 2022

Overview

Exploring Munchausen Reinforcement Learning

This is the project repository of my team in the "Advanced Deep Learning for Robotics" course at TUM. Our project's topic is "Exploring Munchausen Reinforcement Learning" based on this paper.

For a detailed discussion, see the report and the final presentation.

Setup

Create a virtual environment.
Run pip3 install -r requirements.txt

Code Structure

This repository is structured as follows:

The directories M-DQN and M-SAC contain the implementations of the RL agents DQN and SAC extended with the Munchausen term, respectively.
The directories rl-baselines3-zoo contains a copy of this repository, where we included the implementations of M-DQN so that we can easily train and test the M-DQN agent on benchmark environments and also compare it to other classical agents. To do so, just follow the steps described in the original repository and insert M-DQN as the agent argument.
The directory particles-envcontains a modified version of this repository. The modified version contains code for a particles environment, where an agent wants to reach a goal, while avoiding obstacles. Besides, M-SAC agent is implemented and included in the code, so that it can be trained and compared to the classical SAC agent.
The directory action-gap contains implementation of callbacks for experiment manager of rl-baselines3-zoo which logs action-gap for tensorboard.

PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces

Related tags

Overview

Exploring Munchausen Reinforcement Learning

Setup

Code Structure

Owner

Mohamed Amine Ketata

CUda Matrix Multiply library.

The easiest tool for extracting radiomics features and training ML models on them.

[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis

Joint parameterization and fitting of stroke clusters

Python Actor concurrency library

Official PyTorch implementation of "Physics-aware Difference Graph Networks for Sparsely-Observed Dynamics".

Crowd-sourced Annotation of Human Motion.

Code related to the manuscript "Averting A Crisis In Simulation-Based Inference"

Deep-Learning-Book-Chapter-Summaries - Attempting to make the Deep Learning Book easier to understand.

This is a demo app to be used in the video streaming applications

Black box hyperparameter optimization made easy.

SCU OlympicsRunning Baseline

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

Code for ICCV2021 paper SPEC: Seeing People in the Wild with an Estimated Camera

A PaddlePaddle version image model zoo.

This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision".

deep_image_prior_extension

arxiv-sanity, but very lite, simply providing the core value proposition of the ability to tag arxiv papers of interest and have the program recommend similar papers.

Meta graph convolutional neural network-assisted resilient swarm communications

Code for the CVPR 2021 paper "Triple-cooperative Video Shadow Detection"