Do you want an RL agent that moves nicely on Atari?

Rainbow is all you need!

This is a step-by-step tutorial from DQN to Rainbow. Every chapter contains both theoretical background and an object-oriented implementation. Just pick any topic you are interested in and learn! You can run the notebooks right away with Colab, even on your smartphone.

Please feel free to open an issue or a pull request if you have any ideas for making it better. :)

If you want a tutorial for policy gradient methods, please see PG is All You Need.

Contents

DQN [NBViewer] [Colab]
DoubleDQN [NBViewer] [Colab]
PrioritizedExperienceReplay [NBViewer] [Colab]
DuelingNet [NBViewer] [Colab]
NoisyNet [NBViewer] [Colab]
CategoricalDQN [NBViewer] [Colab]
N-stepLearning [NBViewer] [Colab]
Rainbow [NBViewer] [Colab]

Prerequisites

This repository is tested in an Anaconda virtual environment with Python 3.7+.

$ conda create -n rainbow-is-all-you-need python=3.7
$ conda activate rainbow-is-all-you-need
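
Once the environment is active, a quick check can confirm that the interpreter meets the "3.7+" requirement stated above. This is only a minimal sketch: the function name `python_is_supported` and the constant `MIN_PYTHON` are made up for illustration and are not part of the repository.

```python
import sys

# Minimum version taken from the "Python 3.7+" note in Prerequisites.
MIN_PYTHON = (3, 7)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    status = "OK" if python_is_supported() else "too old"
    print(f"Python {sys.version_info.major}.{sys.version_info.minor}: {status}")
```

Run it inside the activated environment; if it reports "too old", the conda environment was probably not activated.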

Installation

First, clone the repository.

git clone https://github.com/Curt-Park/rainbow-is-all-you-need.git
cd rainbow-is-all-you-need

Second, install the packages required to run the code:

make setup
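
After the setup step finishes, one way to sanity-check the installation is to ask Python whether the packages you expect can be found. The sketch below uses only the standard library. The helper name `missing_packages` is hypothetical, and the names in `EXPECTED` are placeholders, since this README does not list what `make setup` installs; replace them with the top-level package names from the repository's requirements.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names in `names` that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Placeholder list (an assumption, not taken from the repository).
    EXPECTED = ["json", "math"]
    missing = missing_packages(EXPECTED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All expected packages were found.")
```

The check only looks packages up without importing them, so it stays fast even for large libraries.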

Contributors

Thanks goes to these wonderful people (emoji key):

Jinwoo Park (Curt)
Kyunghwan Kim
Wei Chen
WANG Lei
leeyaf
ahmadF

This project follows the all-contributors specification. Contributions of any kind welcome!

Owner

Jinwoo Park (Curt)
