Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but getting them to work in video games (RL competitions and whatnot) involves some nifty little tricks that need a bit of domain expertise. These include reward shaping, curriculum learning, splitting the task into subtasks by hand, and guiding the agent's actions. We took some of these tricks and tried them on three environments with DQN. With the right setup, you get more out of DQN.
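To give a flavour of one of these tricks, here is a minimal sketch of reward shaping. It is not the code from this repository or the paper: it uses the well-known potential-based form, r' = r + gamma * phi(s') - phi(s), and the toy `ChainEnv`, the `ShapedEnv` wrapper and the potential function are all hypothetical names made up for this illustration.

```python
from typing import Callable, Tuple


class ChainEnv:
    """Hypothetical toy environment: walk right along a chain to reach the goal."""

    def __init__(self, length: int = 5) -> None:
        self.length = length
        self.state = 0

    def reset(self) -> int:
        self.state = 0
        return self.state

    def step(self, action: int) -> Tuple[int, float, bool]:
        # Action 1 moves right; any other action moves left (clamped at 0).
        move = 1 if action == 1 else -1
        self.state = max(0, min(self.length, self.state + move))
        done = self.state == self.length
        reward = 1.0 if done else 0.0  # sparse reward, only at the goal
        return self.state, reward, done


class ShapedEnv:
    """Wraps an environment and adds potential-based reward shaping."""

    def __init__(self, env: ChainEnv, potential: Callable[[int], float],
                 gamma: float = 0.99) -> None:
        self.env = env
        self.potential = potential
        self.gamma = gamma
        self._last_state = 0

    def reset(self) -> int:
        self._last_state = self.env.reset()
        return self._last_state

    def step(self, action: int) -> Tuple[int, float, bool]:
        state, reward, done = self.env.step(action)
        # Terminal states are given zero potential.
        next_potential = 0.0 if done else self.potential(state)
        shaped = reward + self.gamma * next_potential - self.potential(self._last_state)
        self._last_state = state
        return state, shaped, done
```

With a potential such as `lambda s: s / 5`, the agent gets a small positive signal for every step towards the goal instead of a single reward at the very end, which is the kind of denser feedback that reward shaping is meant to provide.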

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, check out the branch you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described in the respective branches. You should have three directories:

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.
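Before plotting, it can help to confirm that all three result directories are in place. The snippet below is a small sketch for that and is not part of the repository: `missing_run_dirs` is a hypothetical helper, and it assumes the directories sit in the directory you run it from, which this README does not state explicitly.

```python
from pathlib import Path
from typing import List

# Directory names as listed in this README.
EXPECTED_RUN_DIRS = ("vizdoom-runs", "minerl-runs", "football-runs")


def missing_run_dirs(root: str = ".") -> List[str]:
    """Return the expected result directories that do not exist under root."""
    base = Path(root)
    return [name for name in EXPECTED_RUN_DIRS if not (base / name).is_dir()]


if __name__ == "__main__":
    missing = missing_run_dirs()
    if missing:
        print("Missing result directories:", ", ".join(missing))
    else:
        print("All result directories found.")
```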
