Proto-RL: Reinforcement Learning with Prototypical Representations

Last update: Dec 06, 2022

Overview

Proto-RL: Reinforcement Learning with Prototypical Representations

This is a PyTorch implementation of Proto-RL from

Reinforcement Learning with Prototypical Representations by

Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto.

[Paper]

Citation

If you use this repo in your research, please consider citing the paper as follows

@article{yarats2021proto,
    title={Reinforcement Learning with Prototypical Representations},
    author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
    year={2021},
    eprint={2102.11271},
    archivePrefix={arXiv},
    primaryClass={cs.ML}
}

Requirements

We assume you have access to a gpu that can run CUDA 11. Then, the simplest way to install all required dependencies is to create an anaconda environment by running

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate proto

Instructions

In order to pretrain the agent you need to specify the number of task-agnostic environment steps by setting num_expl_steps, after that many steps, the agent will start receving the downstream task reward until it takes num_train_steps in total. For example, to pre-train the Proto-RL agent on Cheetah Run task unsupervisely for 500k environment steps and then train it further with the downstream reward for another 500k steps, you can run:

python train.py env=cheetah_run num_expl_steps=250000 num_train_steps=500000

Note that we divede the number of steps by action repeat, which is set to 2 for all the environments.

This will produce the exp_local folder, where all the outputs are going to be stored including train/eval logs, tensorboard blobs, and evaluation episode videos. To launch tensorboard run

tensorboard --logdir exp_local

Proto-RL: Reinforcement Learning with Prototypical Representations

Related tags

Overview

Proto-RL: Reinforcement Learning with Prototypical Representations

Citation

Requirements

Instructions

Owner

Denis Yarats

FastyAPI is a Stack boilerplate optimised for heavy loads.

Code for the paper “The Peril of Popular Deep Learning Uncertainty Estimation Methods”

Code for paper [ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot] (ICCV 2021, oral))

A script that trains a model to recognize handwritten digits using the MNIST data set.

ReLoss - Official implementation for paper "Relational Surrogate Loss Learning" ICLR 2022

classification task on dataset-CIFAR10,by using Tensorflow/keras

Metadata-Extractor - Metadata Extractor Script can be used to read in exif metadata

Fully Convlutional Neural Networks for state-of-the-art time series classification

This repository is for DSA and CP scripts for reference.

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

Computer Vision application in the web

Motion and Shape Capture from Sparse Markers

Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code

Omnidirectional camera calibration in python

a pytorch implementation of auto-punctuation learned character by character

A hue shift helper for OBS

Public repository containing materials used for Feed Forward (FF) Neural Networks article.

Implementation of a Transformer using ReLA (Rectified Linear Attention)

Run Effective Large Batch Contrastive Learning on Limited Memory GPU