TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.

Last update: Dec 25, 2022

Related tags

Overview

TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL

TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods. We leverage Box2D procedurally generated environments to assess the performance of teacher algorithms in continuous task spaces. Our repository provides:

Two parametric Box2D environments: Stumps Tracks and Parkour
Multiple embodiments with different locomotion skills (e.g. bipedal walker, spider, climbing chimpanzee, fish)
Two Deep RL students: SAC and PPO
Several ACL algorithms: ADR, ALP-GMM, Covar-GMM, SPDL, GoalGAN, Setter-Solver, RIAC
Two benchmark experiments using elements above: Skill-specific comparison and global performance assessment
Three notebooks for systematic analysis of results using statistical tests along with visualization tools (plots, videos...) allowing to reproduce our figures

See our documentation for an exhaustive list.

Using this, we performed a benchmark of the previously mentioned ACL methods which can be seen in our paper. We also provide additional visualization on our website.

Installation

1- Get the repository

git clone https://github.com/flowersteam/TeachMyAgent
cd TeachMyAgent/

2- Install it, using Conda for example (use Python >= 3.6)

conda create --name teachMyAgent python=3.6
conda activate teachMyAgent
pip install -e .

Note: For Windows users, add -f https://download.pytorch.org/whl/torch_stable.html to the pip install -e . command.

Import baseline results from our paper

In order to benchmark methods against the ones we evaluated in our paper you must download our results:

Go to the notebooks folder
Make the download_baselines.sh script executable: chmod +x download_baselines.sh
Download results: ./download_baselines.sh

WARNING: This will download a zip weighting approximayely 4.5GB. Then, our script will extract the zip file in TeachMyAgent/data. Once extracted, results will weight approximately 15GB.

Usage

See our documentation for details on how to use our platform to benchmark ACL methods.

Development

See CONTRIBUTING.md for details.

Citing

If you use TeachMyAgent in your work, please cite the accompanying paper:

@inproceedings{romac2021teachmyagent,
  author    = {Cl{\'{e}}ment Romac and
               R{\'{e}}my Portelas and
               Katja Hofmann and
               Pierre{-}Yves Oudeyer},
  title     = {TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep
               {RL}},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning,
               {ICML} 2021, 18-24 July 2021, Virtual Event},
  series    = {Proceedings of Machine Learning Research},
  volume    = {139},
  pages     = {9052--9063},
  publisher = {{PMLR}},
  year      = {2021}
}

TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.

Related tags

Overview

TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL

Installation

Import baseline results from our paper

Usage

Development

Citing

Owner

Flowers Team

Only works with the dashboard version / branch of jesse

VISNOTATE: An Opensource tool for Gaze-based Annotation of WSI Data

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

Christmas face app for Decathlon xmas coding party!

InterfaceGAN++: Exploring the limits of InterfaceGAN

Vit-ImageClassification - Pytorch ViT for Image classification on the CIFAR10 dataset

PyTorch implementation of Convolutional Neural Fabrics http://arxiv.org/abs/1606.02492

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Matplotlib Image labeller for classifying images

Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs

Learning-based agent for Google Research Football

Efficient Training of Visual Transformers with Small Datasets

Example for AUAV 2022 with obstacle avoidance.

A Python library for unevenly-spaced time series analysis

A simple interface for editing natural photos with generative neural networks.

Fuzzification helps developers protect the released, binary-only software from attackers who are capable of applying state-of-the-art fuzzing techniques

Pytorch implementation of Masked Auto-Encoder

Pytorch code for semantic segmentation using ERFNet

Implementation of U-Net and SegNet for building segmentation

Composable transformations of Python+NumPy programsComposable transformations of Python+NumPy programs