Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Last update: Jan 01, 2023

Related tags

Overview

Offline Meta-Reinforcement Learning with Advantage Weighting (MACAW)

MACAW code used for the experiments in the ICML 2021 paper.

Installing the environment

# Install Python 3.7.9 if necessary
$ pyenv install 3.7.9
$ pyenv shell 3.7.9

$ python --version
Python 3.7.9

$ python -m venv env
$ source env/bin/activate
$ pip install -r requirements.txt

Downloading the data

The offline data used for MACAW can be found here. Download it and use the default name (macaw_offline_data) for the folder where the four data directories are stored. gDrive might be useful here if downloading from the Google Drive GUI is not an option.

Running MACAW 🦜

Run offline meta-training with periodic online evaluations with any of the scripts in scripts/. e.g.

$ . scripts/macaw_dir.sh # MACAW training on Cheetah-Direction (Figure 1)
$ . scripts/macaw_vel.sh # MACAW training on Cheetah-Velocity (Figure 1)
$ . scripts/macaw_quality_ablation.sh # Data quality ablation (Figure 5-left)
...

Outputs (tensorboard logs) will be written to the log/ directory.

Reach out!

If you're having issues with the code or data, feel free to open an issue or send me an email.

Citation

If our code or research was useful for your own work, you can cite us with the following attribution:

@InProceedings{mitchell2021offline,
    title = {Offline Meta-Reinforcement Learning with Advantage Weighting},
    author = {Mitchell, Eric and Rafailov, Rafael and Peng, Xue Bin and Levine, Sergey and Finn, Chelsea},
    booktitle = {Proceedings of the 38th International Conference on Machine Learning},
    year = {2021}
}

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Related tags

Overview

Offline Meta-Reinforcement Learning with Advantage Weighting (MACAW)

Installing the environment

Downloading the data

Running MACAW 🦜

Reach out!

Citation

Owner

Eric Mitchell

Official repository for Natural Image Matting via Guided Contextual Attention

Multi-view 3D reconstruction using neural rendering. Unofficial implementation of UNISURF, VolSDF, NeuS and more.

Python lib to talk to pylontech lithium batteries (US2000, US3000, ...) using RS485

The official implementation for "FQ-ViT: Fully Quantized Vision Transformer without Retraining".

WSDM‘2022: Knowledge Enhanced Sports Game Summarization

Spatiotemporal resampling methods for mlr3

Density-aware Single Image De-raining using a Multi-stream Dense Network (CVPR 2018)

A Pytorch implementation of CVPR 2021 paper "RSG: A Simple but Effective Module for Learning Imbalanced Datasets"

State of the Art Neural Networks for Deep Learning

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

Neural Module Network for VQA in Pytorch

Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

Segmentation vgg16 fcn - cityscapes

The pyrelational package offers a flexible workflow to enable active learning with as little change to the models and datasets as possible

A package to predict protein inter-residue geometries from sequence data

PyTorch implementation of Federated Learning with Non-IID Data, and federated learning algorithms, including FedAvg, FedProx.

PyTorch implementation of popular datasets and models in remote sensing

AdaNet is a lightweight TensorFlow-based framework for automatically learning high-quality models with minimal expert intervention

Code of the paper "Deep Human Dynamics Prior" in ACM MM 2021.

Pytorch implementation for RelTransformer