Multi Task RL Baselines

Last update: Jan 09, 2023

Related tags

Deep Learning mtrl

Overview

MTRL

Multi Task RL Algorithms

Introduction
Setup
Usage
Documentation
Contributing to MTRL
Community
Acknowledgements

Introduction

MTRL is a library of multi-task reinforcement learning algorithms. It has two main components:

Building blocks and agents that implement the multi-task RL algorithms.
Experiment setups that enable training/evaluation on different setups.

Together, these two components enable use of MTRL across different environments and setups.

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

License

Citing MTRL

If you use MTRL in your research, please use the following BibTeX entry:

@Misc{Sodhani2021MTRL,
  author =       {Shagun Sodhani and Amy Zhang},
  title =        {MTRL - Multi Task RL Algorithms},
  howpublished = {Github},
  year =         {2021},
  url =          {https://github.com/facebookresearch/mtrl}
}

Setup

Clone the repository: git clone [email protected]:facebookresearch/mtrl.git.
Install dependencies: pip install -r requirements/dev.txt

Usage

MTRL supports 8 different multi-task RL algorithms as described here.
MTRL supports multi-task environments using MTEnv. These environments include MetaWorld and multi-task variants of DMControl Suite
Refer the tutorial to get started with MTRL.

Documentation

https://mtrl.readthedocs.io

Contributing to MTRL

There are several ways to contribute to MTRL.

Use MTRL in your research.
Contribute a new algorithm. We currently support 8 multi-task RL algorithms and are looking forward to adding more environments.
Check out the good-first-issues on GitHub and contribute to fixing those issues.
Check out additional details here.

Community

Ask questions in the chat or github issues:

Chat
Issues

Acknowledgements

Our implementation of SAC is inspired by Denis Yarats' implementation of SAC.
Project file pre-commit, mypy config, towncrier config, circleci etc are based on same files from Hydra.

Multi Task RL Baselines

Related tags

Overview

MTRL

Contents

Introduction

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

License

Citing MTRL

Setup

Usage

Documentation

Contributing to MTRL

Community

Acknowledgements

Owner

Facebook Research

Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

An Unbiased Learning To Rank Algorithms (ULTRA) toolbox

git《Commonsense Knowledge Base Completion with Structural and Semantic Context》(AAAI 2020) GitHub: [fig1]

Implementation for Homogeneous Unbalanced Regularized Optimal Transport

This repository compare a selfie with images from identity documents and response if the selfie match.

Simulation-based performance analysis of server-less Blockchain-enabled Federated Learning

PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)

Human head pose estimation using Keras over TensorFlow.

A scikit-learn-compatible module for estimating prediction intervals.

HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset (ICCV 2021)

Self-Supervised Multi-Frame Monocular Scene Flow (CVPR 2021)

This is a vision-based 3d model manipulation and control UI

UV matrix decompostion using movielens dataset

Implementation of Self-supervised Graph-level Representation Learning with Local and Global Structure (ICML 2021).

Implementations of paper Controlling Directions Orthogonal to a Classifier

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

AdamW optimizer for bfloat16 models in pytorch.

A FAIR dataset of TCV experimental results for validating edge/divertor turbulence models.

Resco: A simple python package that report the effect of deep residual learning

Source code, data, and evaluation details for “Cross-Lingual Citations in English Papers: A Large-Scale Analysis of Prevalence, Formation, and Ramifications”