Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Last update: Jan 06, 2023

Overview

RIIT

Our open-source code for RIIT: Rethinking the Importance of Implementation Tricks in Multi-AgentReinforcement Learning. We implement and standardize the hyperparameters of numerous QMIX variant algorithms that achieve SOTA.

Python MARL framework

PyMARL is WhiRL's framework for deep multi-agent reinforcement learning and includes implementations of the following algorithms:

Value-based Methods:

Actor Critic Methods:

PyMARL is written in PyTorch and uses SMAC as its environment.

Installation instructions

Install Python packages

# require Anaconda 3 or Miniconda 3
bash install_dependecies.sh

Set up StarCraft II and SMAC:

bash install_sc2.sh

This will download SC2 into the 3rdparty folder and copy the maps necessary to run over.

Run an experiment

# For SMAC
python3 src/main.py --config=qmix --env-config=sc2 with env_args.map_name=corridor

# For Cooperative Predator-Prey
python3 src/main.py --config=qmix_prey --env-config=stag_hunt with env_args.map_name=stag_hunt

The config files act as defaults for an algorithm or environment.

They are all located in src/config. --config refers to the config files in src/config/algs --env-config refers to the config files in src/config/envs

Run parallel experiments:

# bash run.sh config_name map_name_list (threads_num arg_list gpu_list experinments_num)
bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5

xxx_list is separated by ,.

All results will be stored in the Results folder and named with map_name.

Force all trainning processes to exit

# all python and game processes of current user will quit.
bash clean.sh

Some test results on Super Hard scenarios

Cite

@article{hu2021riit,
      title={RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning}, 
      author={Jian Hu and Siyang Jiang and Seth Austin Harding and Haibin Wu and Shih-wei Liao},
      year={2021},
      eprint={2102.03479},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Related tags

Overview

RIIT

Python MARL framework

Installation instructions

Run an experiment

Run parallel experiments:

Force all trainning processes to exit

Some test results on Super Hard scenarios

Cite

Owner

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

An implementation demo of the ICLR 2021 paper Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks in PyTorch.

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Official implementation of the Neurips 2021 paper Searching Parameterized AP Loss for Object Detection.

This is a repository of our model for weakly-supervised video dense anticipation.

PyTorch and GPyTorch implementation of the paper "Conditioning Sparse Variational Gaussian Processes for Online Decision-making."

Implementation of Bagging and AdaBoost Algorithm

YOLOX Win10 Project

The code of "Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer".

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

CLDF dataset derived from Robbeets et al.'s "Triangulation Supports Agricultural Spread" from 2021

Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations

Create UIs for prototyping your machine learning model in 3 minutes

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Official repository for Natural Image Matting via Guided Contextual Attention

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

I explore rock vs. mine prediction using a SONAR dataset

Fashion Landmark Estimation with HRNet

Implementation of the paper "Generating Symbolic Reasoning Problems with Transformer GANs"