Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments

Last update: Oct 29, 2022

IROS21 information

To test the code and reproduce the experiments, follow the installation steps in Installation.md. Afterwards, follow the steps in Evaluations.md.

To test the different waypoint generators, follow the steps in waypoint_eval.md.

DRL agents are located in the agents folder.

Arena-MARL

A flexible, high-performance 2D simulator with configurable agents, multiple sensors, and benchmark scenarios for testing robotic navigation in multi-agent settings.

Arena-MARL uses Flatland as the core simulator and is a modular high-level library for end-to-end experiments in embodied AI -- defining embodied AI tasks (e.g. navigation, obstacle avoidance, behavior cloning), training agents (via imitation or reinforcement learning, or no learning at all using conventional approaches like DWA, TEB or MPC), and benchmarking their performance on the defined tasks using standard metrics.
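As a rough illustration of the benchmarking step, the sketch below aggregates per-episode outcomes into a few summary numbers. The record layout (`Episode`) and the metric names are assumptions made for this example; they are not taken from the Arena-MARL code, which may compute and name its metrics differently.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Episode:
    """Outcome of one evaluation episode (hypothetical record layout)."""
    reached_goal: bool
    collisions: int
    time_s: float
    path_length_m: float


def summarize(episodes):
    """Aggregate per-episode outcomes into benchmark-style metrics."""
    if not episodes:
        raise ValueError("need at least one episode")
    successes = [e for e in episodes if e.reached_goal]
    return {
        "success_rate": len(successes) / len(episodes),
        # Share of episodes with at least one collision.
        "collision_rate": sum(e.collisions > 0 for e in episodes) / len(episodes),
        # Time and path length are averaged over successful runs only,
        # so failed episodes do not distort them.
        "mean_time_s": mean(e.time_s for e in successes) if successes else None,
        "mean_path_length_m": (
            mean(e.path_length_m for e in successes) if successes else None
        ),
    }


episodes = [
    Episode(True, 0, 12.0, 9.0),
    Episode(True, 1, 18.0, 11.0),
    Episode(False, 2, 30.0, 6.0),
    Episode(True, 0, 15.0, 10.0),
]
report = summarize(episodes)
print(report)
```

Averaging time and path length over successful episodes only is a design choice of this sketch; a benchmark could just as well report them over all episodes.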


Figure: agent behavior before training and after training.

What is this repository for?

Train DRL agents on ROS-compatible simulations for autonomous navigation in highly dynamic environments. The Flatland-DRL integration is inspired by Ronja Gueldenring's work: drl_local_planner_ros_stable_baselines. Test state-of-the-art local and global planners in ROS environments, both in simulation and on real hardware. The following features are included:

Setup to train a local planner with reinforcement learning approaches from Stable-Baselines3
Training in the Flatland simulator in train mode
Realistic behavior patterns and semantic states of obstacles (speaking, running, etc.)
Different obstacle classes (other robots, vehicles, types of persons, etc.)
Intermediate planner classes that combine the local DRL planner with the global map-based planning of the ROS navigation stack
Testing of a variety of planners (learning-based and model-based) within specific scenarios in test mode
Modular structure for adding new functionalities and approaches
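To make the idea of an intermediate planner more concrete, here is a minimal sketch of one common approach: picking a subgoal on the global path at a fixed lookahead distance and handing it to the local planner. The function `select_subgoal` and its `lookahead` parameter are hypothetical names chosen for this example; the planner classes in this repository may work differently.

```python
import math


def select_subgoal(global_path, robot_xy, lookahead=2.0):
    """Pick the waypoint the local planner should steer toward.

    global_path: list of (x, y) waypoints from start to goal.
    robot_xy: current robot position (x, y).
    lookahead: desired distance in meters between robot and subgoal.
    """
    if not global_path:
        raise ValueError("global_path must not be empty")
    # Find the path point closest to the robot, so points already
    # passed are never selected again.
    nearest = min(
        range(len(global_path)),
        key=lambda i: math.dist(global_path[i], robot_xy),
    )
    # Walk forward from there until a point is at least `lookahead` away.
    for point in global_path[nearest:]:
        if math.dist(point, robot_xy) >= lookahead:
            return point
    # Near the end of the path, the final goal becomes the subgoal.
    return global_path[-1]


path = [(float(x), 0.0) for x in range(11)]  # straight line from x=0 to x=10
subgoal = select_subgoal(path, robot_xy=(3.2, 0.4))
final = select_subgoal(path, robot_xy=(9.5, 0.0))
print(subgoal, final)
```

A short lookahead keeps the agent close to the global path, while a longer one gives the local planner more freedom to move around dynamic obstacles.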

Start Guide

We recommend starting with the start guide, which contains everything you need to get started with this project, including installation on Linux and Windows as well as introductory tutorials.

For Mac, please refer to our Docker setup.

1. Installation

Please refer to Installation.md for detailed explanations about the installation process.

1.1. Docker

We provide a Docker file to run our code on other operating systems. Please refer to Docker.md for more information.

2. Usage

DRL Training

Please refer to DRL-Training.md for detailed explanations about agent, policy and training setups.

Scenario Creation with the arena-scenario-gui

To create complex, collaborative scenarios for training and/or evaluation purposes, please refer to the repo arena-scenario-gui. This application provides a user interface for creating scenarios with multiple dynamic and static obstacles by drawing and by using simple UI elements such as drag and drop. This saves a lot of time when creating complex scenarios for your individual use cases.
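For readers who want a mental model of what such a scenario contains, the sketch below describes one as plain data: a robot start and goal, static obstacles as polygons, and dynamic obstacles that follow waypoints. This is not the file format used by arena-scenario-gui; every class and field name here is hypothetical and chosen only for illustration.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class StaticObstacle:
    polygon: list  # list of [x, y] corner points in meters


@dataclass
class DynamicObstacle:
    kind: str           # e.g. "person" or "robot" (hypothetical labels)
    waypoints: list     # list of [x, y] points visited in order
    speed: float = 0.5  # meters per second


@dataclass
class Scenario:
    name: str
    robot_start: list
    robot_goal: list
    static_obstacles: list = field(default_factory=list)
    dynamic_obstacles: list = field(default_factory=list)

    def to_json(self):
        # asdict() converts nested dataclasses and lists recursively.
        return json.dumps(asdict(self), indent=2)


scenario = Scenario(
    name="corridor_crossing",
    robot_start=[0.0, 0.0],
    robot_goal=[10.0, 0.0],
    static_obstacles=[StaticObstacle(polygon=[[4, -1], [5, -1], [5, -2], [4, -2]])],
    dynamic_obstacles=[
        DynamicObstacle(kind="person", waypoints=[[5, 3], [5, -3]], speed=1.0),
        DynamicObstacle(kind="robot", waypoints=[[8, 0], [2, 0]]),
    ],
)
print(scenario.to_json())
```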

Third-party repos used:

Flatland: http://flatland-simulator.readthedocs.io
ROS navigation stack: http://wiki.ros.org/navigation
Pedsim: https://github.com/srl-freiburg/pedsim_ros
PettingZoo: https://github.com/Farama-Foundation/PettingZoo
SuperSuit: https://github.com/Farama-Foundation/SuperSuit
