A PyTorch implementation of the multi-agent deep deterministic policy gradients (MADDPG) algorithm

Last update: Dec 28, 2022

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

This is my implementation of the algorithm presented in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments". You can find the paper here: https://arxiv.org/pdf/1706.02275.pdf

You will need to install the Multi-Agent Particle Environment (MAPE), which you can find here: https://github.com/openai/multiagent-particle-envs

Make sure to create a virtual environment with the dependencies for the MAPE, since they are somewhat out of date. I also recommend running this with PyTorch version 1.4.0, as the latest version at the time of writing (1.8) seems to have an issue with an in-place operation I use in the calculation of the critic loss.
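
As a quick sanity check before training, something like the sketch below could warn when the installed PyTorch differs from the recommended 1.4.0. The helper names (parse_version, torch_version_warning) are hypothetical and not part of this repo; the only assumption about PyTorch itself is that torch.__version__ is a version string such as "1.4.0" or "1.8.0+cu111".

```python
import re

RECOMMENDED_TORCH = (1, 4, 0)  # the version recommended above


def parse_version(version):
    """Return the leading numeric (major, minor, patch) part of a version string."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", str(version))
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    major, minor, patch = match.groups()
    return (int(major), int(minor), int(patch or 0))


def torch_version_warning(installed):
    """Return a warning message if `installed` is not the recommended version, else None."""
    if parse_version(installed) == RECOMMENDED_TORCH:
        return None
    wanted = ".".join(map(str, RECOMMENDED_TORCH))
    return f"PyTorch {installed} detected; this code was written against {wanted}."


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment.")
    else:
        print(torch_version_warning(torch.__version__) or "PyTorch version matches.")
```

A mismatch is not necessarily fatal; the check only makes it obvious which version is active in the virtual environment when an in-place operation error shows up.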

It's probably easiest to just clone this repo into the same directory as the MAPE, as the main file requires the make_env function from that package.
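
If cloning the two repos together is not convenient, one possible alternative is to put the MAPE checkout on sys.path before importing make_env. This is only a sketch: load_make_env is a hypothetical helper that does not exist in this repo, and the path and scenario name in the usage comment are assumptions about where and how you run the MAPE.

```python
import importlib
import sys
from pathlib import Path


def load_make_env(mape_dir):
    """Import and return the make_env function from a MAPE checkout at `mape_dir`."""
    mape_dir = Path(mape_dir).resolve()
    # The MAPE repository keeps make_env.py at its top level.
    if not (mape_dir / "make_env.py").is_file():
        raise FileNotFoundError(f"no make_env.py found in {mape_dir}")
    # Make the checkout importable without copying files around.
    if str(mape_dir) not in sys.path:
        sys.path.insert(0, str(mape_dir))
    module = importlib.import_module("make_env")
    return module.make_env


# Example usage (assumed path and scenario name):
#   make_env = load_make_env("../multiagent-particle-envs")
#   env = make_env("simple_adversary")
```

The MAPE's own dependencies still need to be installed in the active virtual environment for the import to succeed.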

The video for this tutorial is found here: https://youtu.be/tZTQ6S9PfkE

Owner

Phil Tabor
