A project studying the influence of communication in multi-objective normal-form games

Last update: Dec 17, 2021

Related tags

Overview

Communication in Multi-Objective Normal-Form Games

This repo consists of five different types of agents that we have used in our study of communication in multi-objective normal-form games. The settings that involve communication do this following a leader-follower model as seen in Stackelberg games. In such settings, agents switch in a round-robin fashion between being the leader and communicating something and being the follower and observing the communication.

No communication setting

In this setting two agents play a normal-form game for a certain amount of episodes. This experiment serves as a baseline for all other experiments.

Cooperative action communication setting

In this setting, agents communicate the next action that they will play. The follower uses this message to pre-update their policy. This setting is similar to Iterated Best Response and attempts to find the optimal joint policy.

Competitive action communication setting

This setting places the agents in a more competitive environment. This means that agents learn a specific best-response policy to every possible message. As such, agent's are not optimising for an optimal joint policy, but rather are acting in a self-interested manner.

Cooperative policy communication setting

This setting follows the same dynamics as the cooperative action communication setting, but communicates the entire policy instead of the next action that will be played.

Optional communication setting

The last setting gives agents the chance to learn for themselves whether communication helps them. All agents learn a top-level policy that chooses whether they will communicate when they are the leader or not. They also have two low-level agents, one "no communication agent" and one agent that does communicate. Which agent that is used as the communicating agent, is completely optional. When agents choose to communicate, they utilise their lower level communicating agent. When agents opt out of communication, they utilise their lower level no communication agent.

Getting Started

Experiments can be run from the MONFG.py file. There are 5 MONFGs available, having different equilibria properties under the SER optimisation criterion, using the specified non linear utility functions. You can also specify the type of experiment to run and other parameters.

License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details

A project studying the influence of communication in multi-objective normal-form games

Related tags

Overview

Communication in Multi-Objective Normal-Form Games

No communication setting

Cooperative action communication setting

Competitive action communication setting

Cooperative policy communication setting

Optional communication setting

Getting Started

License

Owner

Willem Röpke

Spatial Transformer Nets in TensorFlow/ TensorLayer

maximal update parametrization (µP)

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Video-based open-world segmentation

Generative Exploration and Exploitation - This is an improved version of GENE.

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)

PyTorch implementations of algorithms for density estimation

ConvMAE: Masked Convolution Meets Masked Autoencoders

imbalanced-DL: Deep Imbalanced Learning in Python

A very tiny, very simple, and very secure file encryption tool.

Official code for "Decoupling Zero-Shot Semantic Segmentation"

PyTorch implementation of: Michieli U. and Zanuttigh P., "Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations", CVPR 2021.

Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)

A Python Reconnection Tool for alt:V

A novel pipeline framework for multi-hop complex KGQA task. About the paper title: Improving Multi-hop Embedded Knowledge Graph Question Answering by Introducing Relational Chain Reasoning

Causal-Adversarial-Instruments - PyTorch Implementation for Developing Library of Investigating Adversarial Examples on A Causal View by Instruments

Project dự đoán giá cổ phiếu bằng thuật toán LSTM gồm: code train và code demo

Data reduction pipeline for KOALA on the AAT.

Sinkformers: Transformers with Doubly Stochastic Attention