An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Last update: Jun 09, 2022

Overview

Agar.io_Q-Learning_AI

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions.

An image of the circle categorisation function in action. Food blobs are outlined in blue, edible cells in green and dangerous cells in red according to where our program detects them. Screen edges mess that up a bit. The agents action at this moment is labelled with the green arrow.

States are calculated using the shortest euclidian distance to each of the three circle types: food, edible cells and dangerous cells. These distances are measured and discretized according to which interval they fall within. The rulers in this image are to scale.

Currently the agent can't press any keyboard buttons, only move around using the mouse. It could be added without too much hassle, but it would require a rework of some aspects of the code and a ton training, which already takes ages. The q-learning part could also do with a proper implementation of stochastic q-learning instead of our generic iterative q-learning, if I knew how to do it. I look forward to learning that at a later point.

Feel free to ask any questions about the code or the project. I hope you enjoy!

The humans in the experiment were subject to the same move set as the bots and agents, so only mouse movement.

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Related tags

Overview

Agar.io_Q-Learning_AI

Owner

Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments

Pytorch implementation for "Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion" (NeurIPS 2021)

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Pytorch implementation of FlowNet by Dosovitskiy et al.

Graph Robustness Benchmark: A scalable, unified, modular, and reproducible benchmark for evaluating the adversarial robustness of Graph Machine Learning.

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

Perception-aware multi-sensor fusion for 3D LiDAR semantic segmentation (ICCV 2021)

A short and easy PyTorch implementation of E(n) Equivariant Graph Neural Networks

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

Code for the paper SphereRPN: Learning Spheres for High-Quality Region Proposals on 3D Point Clouds Object Detection, ICIP 2021.

Towards Long-Form Video Understanding

PyTorch implementation of neural style transfer algorithm

The repository forked from NVlabs uses our data. (Differentiable rasterization applied to 3D model simplification tasks)

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

CVPR 2022 "Online Convolutional Re-parameterization"

NeurIPS-2021: Neural Auto-Curricula in Two-Player Zero-Sum Games.

Implementation of ConvMixer for "Patches Are All You Need? 🤷"

Github Traffic Insights as Prometheus metrics.

RRL: Resnet as representation for Reinforcement Learning

Pytorch implementation of face attention network