evolvingrl

Supplementary Data for Evolving Reinforcement Learning Algorithms

This dataset contains 1000 loss graphs from two experiments: 500 unique graphs learned from scratch, and 500 unique graphs seeded by the DQN loss.

There are two CSV files: from_scratch.csv and dqn_seeded.csv. Each has two columns, id and reward, and is sorted by reward from highest to lowest. The graph with a given id is visualized in a PNG file named {id}.png. These PNG files are in the folders from_scratch_graphs/ and dqn_seeded_graphs/.
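The layout above can be used to look up the image for the best-scoring graphs. The following is a minimal sketch, not part of the dataset: it assumes each CSV has a header row with the column names id and reward, and that the PNG for a graph is named after its id. The function name top_graphs and its parameters are hypothetical.

```python
import csv
from pathlib import Path


def top_graphs(csv_path, graph_dir, k=5):
    """Return (id, reward, png_path) tuples for the k highest-reward graphs.

    Assumes a header row naming the columns "id" and "reward", and that the
    image for a graph lives at <graph_dir>/<id>.png (both are assumptions).
    """
    with open(csv_path, newline="") as f:
        rows = [(row["id"], float(row["reward"])) for row in csv.DictReader(f)]
    # The files are described as already sorted by reward; sorting again is
    # only a cheap safeguard in case a file has been edited.
    rows.sort(key=lambda r: r[1], reverse=True)
    return [(gid, reward, Path(graph_dir) / f"{gid}.png") for gid, reward in rows[:k]]
```

For example, top_graphs("dqn_seeded.csv", "dqn_seeded_graphs", k=3) would return the three highest-reward DQN-seeded graphs together with the paths of their images.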

Notes on reading the graph:

Input nodes are in green, the output node is in blue.
The directed edges represent the data flow. A red edge represents the 2nd input of a binary operator, and all other edges are in black. This coloring scheme is necessary for encoding the input order of non-commutative operators such as - and /.
It’s common to have isolated input nodes and intermediate nodes that do not contribute to the final output. We can ignore these nodes.
As an example, Q(s_{t-1}, a_{t-1}) is represented by 5 nodes:
- Q_param → QValueListOp ← s_tm1. This gives Q(s_{t-1}, -).
- QValueListOp → SelectList ← a_{t-1}. This uses a_{t-1} to index into Q(s_{t-1}, -).
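
To make the two steps concrete, here is a small sketch of how those five nodes compose. It is an illustration only: the Q function is replaced by a toy lookup table with made-up values, and the Python functions q_value_list_op and select_list are hypothetical stand-ins for the QValueListOp and SelectList nodes, not the implementation used in the experiments.

```python
def q_value_list_op(q_param, state):
    """Stand-in for QValueListOp: returns Q(state, -), one value per action."""
    return q_param[state]


def select_list(values, index):
    """Stand-in for SelectList: picks the entry of `values` at `index`."""
    return values[index]


# Three input nodes. The table is a toy example: 3 states x 2 actions.
q_param = [[0.1, 0.2], [0.5, -0.3], [0.0, 1.0]]
s_tm1 = 1  # previous state s_{t-1}
a_tm1 = 0  # previous action a_{t-1}

# Two operator nodes, evaluated in data-flow order.
q_row = q_value_list_op(q_param, s_tm1)  # Q(s_{t-1}, -)
q_sa = select_list(q_row, a_tm1)         # Q(s_{t-1}, a_{t-1})
```

Argument order matters for select_list, since swapping the list and the index changes the meaning. This is the kind of input ordering that the red edge marks in the graph images.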

Owner

John Co-Reyes
