Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Last update: Jan 01, 2023

Overview

Pearl

The Parallel Evolutionary and Reinforcement Learning Library (Pearl) is a pytorch based package with the goal of being excellent for rapid prototyping of new adaptive decision making algorithms in the intersection between reinforcement learning (RL) and evolutionary computation (EC). As such, this is not intended to provide template pre-built algorithms as a baseline, but rather flexible tools to allow the user to quickly build and test their own implementations and ideas. A technical report can be found here.

Main Features

Features	Pearl
RL algorithms (e.g. Actor Critic)	✔️
EC algorithms (e.g. Genetic Algorithm)	✔️
Hybrid algorithms (e.g. CEM-DDPG)	✔️
Multi-agent suppport	✔️
Tensorboard integration	✔️
Modular and extensible components	✔️
Opinionated module settings	✔️
Custom callbacks	✔️

User Guide

Installation

There are two options to install this package:

pip install pearll
git clone [email protected]:LondonNode/Pearl.git

Module Guide

agents: implementations of RL and EC agents where the other modular components are put together
buffers: these handle storing and sampling of trajectories
callbacks: inject logic for every step made in an environment (e.g. save model, early stopping)
common: common methods applicable to all other modules (e.g. enumerations) and a main utils.py file with some useful general logic
explorers: action explorers for enhanced exploration by adding noise to actions and random exploration for first n steps
models: neural network structures which are structured as encoder -> torso -> head
signal_processing: signal processing logic for extra modularity (e.g. TD returns, GAE)
updaters: update neural networks and adaptive/iterative algorithms
settings.py: settings objects for the above components, can be extended for custom components

Agent Templates

See pearll/agents/templates.py for the templates to create your own agents! For more examples, see specific agent implementations under pearll/agents.

Agent Performance

To see training performance, use the command tensorboard --logdir runs or tensorboard --logdir <tensorboard_log_path> defined in your algorithm class initialization.

Python Scripts

To run these you'll need to go to wherever the library is installed, cd pearll.

demo.py: script to run very basic demos of agents with pre-defined hyperparameters, run python3 -m pearll.demo -h for more info
plot.py: script to plot more complex plots that can't be obtained via Tensorboard (e.g. multiple subplots), run python3 -m pearll.plot -h for more info

Developer Guide

Scripts

Linux

scripts/setup_dev.sh: setup your virtual environment
scripts/run_tests.sh: run tests

Windows

scripts/windows_setup_dev.bat: setup your virtual environment
scripts/windows_run_tests.bat: run tests

Dependency Management

Pearl uses poetry for dependency management and build release instead of pip. As a quick guide:

Run poetry add [package] to add more package dependencies.
Poetry automatically handles the virtual environment used, check pyproject.toml for specifics on the virtual environment setup.
If you want to run something in the poetry virtual environment, add poetry run as a prefix to the command you want to execute. For example, to run a python file: poetry run python3 script.py.

Credit

Citing Pearl

@misc{tangri2022pearl,
      title={Pearl: Parallel Evolutionary and Reinforcement Learning Library}, 
      author={Rohan Tangri and Danilo P. Mandic and Anthony G. Constantinides},
      year={2022},
      eprint={2201.09568},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Acknowledgements

Pearl was inspired by Stable Baselines 3 and Tonic

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Balanced-Evolutionary-Semi-Stacking Code for the paper ''BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalan

0 Jan 16, 2022

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

SECSE SECSE: Systemic Evolutionary Chemical Space Explorer Chemical space exploration is a major task of the hit-finding process during the pursuit of

64 Dec 16, 2022

Deep learning with dynamic computation graphs in TensorFlow

TensorFlow Fold TensorFlow Fold is a library for creating TensorFlow models that consume structured data, where the structure of the computation graph

1.8k Dec 28, 2022

A toolkit for developing and comparing reinforcement learning algorithms.

Status: Maintenance (expect bug fixes and minor updates) OpenAI Gym OpenAI Gym is a toolkit for developing and comparing reinforcement learning algori

29.6k Jan 8, 2023

PyTorch implementations of deep reinforcement learning algorithms and environments

Deep Reinforcement Learning Algorithms with PyTorch This repository contains PyTorch implementations of deep reinforcement learning algorithms and env

4.7k Jan 4, 2023

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Off-Policy Multi-Agent Reinforcement Learning (MARL) Algorithms This repository contains implementations of various off-policy multi-agent reinforceme

183 Dec 28, 2022

Reinforcement learning framework and algorithms implemented in PyTorch.

2.1k Jan 4, 2023

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

PyTorch RL Minimal Implementations There are implementations of some reinforcement learning algorithms, whose characteristics are as follow: Less pack

4 Dec 31, 2022

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

4.7k Jan 1, 2023

Comments

Bump pillow from 9.0.0 to 9.0.1
Bumps pillow from 9.0.0 to 9.0.1.

Release notes

Sourced from pillow's releases.

9.0.1

https://pillow.readthedocs.io/en/stable/releasenotes/9.0.1.html

Changes

In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [@radarhere, @hugovk]

Restrict builtins within lambdas for ImageMath.eval. CVE-2022-22817 #6009 [radarhere]

Changelog

Sourced from pillow's changelog.

9.0.1 (2022-02-03)

In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [radarhere, hugovk]

Restrict builtins within lambdas for ImageMath.eval. CVE-2022-22817 #6009 [radarhere]

Commits

6deac9e 9.0.1 version bump

c04d812 Update CHANGES.rst [ci skip]

4fabec3 Added release notes for 9.0.1

02affaa Added delay after opening image with xdg-open

ca0b585 Updated formatting

427221e In show_file, use os.remove to remove temporary images

c930be0 Restrict builtins within lambdas for ImageMath.eval

75b69dd Dont need to pin for GHA

cd938a7 Autolink CWE numbers with sphinx-issues

2e9c461 Add CVE IDs

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

@dependabot use these labels will set the current labels as the default for future PRs for this repo and language

@dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language

@dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language

@dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

You can disable automated security fix PRs for this repo from the Security Alerts page.

dependencies
opened by dependabot[bot] 1
Feature/hybrid

Overhaul models and base agent structure to accommodate RL, MARL, EC in optimizing static functions and RL environments and hybrid algorithms combining RL and EC.

opened by 09tangriro 1
MORE AGENTS

The more agents created the better proof that the tools underlying work as intended.

Agents should be tested on particular environments to ensure performance.
feature good first issue

opened by 09tangriro 0

Releases(v0.4.1)

v0.4.1(May 9, 2022)

Bug fixes and optimizations.

See PR #11
Source code(tar.gz)
Source code(zip)
v0.4.0(May 8, 2022)

Optimizations interfacing with GPU devices. See PR #10
Source code(tar.gz)
Source code(zip)
v0.3.1(Apr 5, 2022)
Bug fixes:

allow different size discrete space output for DiscreteHead.

Update docstrings for pearll/updaters/environment module.

Source code(tar.gz)
Source code(zip)
v0.3.0(Mar 28, 2022)
Introduce model-based RL tools.

Validate model-based RL tools with implementation of DynaQ algorithm.

Cleaner signal_processing module interface using functools.

Source code(tar.gz)
Source code(zip)
v0.2.2(Mar 4, 2022)

Fixed issue running multi-agent algorithms on cuda devices. Now full support for cuda.
Source code(tar.gz)
Source code(zip)
v0.2.1(Mar 2, 2022)
Various bug fixes:

to_numpy cuda support.

FlattenEncoder flattens inputs appropriately.

Callbacks more robust.

Also added a tutorial library.
Source code(tar.gz)
Source code(zip)
v0.2.0(Jan 25, 2022)

Various bug fixes and tweaks to the interface.
Source code(tar.gz)
Source code(zip)
v0.1.0(Jan 11, 2022)

Pre-release before paper submission.
Source code(tar.gz)
Source code(zip)

Owner

GitHub Repository

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

PyMPDATA PyMPDATA is a high-performance Numba-accelerated Pythonic implementation of the MPDATA algorithm of Smolarkiewicz et al. used in geophysical

15 Nov 23, 2022

Music Generation using Neural Networks Streamlit App

Music_Gen_Streamlit "Music Generation using Neural Networks" Streamlit App TO DO: Make a run_app.sh Introduction [~5 min] (Sohaib) Team Member names/i

6 Aug 09, 2022

Code for Iso-Points: Optimizing Neural Implicit Surfaces with Hybrid Representations

Implementation for Iso-Points (CVPR 2021) Official code for paper Iso-Points: Optimizing Neural Implicit Surfaces with Hybrid Representations paper |

66 Nov 08, 2022

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly

rust-mdbg: Minimizer-space de Bruijn graphs (mdBG) for whole-genome assembly rust-mdbg is an ultra-fast minimizer-space de Bruijn graph (mdBG) impleme

148 Dec 01, 2022

Toolchain to build Yoshi's Island from source code

Project-Y Toolchain to build Yoshi's Island (J) V1.0 from source code, by MrL314 Last updated: September 17, 2021 Setup To begin, download this toolch

19 Apr 18, 2022

Generalized Data Weighting via Class-level Gradient Manipulation

Generalized Data Weighting via Class-level Gradient Manipulation This repository is the official implementation of Generalized Data Weighting via Clas

18 Nov 12, 2022

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Semantics Disentangling for Generalized Zero-shot Learning This is the official implementation for paper Zhi Chen, Yadan Luo, Ruihong Qiu, Zi Huang, J

25 Dec 06, 2022

Simple tools for logging and visualizing, loading and training

TNT TNT is a library providing powerful dataloading, logging and visualization utilities for Python. It is closely integrated with PyTorch and is desi

1.5k Jan 02, 2023

BMN: Boundary-Matching Network

BMN: Boundary-Matching Network A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generatio

260 Dec 06, 2022

Rule-based Customer Segmentation

Rule-based Customer Segmentation Business Problem A game company wants to create level-based new customer definitions (personas) by using some feature

2 Jan 03, 2022

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

Multimodal Deep Learning 🎆 🎆 🎆 Announcing the multimodal deep learning repository that contains implementation of various deep learning-based model

398 Dec 30, 2022

Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Related tags

Overview

Pearl

Main Features

User Guide

Installation

Module Guide

Agent Templates

Agent Performance

Python Scripts

Developer Guide

Scripts

Dependency Management

Credit

Citing Pearl

Acknowledgements

You might also like...

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

Deep learning with dynamic computation graphs in TensorFlow

A toolkit for developing and comparing reinforcement learning algorithms.

PyTorch implementations of deep reinforcement learning algorithms and environments

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Reinforcement learning framework and algorithms implemented in PyTorch.

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Comments

Bump pillow from 9.0.0 to 9.0.1

9.0.1

Changes

9.0.1 (2022-02-03)

Feature/hybrid

MORE AGENTS

Releases(v0.4.1)

v0.4.1(May 9, 2022)

v0.4.0(May 8, 2022)

v0.3.1(Apr 5, 2022)

v0.3.0(Mar 28, 2022)

v0.2.2(Mar 4, 2022)

v0.2.1(Mar 2, 2022)

v0.2.0(Jan 25, 2022)

v0.1.0(Jan 11, 2022)

Owner

Numba-accelerated Pythonic implementation of MPDATA with examples in Python, Julia and Matlab

Music Generation using Neural Networks Streamlit App

Code for Iso-Points: Optimizing Neural Implicit Surfaces with Hybrid Representations

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly

Toolchain to build Yoshi's Island from source code

Generalized Data Weighting via Class-level Gradient Manipulation

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Simple tools for logging and visualizing, loading and training

BMN: Boundary-Matching Network

Rule-based Customer Segmentation

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

Official implementation of our paper "LLA: Loss-aware Label Assignment for Dense Pedestrian Detection" in Pytorch.

A pytorch-based real-time segmentation model for autonomous driving

PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.

Feature extraction made simple with torchextractor

High-performance moving least squares material point method (MLS-MPM) solver.

Deep Learning pipeline for motor-imagery classification.

Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer"

An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

Image Deblurring using Generative Adversarial Networks