Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop

Last update: Dec 14, 2021

Overview

Guiding Evolutionary Strategies by Differentiable Robot Simulators

In recent years, Evolutionary Strategies (ES) have been actively explored for policy search in robotic tasks, as they provide a simpler alternative to reinforcement learning algorithms. However, this class of algorithms is often claimed to be extremely sample-inefficient. On the other hand, there is growing interest in Differentiable Robot Simulators (DRS), as they can potentially find successful policies with only a handful of trajectories. But the resulting gradient is not always useful for first-order optimization. In this work, we demonstrate how the DRS gradient can be used in conjunction with Evolutionary Strategies. Preliminary results suggest that this combination can reduce the sample complexity of Evolutionary Strategies by 3x-5x in both simulation and the real world.
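To make the idea concrete, here is a minimal sketch of one way a possibly inaccurate gradient can guide ES. This README does not spell out the exact algorithm, so the sketch assumes a Guided-ES-style scheme: perturbations are drawn from a search distribution stretched along the simulator gradient, and the update uses antithetic finite differences of the real cost. Everything in it is hypothetical and for illustration only: the toy quadratic `loss` stands in for a robot rollout, `simulator_gradient` adds a constant bias to mimic an inaccurate simulator, and the function names and hyperparameters (`sigma`, `alpha`, `pairs`, `lr`, `beta`) are not taken from the repository.

```python
import math
import random


def loss(x):
    """Toy stand-in for the cost of a real rollout: a quadratic bowl."""
    return sum(v * v for v in x)


def simulator_gradient(x, bias=0.5):
    """Hypothetical inaccurate simulator gradient: true gradient plus a bias."""
    return [2.0 * v + bias for v in x]


def guided_es_step(x, rng, sigma=0.1, alpha=0.5, pairs=8, lr=0.1, beta=1.0):
    """One ES update whose search distribution is stretched along the
    (possibly wrong) simulator gradient. Assumed scheme, not the paper's code."""
    n = len(x)
    g = simulator_gradient(x)
    norm = math.sqrt(sum(v * v for v in g)) or 1.0
    u = [v / norm for v in g]  # unit vector spanning the 1-D guiding subspace

    estimate = [0.0] * n
    for _ in range(pairs):
        z_full = [rng.gauss(0.0, 1.0) for _ in range(n)]  # isotropic part
        z_sub = rng.gauss(0.0, 1.0)                       # part along u
        eps = [
            sigma * (math.sqrt(alpha / n) * zf + math.sqrt(1.0 - alpha) * z_sub * ui)
            for zf, ui in zip(z_full, u)
        ]
        # Antithetic evaluation: only the "real" loss is queried here.
        plus = loss([xi + e for xi, e in zip(x, eps)])
        minus = loss([xi - e for xi, e in zip(x, eps)])
        for i in range(n):
            estimate[i] += eps[i] * (plus - minus)

    scale = beta / (2.0 * sigma ** 2 * pairs)
    return [xi - lr * scale * gi for xi, gi in zip(x, estimate)]


if __name__ == "__main__":
    rng = random.Random(0)
    x = [1.0, -2.0, 0.5, 1.5, -1.0]
    print("initial loss:", loss(x))
    for _ in range(300):
        x = guided_es_step(x, rng)
    print("final loss:", loss(x))
```

Setting `alpha=1.0` removes the guiding term and recovers plain antithetic ES, which gives a simple baseline for comparing how many loss evaluations each variant needs on a given problem.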

To appear in the 4th Robot Learning Workshop: Self-Supervised and Lifelong Learning

Paper -- Video -- Poster

Citation

Please use the following BibTeX entry:

@misc{kurenkov2021guiding,
      title={Guiding Evolutionary Strategies by Differentiable Robot Simulators}, 
      author={Vladislav Kurenkov and Bulat Maksudov},
      year={2021},
      eprint={2110.00438},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}
