On Effective Scheduling of Model-based Reinforcement Learning

Last update: Oct 07, 2022

Related tags

Deep Learning autombpo

Overview

On Effective Scheduling of Model-based Reinforcement Learning

Code to reproduce the experiments in On Effective Scheduling of Model-based Reinforcement Learning.

Requirements

To install requirements:

pip install -r requirements.txt

Mujoco license is required to run the experiments on the Mujoco environments.

Training

To train the hyper-controller of the paper, run this command:

python train.py --env=

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python train.py --env=hopper

The trained hyper-controller will be saved in saved-models/. The computing infrastructure used in our experiments and the around computation time to train the hyper-controller is provided in Appendix G.

Evaluation

After training, to evaluate the trained hyper-controller, run:

python eval.py --config=config.
   
     --model_path=saved-models

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python eval.py --config=config.hopper --model_path=saved-models

Notice this command can only be run after finishing training the hyper-controller on the corresponding environments.

Pre-trained Models

We provided our pre-trained hyper-controller in pre-trained-models/ to better reproduce the experiments. To evaluate the pre-trained models, run:

python eval.py --config=config.
   
     --model_path=pre-trained-models

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python eval.py --config=config.hopper --model_path=pre-trained-models

On Effective Scheduling of Model-based Reinforcement Learning

Related tags

Overview

On Effective Scheduling of Model-based Reinforcement Learning

Requirements

Training

Evaluation

Pre-trained Models

Owner

laihang

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing

Face Recognition Attendance Project

Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"

Code for GNMR in ICDE 2021

Repository of best practices for deep learning in Julia, inspired by fastai

This is a pytorch implementation of the NeurIPS paper GAN Memory with No Forgetting.

A custom-designed Spider Robot trained to walk using Deep RL in a PyBullet Simulation

Self-supervised learning (SSL) is a method of machine learning

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Simple-System-Convert--C--F - Simple System Convert With Python

A curated list of long-tailed recognition resources.

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

Data and code from COVID-19 machine learning paper

PyTorch reimplementation of minimal-hand (CVPR2020)

A curated list and survey of awesome Vision Transformers.

Reinforcement learning for self-driving in a 3D simulation

An LSTM for time-series classification

A Learning-based Camera Calibration Toolbox

Anatomy of Matplotlib -- tutorial developed for the SciPy conference

The final project for "Applying AI to Wearable Device Data" course from "AI for Healthcare" - Udacity.