NovelD: A Simple yet Effective Exploration Criterion

Intro

This is an implementation of the method proposed in

NovelD: A Simple yet Effective Exploration Criterion and BeBold: Exploration Beyond the Boundary of Explored Regions

Citation

If you use this code in your own work, please cite our paper:

@article{zhang2021noveld,
  title={NovelD: A Simple yet Effective Exploration Criterion},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

@article{zhang2020bebold,
  title={BeBold: Exploration Beyond the Boundary of Explored Regions},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={arXiv preprint arXiv:2012.08621},
  year={2020}
}

Installation

# Install Instructions
conda create -n ride python=3.7
conda activate noveld 
git clone [email protected]:tianjunz/NovelD.git
cd NovelD
pip install -r requirements.txt

Train NovelD on MiniGrid

OMP_NUM_THREADS=1 python main.py --model bebold --env MiniGrid-ObstructedMaze-2Dlhb-v0 --total_frames 500000000 --intrinsic_reward_coef 0.05 --entropy_cost 0.0005

Acknowledgements

Our vanilla RL algorithm is based on RIDE.

License

This code is under the CC-BY-NC 4.0 (Attribution-NonCommercial 4.0 International) license.

NovelD: A Simple yet Effective Exploration Criterion

Related tags

Overview

NovelD: A Simple yet Effective Exploration Criterion

Intro

Citation

Installation

Train NovelD on MiniGrid

Acknowledgements

License

Owner

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

Data manipulation and transformation for audio signal processing, powered by PyTorch

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Awesome Long-Tailed Learning

A collection of resources, problems, explanations and concepts that are/were important during my Data Science journey

Capstone-Project-2 - A game program written in the Python language

[ICLR 2022] Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics

Repository for RNNs using TensorFlow and Keras - LSTM and GRU Implementation from Scratch - Simple Classification and Regression Problem using RNNs

AI-UPV at IberLEF-2021 DETOXIS task: Toxicity Detection in Immigration-Related Web News Comments Using Transformers and Statistical Models

Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)

A library for preparing, training, and evaluating scalable deep learning hybrid recommender systems using PyTorch.

Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations

The ICS Chat System project for NYU Shanghai Fall 2021

Code for the paper SphereRPN: Learning Spheres for High-Quality Region Proposals on 3D Point Clouds Object Detection, ICIP 2021.

Machine Learning University: Accelerated Computer Vision Class

Script that receives an Image (original) and a set of images to be used as "pixels" in reconstruction of the Original image using the set of images as "pixels"

On the model-based stochastic value gradient for continuous reinforcement learning

Examples of using f2py to get high-speed Fortran integrated with Python easily

Towhee is a flexible machine learning framework currently focused on computing deep learning embeddings over unstructured data.

Official code for "On the Frequency Bias of Generative Models", NeurIPS 2021