AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Last update: Dec 19, 2022

Related tags

Overview

SimSR

Code and dataset for the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning (AAAI-22).

Requirements

We assume you have access to a gpu that can run CUDA 11. All of the dependencies are in the conda_env.yml file.

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate simsr

Instructions

To train a SimSR agent on the cartpole swingup task from image-based observations run bash run.sh from the root of this directory. The run.sh file contains the following command, which you can modify to try different environments / hyperparamters.

DOMAIN=cartpole
TASK=swingup
SEED=1

MUJOCO_GL="egl" CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
	--domain_name ${DOMAIN} \
	--task_name ${TASK} \
	--encoder_type pixel \
	--action_repeat 4 \
	--pre_transform_image_size 84 \
	--image_size 84 \
	--work_dir ./tmp \
	--agent simsr_sac \
	--frame_stack 3\
	--seed ${SEED} --critic_lr 1e-3 \
	--actor_lr 1e-3 \
	--eval_freq 10000 \
	--batch_size 128 \
	--num_train_steps 260000 > ${DOMAIN}_${TASK}_${SEED}.log &

Note that the MuJoCo Python bindings support three different OpenGL rendering backends: "glfw", "egl", or "osmesa". You can also specify a particular backend to use by setting the MUJOCO_GL= environment variable to one of them.

To visualize progress with tensorboard run:

tensorboard --logdir ./path/to/your/log --port 6006

References

Please cite the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning if you found the resources in the repository useful.

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Related tags

Overview

SimSR

Requirements

Instructions

References

Owner

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

This package proposes simplified exporting pytorch models to ONNX and TensorRT, and also gives some base interface for model inference.

Source code of the paper "Deep Learning of Latent Variable Models for Industrial Process Monitoring".

Datasets for new state-of-the-art challenge in disentanglement learning

CLASP - Contrastive Language-Aminoacid Sequence Pretraining

Code for testing various M1 Chip benchmarks with TensorFlow.

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Weakly-supervised semantic image segmentation with CNNs using point supervision

Codes for NeurIPS 2021 paper "On the Equivalence between Neural Network and Support Vector Machine".

Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization".

Vision transformers (ViTs) have found only limited practical use in processing images

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

LSTM model trained on a small dataset of 3000 names written in PyTorch

Implementation of the paper "Shapley Explanation Networks"

MODNet: Trimap-Free Portrait Matting in Real Time

The implementation of FOLD-R++ algorithm

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs, ICCV 2021

A Strong Baseline for Image Semantic Segmentation

Deep learning operations reinvented (for pytorch, tensorflow, jax and others)

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks