PyTorch implementation for paper StARformer: Transformer with State-Action-Reward Representations.

Last update: Dec 09, 2022

Related tags

Overview

StARformer

This repository contains the PyTorch implementation for our paper titled StARformer: Transformer with State-Action-Reward Representations. We learn local State-Action-Reward representations (StAR-representations) to improve (long) sequence modeling for reinforcement learning (and imitation learning).

Results

Installation

Dependencies can be installed by Conda:

conda env create -f my_env.yml

And install Atari ROMs.

Datasets

Please follow this instruction for datasets.

Example usage

See run.sh or below:

python run_star_atari.py --seed 123 --data_dir_prefix [data_directory] --epochs 10 --num_steps 500000 --num_buffers 50 --batch_size 64 --seq_len 30 --model_type 'star' --game 'Breakout'

[data_directory] is where you place the Atari dataset.

Variants (`model_type`):

'star' (imitation)
'star_rwd' (offline RL)
'star_fusion' (see Figure 4a in our paper)
'star_stack' (see Figure 4b in our paper)

Acknowledgement

This code is based on Decision-Transformer.

PyTorch implementation for paper StARformer: Transformer with State-Action-Reward Representations.

Related tags

Overview

StARformer

Results

Installation

Datasets

Example usage

Variants (`model_type`):

Acknowledgement

Owner

Jinghuan Shang

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

Machine Unlearning with SISA

Fuzzification helps developers protect the released, binary-only software from attackers who are capable of applying state-of-the-art fuzzing techniques

Source for the paper "Universal Activation Function for machine learning"

TabNet for fastai

code for Multi-scale Matching Networks for Semantic Correspondence, ICCV

Weakly-supervised object detection.

Files for a tutorial to train SegNet for road scenes using the CamVid dataset

SelfRemaster: SSL Speech Restoration

A library for using chemistry in your applications

Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP2021)

Pytorch Implementation for NeurIPS (oral) paper: Pixel Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

PyTorch reimplementation of REALM and ORQA

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

The implementation of "Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer"

Interactive Image Generation via Generative Adversarial Networks

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

AI-based, context-driven network device ranking

Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)

Fully Convlutional Neural Networks for state-of-the-art time series classification

PyTorch implementation for paper StARformer: Transformer with State-Action-Reward Representations.

Related tags

Overview

StARformer

Results

Installation

Datasets

Example usage

Variants (model_type):

Acknowledgement

Owner

Jinghuan Shang

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

Machine Unlearning with SISA

Fuzzification helps developers protect the released, binary-only software from attackers who are capable of applying state-of-the-art fuzzing techniques

Source for the paper "Universal Activation Function for machine learning"

TabNet for fastai

code for Multi-scale Matching Networks for Semantic Correspondence, ICCV

Weakly-supervised object detection.

Files for a tutorial to train SegNet for road scenes using the CamVid dataset

SelfRemaster: SSL Speech Restoration

A library for using chemistry in your applications

Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP2021)

Pytorch Implementation for NeurIPS (oral) paper: Pixel Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

PyTorch reimplementation of REALM and ORQA

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

The implementation of "Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer"

Interactive Image Generation via Generative Adversarial Networks

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

AI-based, context-driven network device ranking

Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)

Fully Convlutional Neural Networks for state-of-the-art time series classification

Variants (`model_type`):