Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Last update: Jan 07, 2023

Related tags

Overview

Decision Transformer

Lili Chen*, Kevin Lu*, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas†, and Igor Mordatch†

*equal contribution, †equal advising

A link to our paper can be found on arXiv.

Overview

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. Contains scripts to reproduce experiments.

Instructions

We provide code in two sub-directories: atari containing code for Atari experiments and gym containing code for OpenAI Gym experiments. See corresponding READMEs in each folder for instructions; scripts should be run from the respective directories. It may be necessary to add the respective directories to your PYTHONPATH.

Citation

Please cite our paper as:

@article{chen2021decisiontransformer,
  title={Decision Transformer: Reinforcement Learning via Sequence Modeling},
  author={Lili Chen and Kevin Lu and Aravind Rajeswaran and Kimin Lee and Aditya Grover and Michael Laskin and Pieter Abbeel and Aravind Srinivas and Igor Mordatch},
  journal={arXiv preprint arXiv:2106.01345},
  year={2021}
}

Note: this is not an official Google or Facebook product.

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Related tags

Overview

Decision Transformer

Overview

Instructions

Citation

Owner

Kevin Lu

Course on computational design, non-linear optimization, and dynamics of soft systems at UIUC.

This is the solution for 2nd rank in Kaggle competition: Feedback Prize - Evaluating Student Writing.

A collection of semantic image segmentation models implemented in TensorFlow

Code for layerwise detection of linguistic anomaly paper (ACL 2021)

Flower classification model that classifies flowers in 10 classes made using transfer learning (~85% accuracy).

Deep learning operations reinvented (for pytorch, tensorflow, jax and others)

Repository for the paper "PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation", CVPR 2021.

NovelD: A Simple yet Effective Exploration Criterion

BabelCalib: A Universal Approach to Calibrating Central Cameras. In ICCV (2021)

Configure SRX interfaces with Scrapli

Deep Learning Visuals contains 215 unique images divided in 23 categories

Code and data for paper "Deep Photo Style Transfer"

A Deep learning based streamlit web app which can tell with which bollywood celebrity your face resembles.

Directed Greybox Fuzzing with AFL

VOGUE: Try-On by StyleGAN Interpolation Optimization

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

YOLOX_AUDIO is an audio event detection model based on YOLOX

GBIM(Gesture-Based Interaction map)

A simple log parser and summariser for IIS web server logs

Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)