Asynchronous Advantage Actor-Critic in PyTorch

Last update: Dec 12, 2022

Related tags

Overview

Asynchronous Advantage Actor-Critic in PyTorch

This is PyTorch implementation of A3C as described in Asynchronous Methods for Deep Reinforcement Learning.

Since PyTorch has a easy method to control shared memory within multiprocess, we can easily implement asynchronous method like A3C.

Requirement

PyTorch 0.1.6
Python 3.5.2
gym 0.7.2

Usage

training

python run_a3c.py --atari

In default settings, num_process is 8. Set it as python run_a3c --num_process 4 to fit your number of cpu's cores.

test

After training

python test_a3c.py --render --monitor

Owner

Reiji Hatsugai

Graduate School of Information Science and Technology at The University of Tokyo

GitHub Repository

A fast model to compute optical flow between two input images.

DCVNet: Dilated Cost Volumes for Fast Optical Flow This repository contains our implementation of the paper: @InProceedings{jiang2021dcvnet, title={

8 Sep 27, 2021

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

Maths from examples - Learning advanced mathematical computations from examples This is the source code and data sets relevant to the paper Learning a

171 Nov 23, 2022

HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset (ICCV 2021)

Code for HDR Video Reconstruction HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset (ICCV 2021) Guanying Chen, Cha

64 Nov 19, 2022

Zero-Cost Proxies for Lightweight NAS

Zero-Cost-NAS Companion code for the ICLR2021 paper: Zero-Cost Proxies for Lightweight NAS tl;dr A single minibatch of data is used to score neural ne

108 Dec 20, 2022

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors In this paper, we propose a novel local descriptor-based fra

80 Dec 15, 2022

This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.

A three-stage detection and recognition pipeline of complex meters in wild This is the first released system towards detection and recognition of comp

19 Nov 28, 2022

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

DocFormer - PyTorch Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for t

171 Jan 06, 2023

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

Deep learning algorithms for muon momentum estimation in the CMS Trigger System The Compact Muon Solenoid (CMS) is a general-purpose detector at the L

2 Oct 06, 2021

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

TensorLayer is a novel TensorFlow-based deep learning and reinforcement learning library designed for researchers and engineers. It provides an extens

7.1k Dec 29, 2022

ESP32 python application to read data from a Tilt™ Hydrometer for homebrewing

TitlESP32 ESP32 MicroPython application to read and log data from a Tilt™ Hydrometer. Requirements A board with an ESP32 chip USB cable - USB A / micr

5 Dec 01, 2022

Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentation"

Hyper-Convolution Networks for Biomedical Image Segmentation Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentatio

17 Nov 02, 2022

A deep learning library that makes face recognition efficient and effective

Distributed Arcface Training in Pytorch This is a deep learning library that makes face recognition efficient, and effective, which can train tens of

10 Nov 23, 2021

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise

45 Dec 08, 2022

Asynchronous Advantage Actor-Critic in PyTorch

Related tags

Overview

Asynchronous Advantage Actor-Critic in PyTorch

Requirement

Usage

training

test

Owner

Reiji Hatsugai

A fast model to compute optical flow between two input images.

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset (ICCV 2021)

Zero-Cost Proxies for Lightweight NAS

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors

This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

ESP32 python application to read data from a Tilt™ Hydrometer for homebrewing

Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentation"

A deep learning library that makes face recognition efficient and effective

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

The code for SAG-DTA: Prediction of Drug–Target Affinity Using Self-Attention Graph Network.

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Supporting code for the Neograd algorithm

A forwarding MPI implementation that can use any other MPI implementation via an MPI ABI

Character Grounding and Re-Identification in Story of Videos and Text Descriptions

L-Verse: Bidirectional Generation Between Image and Text

DuBE: Duple-balanced Ensemble Learning from Skewed Data