A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

Last update: Nov 23, 2022

Overview

Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

This repository implements a simple algorithm for imitation learning: DAGGER. In this example, the agent only learns to control the steer [-1, 1], the speed is computed automatically in gym_torcs.TorcsEnv.

Requirements

Ubuntu (I only test on this)
Python 3
TensorLayer and TensorFlow
Gym-Torcs

Setting Up

It is a little bit boring to set up the environment, but any incorrect configurations will lead to FAILURE. After installing Gym-Torcs, please follow the instructions to confirm everything work well:

Open a terminal:
- Run sudo torcs -vision to start a game
- Race --> Practice --> Configure Race: set the driver to scr_server 1 instead of player
- Open Torcs server by selecting Race --> Practice --> New Race: This should result that Torcs keeps a blue screen with several text information.
Open another terminal:
- Run python snakeoil3_gym.py on another terminal, it will shows how the fake AI control the car.
- Press F2 to see the driver view.
Set image size to 64x64x3:
- The model is trained on 64x64 RGB observation.
- Run sudo torcs -vision to start a game
- Options --> Display --> select 64x64 --> Apply

Usage

Make sure everything above work well and then run:

python dagger.py

It will start a Torcs server at the beginning of every episode, and terminate the server when the car crashs or the speed is too low. Note that, the self-contained gym_torcs.py is modified from Gym-Torcs, you can try different settings (like default speed, terminated speed) by modifying it.

Results

After Episode 1, the car crashes after 315 steps.

After Episode 3, the car does not crash anymore !!!

The number of steps and episodes might vary depending on the parameters initialization.

ENJOY !

You might also like...

PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching This is the official PyTorch implementation of SMODICE: Versatile Offline I

14 Aug 30, 2022

Neon-erc20-example - Example of creating SPL token and wrapping it with ERC20 interface in Neon EVM

Example of wrapping SPL token by ERC2-20 interface in Neon Requirements Install

7 Mar 28, 2022

Example-custom-ml-block-keras - Custom Keras ML block example for Edge Impulse

Custom Keras ML block example for Edge Impulse This repository is an example on

8 Nov 2, 2022

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

Python Kafka reset consumergroup offset example This is a simple example of how

1 Feb 16, 2022

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

This repo contains code for our paper State-only Imitation with Transition Dynamics Mismatch published at ICLR 2020. The code heavily uses the RL mach

20 Sep 8, 2022

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision Kehong Gong*, Bingbing Li*, Jianfeng Zhang*, Ta

256 Dec 28, 2022

Learning to Estimate Hidden Motions with Global Motion Aggregation

Learning to Estimate Hidden Motions with Global Motion Aggregation (GMA) This repository contains the source code for our paper: Learning to Estimate

221 Dec 18, 2022

Official repository for the CVPR 2021 paper "Learning Feature Aggregation for Deep 3D Morphable Models"

Deep3DMM Official repository for the CVPR 2021 paper Learning Feature Aggregation for Deep 3D Morphable Models. Requirements This code is tested on Py

38 Dec 27, 2022

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

A PyTorch Reproduction of HCN Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation. Ch

210 Dec 31, 2022

Comments

About the convergence and overfit

Hi, thanks for your job and I rewrite it using Keras in the attitude of learning. And I use your recommended hyper-parameters but when I run my program it's apt to overfit. Later on, I change the hyper-parameters , add BN and explicit initialization function of each layer. But it's still overfitting and the car runs 700 steps at the best time but still can't go through the all track. I have spent more than two weeks to tune it. I'm so confused of the tuning, why the same hyper-parameters can't achieve the same result? Why the network is so apt to overfit? For convenience, I update my programmer imitationLearning.py Can you give me some idea? Than you in advance.

opened by marooncn 0

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

Related tags

Overview

Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

Requirements

Setting Up

Usage

Results

You might also like...

PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Neon-erc20-example - Example of creating SPL token and wrapping it with ERC20 interface in Neon EVM

Example-custom-ml-block-keras - Custom Keras ML block example for Edge Impulse

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

Learning to Estimate Hidden Motions with Global Motion Aggregation

Official repository for the CVPR 2021 paper "Learning Feature Aggregation for Deep 3D Morphable Models"

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

Comments

About the convergence and overfit

Releases(0.1)

0.1(Aug 10, 2017)

Owner

Hao

A new version of the CIDACS-RL linkage tool suitable to a cluster computing environment.

Annotate datasets with a semi-trained or fully trained YOLOv5 model

An implementation of IMLE-Net: An Interpretable Multi-level Multi-channel Model for ECG Classification

(ICCV 2021) ProHMR - Probabilistic Modeling for Human Mesh Recovery

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Official MegEngine implementation of CREStereo(CVPR 2022 Oral).

implementation of the paper "MarginGAN: Adversarial Training in Semi-Supervised Learning"

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

CLIPImageClassifier wraps clip image model from transformers

MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieva

A Deep learning based streamlit web app which can tell with which bollywood celebrity your face resembles.

SCALoss: Side and Corner Aligned Loss for Bounding Box Regression (AAAI2022).

A Python framework for conversational search

Spline is a tool that is capable of running locally as well as part of well known pipelines like Jenkins (Jenkinsfile), Travis CI (.travis.yml) or similar ones.

Build Low Code Automated Tensorflow, What-IF explainable models in just 3 lines of code.

codes for Image Inpainting with External-internal Learning and Monochromic Bottleneck

This repository contains several jupyter notebooks to help users learn to use neon, our deep learning framework

CyTran: Cycle-Consistent Transformers for Non-Contrast to Contrast CT Translation

CT Based COVID 19 Diagnose by Image Processing and Deep Learning

Simple-System-Convert--C--F - Simple System Convert With Python

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.