Self-driving car env with PPO algorithm from stable baseline3

Last update: Dec 22, 2022

Related tags

Deep Learning Self-Driving-car

Overview

Self-driving car with RL stable baseline3

Most of the project develop from https://github.com/GerardMaggiolino/Gym-Medium-Post Please check it out!

This project focus on training self-driving car env by implementing PPO algorithm from stable baseline3

Installation

Clone the project

git clone https://github.com/SornsiriP/Self-Driving-car

Then run Gym-Medium-Post/main.py

Update

Wrap env to change observation space from box to RGB image

from simple_driving.resources.wrapper import ProcessFrame84

env = ProcessFrame84(env)

Using PPO with CNN policy instead of TRPO

from stable_baselines3 import PPO

model = PPO('CnnPolicy', env, verbose=1,learning_rate = 0.00025,tensorboard_log="./Simple-driving/",n_steps=10000,batch_size=1000,gamma=0.9995)
model.learn(total_timesteps=150000)

Normalize action space

def map_action(self, action):
  speed_range = [0,1]
  steer_range = [-0.6,0.6]
  new_speed = np.interp(action[0],[-1,1],speed_range)
  new_steer = np.interp(action[0],[-1,1],steer_range)
  return [new_speed, new_steer]

Add limited timestep reset condition

if self.current_step >1000:
  self.current_step = 0
  self.done = True

Normalize distance in reward function

previous_dist_to_goal = np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, self.prev_pos)))
current_dist_to_goal =  np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, car_ob[0:2])))

Reference

https://github.com/GerardMaggiolino/Gym-Medium-Post

https://www.etedal.net/2020/04/pybullet-panda_3.html

Contributing

Sornsiri Promma

Thanks original project from Gerard Maggiolino

Please make sure to update tests as appropriate.

Self-driving car env with PPO algorithm from stable baseline3

Related tags

Overview

Self-driving car with RL stable baseline3

Installation

Update

Reference

Contributing

Owner

Sornsiri.P

A module for solving and visualizing Schrödinger equation.

SMPL-X: A new joint 3D model of the human body, face and hands together

City-seeds - A random generator of cultural characteristics intended to spark ideas and help draw threads

We provided a matlab implementation for an evolutionary multitasking AUC optimization framework (EMTAUC).

Object detection GUI based on PaddleDetection

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Companion code for the paper "An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence" (NeurIPS 2021)

Official implementation of "Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks", NeurIPS 2021.

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Node Editor Plug for Blender

A DCGAN to generate anime faces using custom mined dataset

Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning models.

Pytorch implementation of the paper "Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization"

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

[IROS'21] SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning

Pytorch-Swin-Unet-V2 - a modified version of Swin Unet based on Swin Transfomer V2

A time series processing library

Exploring Cross-Image Pixel Contrast for Semantic Segmentation