Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

    L2ight is a closed-loop ONN on-chip learning framework to enable scalable ONN mapping and efficient in-situ learning. L2ight adopts a three-stage learning flow that first calibrates the complicated p

    Jiaqi Gu 9 Jul 14, 2022
    A list of all named GANs!

    The GAN Zoo Every week, new GAN papers are coming out and it's hard to keep track of them all, not to mention the incredibly creative ways in which re

    Avinash Hindupur 12.9k Jan 08, 2023
    A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization

    University1652-Baseline [Paper] [Slide] [Explore Drone-view Data] [Explore Satellite-view Data] [Explore Street-view Data] [Video Sample] [中文介绍] This

    Zhedong Zheng 335 Jan 06, 2023
    Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers

    Motionformer This is an official pytorch implementation of paper Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers. In this rep

    Facebook Research 192 Dec 23, 2022
    A PyTorch implementation of the baseline method in Panoptic Narrative Grounding (ICCV 2021 Oral)

    A PyTorch implementation of the baseline method in Panoptic Narrative Grounding (ICCV 2021 Oral)

    Biomedical Computer Vision @ Uniandes 52 Dec 19, 2022
    Time-Optimal Planning for Quadrotor Waypoint Flight

    Time-Optimal Planning for Quadrotor Waypoint Flight This is an example implementation of the paper "Time-Optimal Planning for Quadrotor Waypoint Fligh

    Robotics and Perception Group 38 Dec 02, 2022
    Automatic 2D-to-3D Video Conversion with CNNs

    Deep3D: Automatic 2D-to-3D Video Conversion with CNNs How To Run To run this code. Please install MXNet following the official document. Deep3D requir

    Eric Junyuan Xie 1.2k Dec 30, 2022
    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR)

    Ilya Kostrikov 3k Dec 31, 2022
    Neural-fractal - Create Fractals Using Complex-Valued Neural Networks!

    Neural Fractal Create Fractals Using Complex-Valued Neural Networks! Home Page Features Define Dynamical Systems Using Complex-Valued Neural Networks

    Amirabbas Asadi 10 Dec 17, 2022
    Official code for UnICORNN (ICML 2021)

    UnICORNN (Undamped Independent Controlled Oscillatory RNN) [ICML 2021] This repository contains the implementation to reproduce the numerical experime

    Konstantin Rusch 21 Dec 22, 2022
    CSAC - Collaborative Semantic Aggregation and Calibration for Separated Domain Generalization

    CSAC Introduction This repository contains the implementation code for paper: Co

    ScottYuan 5 Jul 22, 2022
    RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

    RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching This repository contains the source code for our paper: RAFT-Stereo: Multilevel

    Princeton Vision & Learning Lab 328 Jan 09, 2023
    🥇Samsung AI Challenge 2021 1등 솔루션입니다🥇

    MoT - Molecular Transformer Large-scale Pretraining for Molecular Property Prediction Samsung AI Challenge for Scientific Discovery This repository is

    Jungwoo Park 44 Dec 03, 2022
    PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning

    safe-control-gym Physics-based CartPole and Quadrotor Gym environments (using PyBullet) with symbolic a priori dynamics (using CasADi) for learning-ba

    Dynamic Systems Lab 300 Dec 28, 2022
    Machine Learning Platform for Kubernetes

    Reproduce, Automate, Scale your data science. Welcome to Polyaxon, a platform for building, training, and monitoring large scale deep learning applica

    polyaxon 3.2k Dec 23, 2022
    SciPy fixes and extensions

    scipyx SciPy is large library used everywhere in scientific computing. That's why breaking backwards-compatibility comes as a significant cost and is

    Nico Schlömer 16 Jul 17, 2022
    [NeurIPS 2021] Introspective Distillation for Robust Question Answering

    Introspective Distillation (IntroD) This repository is the Pytorch implementation of our paper "Introspective Distillation for Robust Question Answeri

    Yulei Niu 13 Jul 26, 2022
    Scientific Computation Methods in C and Python (Open for Hacktoberfest 2021)

    Sci - cpy README is a stub. Do expand it. Objective This repository is meant to be a ready reference for scientific computation methods. Do ⭐ it if yo

    Sandip Dutta 7 Oct 12, 2022
    Minimal implementation of PAWS (https://arxiv.org/abs/2104.13963) in TensorFlow.

    PAWS-TF 🐾 Implementation of Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples (PAWS)

    Sayak Paul 43 Jan 08, 2023
    Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"

    pair-emnlp2020 Official repository for the paper: Xinyu Hua and Lu Wang: PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long

    Xinyu Hua 31 Oct 13, 2022