Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    Weakly supervised medical named entity classification

    Trove Trove is a research framework for building weakly supervised (bio)medical named entity recognition (NER) and other entity attribute classifiers

    60 Nov 18, 2022
    This is the official github repository of the Met dataset

    The Met dataset This is the official github repository of the Met dataset. The official webpage of the dataset can be found here. What is it? This cod

    Nikolaos-Antonios Ypsilantis 35 Dec 17, 2022
    CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation (ACMMM'21 Oral Paper)

    CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation (ACMMM'21 Oral Paper) (Accepted for oral presentation at ACM

    Minha Kim 1 Nov 12, 2021
    Segment axon and myelin from microscopy data using deep learning

    Segment axon and myelin from microscopy data using deep learning. Written in Python. Using the TensorFlow framework. Based on a convolutional neural network architecture. Pixels are classified as eit

    NeuroPoly 103 Nov 29, 2022
    Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

    Splicing ViT Features for Semantic Appearance Transfer [Project Page] Splice is a method for semantic appearance transfer, as described in Splicing Vi

    Omer Bar Tal 253 Jan 06, 2023
    (CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

    BRNet Introduction This is a release of the code of our paper Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds,

    86 Oct 05, 2022
    A Python implementation of active inference for Markov Decision Processes

    A Python package for simulating Active Inference agents in Markov Decision Process environments. Please see our companion preprint on arxiv for an ove

    235 Dec 21, 2022
    To provide 100 JAX exercises over different sections structured as a course or tutorials to teach and learn for beginners, intermediates as well as experts

    JaxTon 💯 JAX exercises Mission 🚀 To provide 100 JAX exercises over different sections structured as a course or tutorials to teach and learn for beg

    Rohan Rao 512 Jan 01, 2023
    【CVPR 2021, Variational Inference Framework, PyTorch】 From Rain Generation to Rain Removal

    From Rain Generation to Rain Removal (CVPR2021) Hong Wang, Zongsheng Yue, Qi Xie, Qian Zhao, Yefeng Zheng, and Deyu Meng [PDF&&Supplementary Material]

    Hong Wang 48 Nov 23, 2022
    Bootstrapped Representation Learning on Graphs

    Bootstrapped Representation Learning on Graphs This is the PyTorch implementation of BGRL Bootstrapped Representation Learning on Graphs The main scri

    NerDS Lab :: Neural Data Science Lab 55 Jan 07, 2023
    Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2

    Graph Transformer - Pytorch Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2. This was recently used by bot

    Phil Wang 97 Dec 28, 2022
    Implementation of SSMF: Shifting Seasonal Matrix Factorization

    SSMF Implementation of SSMF: Shifting Seasonal Matrix Factorization, Koki Kawabata, Siddharth Bhatia, Rui Liu, Mohit Wadhwa, Bryan Hooi. NeurIPS, 2021

    Koki Kawabata 9 Jun 10, 2022
    Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

    Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

    Yihui He 1k Jan 03, 2023
    Program your own vulkan.gpuinfo.org query in Python. Used to determine baseline hardware for WebGPU.

    query-gpuinfo-data License This software is not presently released under a license. The data in data/ is obtained under CC BY 4.0 as specified there.

    Kai Ninomiya 5 Jul 18, 2022
    A transformer model to predict pathogenic mutations

    MutFormer MutFormer is an application of the BERT (Bidirectional Encoder Representations from Transformers) NLP (Natural Language Processing) model wi

    Wang Genomics Lab 2 Nov 29, 2022
    A knowledge base construction engine for richly formatted data

    Fonduer is a Python package and framework for building knowledge base construction (KBC) applications from richly formatted data. Note that Fonduer is

    HazyResearch 386 Dec 05, 2022
    Benchmarks for semi-supervised domain generalization.

    Semi-Supervised Domain Generalization This code is the official implementation of the following paper: Semi-Supervised Domain Generalization with Stoc

    Kaiyang 49 Dec 10, 2022
    Density-aware Single Image De-raining using a Multi-stream Dense Network (CVPR 2018)

    DID-MDN Density-aware Single Image De-raining using a Multi-stream Dense Network He Zhang, Vishal M. Patel [Paper Link] (CVPR'18) We present a novel d

    He Zhang 224 Dec 12, 2022
    Code for the AI lab course 2021/2022 of the University of Verona

    AI-Lab Code for the AI lab course 2021/2022 of the University of Verona Set-Up the environment for the curse Download Anaconda for your System. Instal

    Davide Corsi 5 Oct 19, 2022
    Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

    Emotional-Support-Conversation Copyright © 2021 CoAI Group, Tsinghua University. All rights reserved. Data and codes are for academic research use onl

    126 Dec 21, 2022