Leaderboard and Visualization for RLCard

Last update: Dec 26, 2022

Related tags

Overview

RLCard Showdown

This is the GUI support for the RLCard project and DouZero project. RLCard-Showdown provides evaluation and visualization tools to help understand the performance of the agents. It includes a replay module, where you can analyze the replays, and a PvE module, where you can play with the AI interactively. Currently, we only support Leduc Hold'em and Dou Dizhu. The frontend is developed with React. The backend is based on Django and Flask. Have fun!

Official Website: http://www.rlcard.org
Tutorial in Jupyter Notebook: https://github.com/datamllab/rlcard-tutorial
Paper: https://www.ijcai.org/Proceedings/2020/764
Document: Click Here
Online Demo with DouZero: https://www.douzero.org/

Cite this work

Zha, Daochen, et al. "RLCard: A Platform for Reinforcement Learning in Card Games." IJCAI. 2020.

@inproceedings{zha2020rlcard,
  title={RLCard: A Platform for Reinforcement Learning in Card Games},
  author={Zha, Daochen and Lai, Kwei-Herng and Huang, Songyi and Cao, Yuanpu and Reddy, Keerthana and Vargas, Juan and Nguyen, Alex and Wei, Ruzhe and Guo, Junyu and Hu, Xia},
  booktitle={IJCAI},
  year={2020}
}

Installation

RLCard-Showdown has separated frontend and backend. The frontend is built with React and the backend is based on Django and Flask.

Prerequisite

To set up the frontend, you should make sure you have Node.js and NPM installed. Normally you just need to manually install Node.js, and the NPM package would be automatically installed together with Node.js for you. Please refer to its official website for installation of Node.js.

You can run the following commands to verify the installation

node -v
npm -v

For backend, make sure that you have Python 3.6+ and pip installed.

Install Frontend and Backend

The frontend can be installed with the help of NPM:

git clone -b master --single-branch --depth=1 https://github.com/datamllab/rlcard-showdown.git
cd rlcard-showdown
npm install

The backend of leaderboard can be installed with

pip3 install -r requirements.txt
cd server
python3 manage.py migrate
cd ..

Run RLCard-Showdown

Launch the backend of leaderboard with

cd server
python3 manage.py runserver

Download the pre-trained models in Google Drive or 百度网盘提取码: qh6s. Extract it in pve_server/pretrained.

In a new terminal, start the PvE server (i.e., human vs AI) of DouZero with

cd pve_server
python3 run_douzero.py

Alternatively, you can start the PvE server interfaced with RLCard:

cd pve_server
python3 run_dmc.py

They are conceptually the same with minor differences in state representation and training time of the pre-trained models (DouZero is fully trained with more than a month, while DMC in RLCard is only trained for hours).

Run the following command in another new terminal under the project folder to start frontend:

npm start

You can view leaderboard at http://127.0.0.1:3000/ and PvE demo of Dou Dizhu at http://127.0.0.1:3000/pve/doudizhu-demo. The backend of leaderboard will run in http://127.0.0.1:8000/. The PvE backend will run in http://127.0.0.1:5000/.

Demos

Contact Us

If you have any questions or feedback, feel free to drop an email to Songyi Huang for the frontend or Daochen Zha for backend.

Acknowledgements

We would like to thank JJ World Network Technology Co., LTD for the generous support, Chieh-An Tsai for user interface design, and Lei Pan for the help in visualizations.

Leaderboard and Visualization for RLCard

Related tags

Overview

RLCard Showdown

Cite this work

Installation

Prerequisite

Install Frontend and Backend

Run RLCard-Showdown

Demos

Contact Us

Acknowledgements

Owner

Data Analytics Lab at Texas A&M University

A Pytorch implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

An Active Automata Learning Library Written in Python

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region

Official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.

Implementation of Stochastic Image-to-Video Synthesis using cINNs.

the code for paper "Energy-Based Open-World Uncertainty Modeling for Confidence Calibration"

Code for the paper "Improved Techniques for Training GANs"

Python implementation of "Elliptic Fourier Features of a Closed Contour"

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

Machine learning Bot detection technique, based on United States election dataset

Einshape: DSL-based reshaping library for JAX and other frameworks.

Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"

DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection

Scales, Chords, and Cadences: Practical Music Theory for MIR Researchers

Official PyTorch implementation of the paper "Graph-based Generative Face Anonymisation with Pose Preservation" in ICIAP 2021

Random Forests for Regression with Missing Entries

Official implementation for "QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation" (CVPR 2022)