A TensorFlow implementation of SOFA, the Simulator for OFfline leArning and evaluation.

Last update: Nov 23, 2022

Overview

SOFA

This repository contains the implementation of SOFA, the Simulator for OFfline leArning and evaluation, which accompanies the following paper:

Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems. Jin Huang, Harrie Oosterhuis, Maarten de Rijke, Herke van Hoof. RecSys 2020.

The framework shows how an RL-based recommender system (RL4Rec) typically interacts with a simulation-based environment. A state is the user's historical interactions, an action is an item recommended by the RS, and a reward is derived from user feedback.
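The interaction described above can be sketched as a small loop. This is a minimal illustration only: `ToySimulator`, `run_episode`, and `random_policy_factory` are hypothetical names and not part of the SOFA code, and the reward rule (a rating of 4 or higher counts as positive feedback) is an assumption made for the example.

```python
import random


class ToySimulator:
    """Hypothetical stand-in for a simulation-based environment (not SOFA's API)."""

    def __init__(self, ratings, episode_length=5, seed=0):
        # ratings[user][item] holds a simulated rating between 1 and 5.
        self.ratings = ratings
        self.episode_length = episode_length
        self.rng = random.Random(seed)
        self.user = None
        self.history = []

    def reset(self):
        # Start a new episode with a randomly chosen user and an empty history.
        self.user = self.rng.randrange(len(self.ratings))
        self.history = []
        return tuple(self.history)

    def step(self, item):
        # Assumed reward rule: a rating of 4 or higher is positive feedback.
        rating = self.ratings[self.user][item]
        reward = 1.0 if rating >= 4 else 0.0
        # The state is the user's interaction history so far.
        self.history.append((item, reward))
        done = len(self.history) >= self.episode_length
        return tuple(self.history), reward, done


def random_policy_factory(num_items, seed=0):
    """Build a policy that recommends a random item the user has not seen yet."""
    rng = random.Random(seed)

    def policy(state):
        seen = {item for item, _ in state}
        candidates = [i for i in range(num_items) if i not in seen]
        return rng.choice(candidates)

    return policy


def run_episode(env, policy):
    """Run one episode and return the cumulative reward."""
    state = env.reset()
    total, done = 0.0, False
    while not done:
        action = policy(state)                  # action: the recommended item
        state, reward, done = env.step(action)  # reward: simulated user feedback
        total += reward
    return total


if __name__ == "__main__":
    toy_ratings = [[5, 1, 4, 2, 3, 5], [1, 1, 1, 1, 1, 1]]
    env = ToySimulator(toy_ratings, episode_length=3)
    print(run_episode(env, random_policy_factory(num_items=6)))
```

A learned policy such as a DQN would replace `random_policy_factory` here; the loop itself stays the same.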

To counter the effect of bias in logged data, we introduce a debiasing step in the simulation pipeline that corrects for these biases before the data is used to simulate user behavior.
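To make the idea of correcting logged data concrete, the sketch below uses inverse propensity scoring (IPS), a common debiasing technique. This README does not state which correction the repository applies, so IPS is an assumption chosen for illustration; the function names and the toy propensities are hypothetical.

```python
def naive_mean(observed):
    """Average the logged ratings, ignoring how likely each was to be observed."""
    return sum(rating for rating, _ in observed) / len(observed)


def ips_mean(observed):
    """Self-normalized IPS estimate of the mean rating.

    Each logged rating is weighted by the inverse of its observation
    propensity, so ratings that were unlikely to be logged count for more.
    """
    weights = [1.0 / propensity for _, propensity in observed]
    weighted_sum = sum(w * rating for w, (rating, _) in zip(weights, observed))
    return weighted_sum / sum(weights)


if __name__ == "__main__":
    # Toy log of (rating, propensity) pairs: satisfied users were four times
    # as likely to leave a rating, so high ratings are over-represented.
    logged = [(5, 0.8), (5, 0.8), (5, 0.8), (5, 0.8), (1, 0.2)]
    print("naive:", naive_mean(logged))
    print("ips:  ", ips_mean(logged))
```

In this toy log the naive average is pulled toward the over-represented high ratings, while the IPS estimate gives the single rare low rating more weight. A simulator built on the corrected estimates would then reflect user preferences rather than the logging bias.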

Running the code

$ cd examples
$ python run_dqn.py

More details

We provide the details of the DQN-based policy used in the experiments and the related hyperparameters (see the Appendix). We also provide the slides used for the presentation at RecSys 2020.

Cite

If you use our code, please cite our paper:

@inproceedings{huang2020keeping,
  title={Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems},
  author={Huang, Jin and Oosterhuis, Harrie and de Rijke, Maarten and van Hoof, Herke},
  booktitle={Fourteenth ACM Conference on Recommender Systems},
  pages={190--199},
  year={2020}
}
