Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations

Last update: Sep 09, 2022

HierarchicyBandit

Introduction

This is the implementation of the WSDM 2022 paper "Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations".
It provides reference code for HCB and pHCB, each built on three different base bandit algorithms:

LinUCB from A contextual-bandit approach to personalized news article recommendation
epsilon-Greedy, a strategy with random exploration on an epsilon fraction of the traffic and greedy exploitation on the rest
Thompson Sampling from Thompson Sampling for Contextual Bandits with Linear Payoffs
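
As an illustration of the simplest of the three base strategies, here is a minimal sketch of epsilon-Greedy arm selection. It is not taken from algsE/; the function name epsilon_greedy_choice and the plain list of per-arm scores are assumptions made for this example.

```python
import random


def epsilon_greedy_choice(scores, epsilon, rng=random):
    """Return an arm index: explore with probability epsilon, else exploit.

    `scores` is a hypothetical list of estimated rewards, one per arm.
    `rng` can be the `random` module or a `random.Random` instance.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    # Explore: pick any arm uniformly at random.
    if rng.random() < epsilon:
        return rng.randrange(len(scores))
    # Exploit: pick the arm with the highest estimated reward.
    return max(range(len(scores)), key=scores.__getitem__)


scores = [0.1, 0.7, 0.3]
greedy_pick = epsilon_greedy_choice(scores, epsilon=0.0)  # never explores
```

With epsilon set to 0.0 the function always exploits, and with 1.0 it always explores, which makes the two extremes easy to check by hand.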

Files in the folder

data/
- MIND/ and TaoBao/
  - item_info.pkl: processed item file, including item ids, item features, and embeddings for the simulator;
  - user_info.pkl: processed user file, including user ids and embeddings for the simulator;
  - item_info_ts.pkl: processed item file for Thompson sampling;
algs/: implementations of HCB and pHCB based on LinUCB.
algsE/: implementations of HCB and pHCB based on epsilon-Greedy.
algsTS/: implementations of HCB and pHCB based on Thompson Sampling.

Note

Before testing the algorithms, modify the settings in config.py.
For Thompson sampling, we provide an additional set of 16-dimensional feature vectors for running the experiments, since they make the runs faster. The original feature vectors also work with the algorithms.
user_info.pkl and item_info.pkl are formatted as dictionaries.
The implementation of ConUCB is released at ConUCB. HMAB and ICTRUCB are special cases of CB-Category and CB-Leaf.
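
Since the .pkl files are dictionaries, a quick way to inspect them is to load one and check its type. The sketch below assumes only what is stated above (a pickled dictionary); the helper name load_info is made up for this example, and the keys and values inside each dictionary are not documented here.

```python
import pickle
from pathlib import Path


def load_info(path):
    """Load a pickled file and fail early if it is not a dictionary."""
    with Path(path).open("rb") as fh:
        obj = pickle.load(fh)
    if not isinstance(obj, dict):
        raise TypeError(f"{path}: expected dict, got {type(obj).__name__}")
    return obj
```

For example, load_info("data/MIND/item_info.pkl") should return the item dictionary when called from the repository root, assuming the data/ layout listed above. Only unpickle files you trust, because pickle can execute code on load.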

Usage:

Download HierarchicyBandit.zip and unzip it. You will get five folders: algs/, algsE/, algsTS/, data/, and logger/.

Parameters:
The config.py file contains:

dataset: the dataset used in the experiment, either 'MIND' or 'TaoBao';  
T: the number of rounds for each bandit algorithm;  
k: the number of items recommended to the user in each round (default 1);  
activate_num: the hyper-parameter p for pHCB;  
activate_prob: the hyper-parameter q for pHCB;  
epsilon: the epsilon value for the greedy-based algorithms;  
new_tree_file: the tree file name;  
noise_scale: the standard deviation of the environmental noise;  
keep_prob: the sampling ratio (default 1.0, which means testing all users);  
linucb_para: the hyper-parameters for the LinUCB algorithm;  
ts_para: the hyper-parameters for the Thompson sampling algorithm;  
poolsize: the size of the candidate pool;  
random_choice: whether to recommend a randomly chosen item to the user.
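
The sketch below shows one way to sanity-check a few of these settings before a long run. It is not part of the repository: the dictionary layout, the function name validate_config, and every value marked "placeholder" are assumptions. Only the allowed dataset names and the defaults for k and keep_prob come from the list above; the range checks follow from the parameter meanings (a fraction, a ratio, a standard deviation).

```python
def validate_config(cfg):
    """Return a list of problems found in a hypothetical settings dict."""
    problems = []
    if cfg.get("dataset") not in ("MIND", "TaoBao"):
        problems.append("dataset must be 'MIND' or 'TaoBao'")
    for name in ("T", "k"):
        value = cfg.get(name)
        if not isinstance(value, int) or value < 1:
            problems.append(f"{name} must be a positive integer")
    # epsilon is a fraction of traffic, so it should lie in [0, 1].
    if not 0.0 <= cfg.get("epsilon", 0.0) <= 1.0:
        problems.append("epsilon must be between 0 and 1")
    # noise_scale is a standard deviation, so it cannot be negative.
    if cfg.get("noise_scale", 0.0) < 0.0:
        problems.append("noise_scale must be non-negative")
    # keep_prob is a sampling ratio; 1.0 means all users are tested.
    if not 0.0 < cfg.get("keep_prob", 1.0) <= 1.0:
        problems.append("keep_prob must be in (0, 1]")
    return problems


example = {
    "dataset": "MIND",
    "T": 1000,             # placeholder, not a value from the repository
    "k": 1,                # documented default
    "epsilon": 0.05,       # placeholder
    "noise_scale": 0.001,  # placeholder
    "keep_prob": 1.0,      # documented default
}
problems = validate_config(example)
```

An empty list means none of the checks failed; the check says nothing about the settings it does not cover, such as linucb_para or ts_para.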

Environment: Python 3.6 with Anaconda.

To run the bandit code based on LinUCB:

$ cd algs
$ python simulator_multi_process.py

To run the bandit code based on epsilon-Greedy:

$ cd algsE
$ python simulator_multi_process.py

To run the bandit code based on Thompson sampling:

$ cd algsTS
$ python simulator_multi_process.py
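
The three commands above differ only in the folder they are run from, so they can also be launched from a small Python helper. This is a convenience sketch, not a script shipped with the repository; the names VARIANT_DIRS, build_command, and run_variant are made up here, and it assumes the folders sit directly under the directory passed as repo_root.

```python
import subprocess
import sys
from pathlib import Path

# Folder names come from the commands above; the short keys are illustrative.
VARIANT_DIRS = {
    "linucb": "algs",
    "egreedy": "algsE",
    "ts": "algsTS",
}


def build_command(python=sys.executable):
    """Command line equivalent to `python simulator_multi_process.py`."""
    return [python, "simulator_multi_process.py"]


def run_variant(name, repo_root="."):
    """Run one variant from inside its folder, like doing `cd` first."""
    folder = Path(repo_root) / VARIANT_DIRS[name]
    # check=True raises CalledProcessError if the simulator exits non-zero.
    return subprocess.run(build_command(), cwd=folder, check=True)
```

Calling run_variant("ts") from the unzipped folder would then correspond to the Thompson sampling commands above.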

Owner

yu song
