Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Last update: Dec 20, 2022

Related tags

Overview

Fine-grained Post-training for Multi-turn Response Selection

Implements the model described in the following paper Fine-grained Post-training for Improving Retrieval-based Dialogue Systems in NAACL-2021.

@inproceedings{han-etal-2021-fine,
title = "Fine-grained Post-training for Improving Retrieval-based Dialogue Systems",
author = "Han, Janghoon  and Hong, Taesuk  and Kim, Byoungjae  and Ko, Youngjoong  and Seo, Jungyun",
booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
month = jun, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/2021.naacl-main.122", pages = "1549--1558",
}

This code is reimplemented as a fork of huggingface/transformers.

Setup and Dependencies

This code is implemented using PyTorch v1.8.0, and provides out of the box support with CUDA 11.2 Anaconda is the recommended to set up this codebase.

# https://pytorch.org
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install -r requirements.txt

Preparing Data and Checkpoints

Post-trained and fine-tuned Checkpoints

We provide following post-trained and fine-tuned checkpoints.

Data pkl for Fine-tuning (Response Selection)

We used the following data for post-training and fine-tuning

fine-grained post-training dataset and fine-tuning dataset for 3 benchmarks (ubuntu, douban, e-commerce)

Original version for each dataset is availble in Ubuntu Corpus V1, Douban Corpus, and E-Commerce Corpus, respectively.

Fine-grained Post-Training

Making Data for post-training and fine-tuning

Data_processing.py

Post-training Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

python -u FPT/ubuntu_final.py --num_train_epochs 25
python -u FPT/douban_final.py --num_train_epochs 27
python -u FPT/e_commmerce_final.py --num_train_epochs 34

Fine-tuning Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

Taining

To train the model, set `--is_training`
python -u Fine-Tuning/Response_selection.py --task ubuntu --is_training
python -u Fine-Tuning/Response_selection.py --task douban --is_training
python -u Fine-Tuning/Response_selection.py --task e_commerce --is_training

Testing

python -u Fine-Tuning/Response_selection.py --task ubuntu
python -u Fine-Tuning/Response_selection.py --task douban 
python -u Fine-Tuning/Response_selection.py --task e_commerce

Training Response Selection Models

Model Arguments

Fine-grained post-training

task_name	data_dir	checkpoint_path
ubuntu	ubuntu_data/ubuntu_post_train.pkl	FPT/PT_checkpoint/ubuntu/bert.pt
douban	douban_data/douban_post_train.pkl	FPT/PT_checkpoint/douban/bert.pt
e-commerce	e_commerce_data/e_commerce_post_train.pkl	FPT/PT_checkpoint/e_commerce/bert.pt

Fine-tuning

task_name	data_dir	checkpoint_path
ubuntu	ubuntu_data/ubuntu_dataset_1M.pkl	Fine-Tuning/FT_checkpoint/ubuntu.0.pt
douban	douban_data/douban_dataset_1M.pkl	Fine-Tuning/FT_checkpoint/douban.0.pt
e-commerce	e_commerce_data/e_commerce_dataset_1M.pkl	Fine-Tuning/FT_checkpoint/e_commerce.0.pt

Performance

We provide model checkpoints of BERT_FP, which obtained new state-of-the-art, for each dataset.

Ubuntu	[email protected]	[email protected]	[email protected]
[BERT_FP]	0.911	0.962	0.994

Douban	MAP	MRR	[email protected]	[email protected]	[email protected]	[email protected]
[BERT_FP]	0.644	0.680	0.512	0.324	0.542	0.870

E-Commerce	[email protected]	[email protected]	[email protected]
[BERT_FP]	0.870	0.956	0.993

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Related tags

Overview

Fine-grained Post-training for Multi-turn Response Selection

Setup and Dependencies

Preparing Data and Checkpoints

Post-trained and fine-tuned Checkpoints

Data pkl for Fine-tuning (Response Selection)

Fine-grained Post-Training

Making Data for post-training and fine-tuning

Post-training Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

Fine-tuning Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

Taining

Testing

Training Response Selection Models

Model Arguments

Fine-grained post-training

Fine-tuning

Performance

Owner

Janghoon Han

Official Implementation of "Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras"

PyTorch implementation of Higher Order Recurrent Space-Time Transformer

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

(JMLR'19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

Testing the Facial Emotion Recognition (FER) algorithm on animations

A playable implementation of Fully Convolutional Networks with Keras.

[ICCV '21] In this repository you find the code to our paper Keypoint Communities

This is a demo app to be used in the video streaming applications

Structural Constraints on Information Content in Human Brain States

[ACMMM 2021, Oral] Code release for "Elastic Tactile Simulation Towards Tactile-Visual Perception"

Python implementation of NARS (Non-Axiomatic-Reasoning-System)

Vector Quantized Diffusion Model for Text-to-Image Synthesis

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021 Accepted

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

This project is based on our SIGGRAPH 2021 paper, ROSEFusion: Random Optimization for Online DenSE Reconstruction under Fast Camera Motion .

This is a simple framework to make object detection dataset very quickly

EfficientNetV2 implementation using PyTorch

Tensorflow-seq2seq-tutorials - Dynamic seq2seq in TensorFlow, step by step