Official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Last update: Dec 20, 2022

Related tags

Deep Learning KEMP

Overview

Knowledge Bridging for Empathetic Dialogue Generation

This is the official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Model Architecture

Setup

Check the packages needed or simply run the command:

pip install -r requirements.txt

Download GloVe vectors from here (glove.6B.300d.txt) and put it into /data/.
Download other data sources regarding ConceptNet and NRC_VAD lexicon, please visit Google Drive and place processed dataset kemp_dataset_preproc.json into /data/.
For reproducibility purposes, we place the model checkpoints at Google Drive. You could download and move it under /result/[MODELNAME]/result/, e.g., /result/KEMP/result/KEMP_best.tar.
To skip training, please check folder /result/[MODELNAME]/predicition/.

Data preprocessing

The dataset (EmpatheticDialogue) is preprocessed and stored under data in pickle format

python preprocess.py

Training

KEMP (Our)

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model KEMP \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--attn_loss \
--pointer_gen \
--save_path result/KEMP/ \
--emb_file data/glove.6B.300d.txt

KEMP w/o ECE

This model does not consider the emotional context graph of Emotional Context Encoder (ECE).

In ECE, we enrich the dialogue history with external knowledge into an emotional context graph. Then, the emotional signals of context are distilled based on the embeddings and emotion intensity values from the emotional context graph.

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model wo_ECE \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--pointer_gen \
--save_path result/wo_ECE/ \
--emb_file data/glove.6B.300d.txt

KEMP w/o EDD

This model does not consider the emotional dependency strategies of Emotion-Dependency Decoder (EDD).

In EDD, given emotional signal and emotional context graph, we incorporate an emotional cross-attention mechanism to selectively learn the emotional dependencies.

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model wo_EDD \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--pointer_gen \
--save_path result/wo_EDD/ \
--emb_file data/glove.6B.300d.txt

Testing

Add --test into above commands.

You can directly run /result/cal_metrics.py script to evaluate the model predictions.

Citation

If you find our work useful, please cite our paper as follows:

@article{li-etal-2022-kemp,
  title={Knowledge Bridging for Empathetic Dialogue Generation},
  author={Qintong Li and Piji Li and Zhaochun Ren and Pengjie Ren and Zhumin Chen},
  booktitle={AAAI},
  year={2022},
}

Official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Related tags

Overview

Knowledge Bridging for Empathetic Dialogue Generation

Model Architecture

Setup

Data preprocessing

Training

KEMP (Our)

KEMP w/o ECE

KEMP w/o EDD

Testing

Citation

Owner

Qintong Li

Official repository for Fourier model that can generate periodic signals

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

All course materials for the Zero to Mastery Deep Learning with TensorFlow course.

An End-to-End Machine Learning Library to Optimize AUC (AUROC, AUPRC).

Google-drive-to-sqlite - Create a SQLite database containing metadata from Google Drive

Resilient projection-based consensus actor-critic (RPBCAC) algorithm

Web service for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation based on OpenFace 2.0

Code from Daniel Lemire, A Better Alternative to Piecewise Linear Time Series Segmentation

Lightweight mmm - Lightweight (Bayesian) Media Mix Model

Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

A SAT-based sudoku solver

Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation

A flexible submap-based framework towards spatio-temporally consistent volumetric mapping and scene understanding.

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Genpass - A Passwors Generator App With Python3

Experimental solutions to selected exercises from the book [Advances in Financial Machine Learning by Marcos Lopez De Prado]

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks

The ICS Chat System project for NYU Shanghai Fall 2021

A deep learning model for style-specific music generation.