Keyword2Text: code for the paper "A Plug-and-Play Method for Controlled Text Generation"

Last update: Dec 27, 2022

This repository contains the code for the paper "A Plug-and-Play Method for Controlled Text Generation". If you find it useful and use it in your own research, please cite us.

Setup

Download and unzip the repository.
Create a new conda environment and install the required libraries from the requirements.txt file.

conda create -n k2t python=3.6
conda activate k2t
pip install -r requirements.txt

A GPU is required to run the experiments. Make sure a results folder exists in the repository root before you start a run.
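The results folder can also be created from Python before a run. The sketch below is one way to do that; `ensure_results_dir` and `gpu_driver_visible` are hypothetical helpers, not part of the repository, and looking for `nvidia-smi` on the PATH is only a rough heuristic for whether a GPU driver is installed.

```python
import shutil
from pathlib import Path


def ensure_results_dir(repo_root="."):
    """Create <repo_root>/results if it is missing and return its path."""
    results = Path(repo_root) / "results"
    results.mkdir(parents=True, exist_ok=True)
    return results


def gpu_driver_visible():
    """Heuristic only: True if the nvidia-smi tool is found on the PATH."""
    return shutil.which("nvidia-smi") is not None
```

Calling `ensure_results_dir()` from the repository root is safe to repeat, since an existing folder is left untouched.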

Run Model

Hyperparameter Study

Uncomment the appropriate lines of run.sh to run the hyperparameter experiments from the paper. For example,

python main.py -mode='next' -file_name=/data/50_keywordsets_eval/word_sets.txt -results_subfolder=guide_vs_no_guide_beams -weight=10.0 -top_p=0.9 -n_generated_sentences=90 -do_guarantee=True

runs K2T with ordered guide words (mode='next') on the random keywords dataset. It uses lambda = weight = 10 and nucleus sampling with top-p = 0.9, and generates 90 tokens (-n_generated_sentences=90). The -do_guarantee flag controls the weight annealing that guarantees word appearance; it is set to True here. The results are saved under results/, in the subfolder named by -results_subfolder.
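To make the two main knobs concrete, here is a minimal pure-Python sketch of how a guide weight and nucleus sampling can interact at a single decoding step. It is not the repository's implementation: the additive shift `weight * max(0, similarity)` is an assumption about how `-weight` acts on the logits, and `shift_logits`, `top_p_filter`, `sample_next`, and the similarity values passed in are hypothetical.

```python
import math
import random


def softmax(logits):
    """Convert logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def shift_logits(logits, similarities, weight):
    """Assumed guidance rule: add weight * max(0, similarity) to each logit."""
    return [logit + weight * max(0.0, sim)
            for logit, sim in zip(logits, similarities)]


def top_p_filter(probs, top_p):
    """Keep the smallest set of most likely tokens whose total mass reaches
    top_p, renormalise them, and give every other token probability 0."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    filtered = [0.0] * len(probs)
    for i in kept:
        filtered[i] = probs[i] / mass
    return filtered


def sample_next(logits, similarities, weight=10.0, top_p=0.9, rng=random):
    """Sample one token index after the guidance shift and nucleus filtering."""
    shifted = shift_logits(logits, similarities, weight)
    probs = top_p_filter(softmax(shifted), top_p)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

With `weight=0` the shift vanishes and `sample_next` reduces to plain nucleus sampling, which is a convenient baseline when comparing guided and unguided runs.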

ROC Story dataset

Uncomment the appropriate line of run.sh to run the model on the ROC story dataset:

python main.py -mode='max' -file_name=/data/ROC/ROCStories_20_storylines_500_0.txt -results_subfolder=final4_ -weight=5.0 -top_p=0.9 -n_generated_sentences=-7 -n_beams=4 -do_guarantee=True -task='ROC'

News Article dataset

Uncomment the appropriate line of run.sh to run the model on the News Article dataset:

python main_DBS.py -mode='max' -file_name=/data/keyword_to_articles -results_subfolder=tmp -weight=5.0 -top_p=0.9 -n_generated_sentences=-15 -n_beams=4 -do_guarantee=True -task='key2article'
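Instead of editing run.sh, the same invocation can be assembled from Python. The sketch below only rebuilds the command string shown above from a list of flags; `build_command` is a hypothetical helper, not part of the repository, and it assumes the single-dash `-flag=value` syntax used in these examples.

```python
import shlex


def build_command(script, flags):
    """Return a shell-ready string: python <script> -key=value ..."""
    argv = ["python", script]
    argv += ["-{}={}".format(key, value) for key, value in flags]
    return " ".join(shlex.quote(arg) for arg in argv)


# Flag values copied from the News Article command above.
news_flags = [
    ("mode", "max"),
    ("file_name", "/data/keyword_to_articles"),
    ("results_subfolder", "tmp"),
    ("weight", 5.0),
    ("top_p", 0.9),
    ("n_generated_sentences", -15),
    ("n_beams", 4),
    ("do_guarantee", True),
    ("task", "key2article"),
]

print(build_command("main_DBS.py", news_flags))
```

The quotes around `'max'` and `'key2article'` in the original command are shell quoting, so they do not appear in the generated string. The resulting string can be passed to a shell or split back into arguments with `shlex.split`.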

Contents

├── data
│   ├── 50_keywordsets_eval
│   │   └── word_sets.txt
│   ├── keyword_to_articles
│   │   ├── test_10.txt
│   │   ├── test_12.txt
│   │   ├── test_13.txt
│   │   ├── test_14.txt
│   │   ├── test_15.txt
│   │   ├── test_16.txt
│   │   ├── test_4.txt
│   │   ├── test_5.txt
│   │   ├── test_8.txt
│   │   └── test_9.txt
│   └── ROC
│       └── ROCStories_20_storylines_500_0.txt
├── encode_keywords.py
├── encode_keywords_word2vec.py
├── main.py
├── metrics_degen.py
├── metrics_degen_run.sh
├── perplexity.py
├── README.md
├── requirements.txt
├── results
├── run.sh
└── utility_gpt.py
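Before launching a run it can help to confirm that the files the commands refer to are in place. The sketch below checks a few paths taken from the tree above; `missing_paths` is a hypothetical helper, and the `EXPECTED` list is only a subset chosen for illustration.

```python
from pathlib import Path

# A subset of the paths listed in the tree above.
EXPECTED = [
    "data/50_keywordsets_eval/word_sets.txt",
    "data/keyword_to_articles",
    "data/ROC/ROCStories_20_storylines_500_0.txt",
    "main.py",
    "requirements.txt",
    "results",
]


def missing_paths(repo_root=".", expected=EXPECTED):
    """Return the expected paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [p for p in expected if not (root / p).exists()]
```

An empty list from `missing_paths()` means every checked path was found relative to the current directory.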
