Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

Last update: Dec 30, 2022

PTR

If you use the code, please cite the following paper:

@article{han2021ptr,
  title={PTR: Prompt Tuning with Rules for Text Classification},
  author={Han, Xu and Zhao, Weilin and Ding, Ning and Liu, Zhiyuan and Sun, Maosong},
  journal={arXiv preprint arXiv:2105.11259},
  year={2021}
}

Requirements

The model is implemented in PyTorch. The minimum package versions are listed below.

numpy>=1.18.0
scikit-learn>=0.22.1
scipy>=1.4.1
torch>=1.3.0
tqdm>=4.41.1
transformers>=4.0.0
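
To confirm that an existing environment meets these minimums before launching an experiment, a small check along the following lines may help. This is only a sketch and not part of the PTR code base: the names `MINIMUM_VERSIONS`, `version_tuple`, `is_older`, and `check_requirements` are hypothetical, the version parsing is deliberately simple (numeric components only), and `importlib.metadata` requires Python 3.8 or later.

```python
from importlib import metadata

# Minimum versions copied from the requirements list in this README.
MINIMUM_VERSIONS = {
    "numpy": "1.18.0",
    "scikit-learn": "0.22.1",
    "scipy": "1.4.1",
    "torch": "1.3.0",
    "tqdm": "4.41.1",
    "transformers": "4.0.0",
}


def version_tuple(version):
    """Turn '1.13.1+cu117' into (1, 13, 1); non-numeric suffixes are ignored."""
    parts = []
    for piece in version.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def is_older(installed, minimum):
    """Compare two version strings after padding them to the same length."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a < b


def check_requirements(minimums=MINIMUM_VERSIONS):
    """Return a list of human-readable problems; the list is empty if all is fine."""
    problems = []
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed (need >= {minimum})")
            continue
        if is_older(installed, minimum):
            problems.append(f"{name} {installed} is older than {minimum}")
    return problems


if __name__ == "__main__":
    for line in check_requirements() or ["All listed requirements are satisfied."]:
        print(line)
```

Run as a script, it prints one line per missing or outdated package, or a single confirmation line when nothing is wrong.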

Baselines

Some baselines, especially those that use entity markers, come from the RE_improved_baseline project.

Datasets

We provide all the datasets and prompts used in our experiments.

Run the experiments

(1) For TACRED

# Run from the directory that contains code_script
mkdir -p results/tacred/train results/tacred/val results/tacred/test
cd code_script
bash run_large_tacred.sh

(2) For TACREV

# Run from the directory that contains code_script
mkdir -p results/tacrev/train results/tacrev/val results/tacrev/test
cd code_script
bash run_large_tacrev.sh

(3) For RETACRED

# Run from the directory that contains code_script
mkdir -p results/retacred/train results/retacred/val results/retacred/test
cd code_script
bash run_large_retacred.sh
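
The three recipes above differ only in the dataset name, so the result directories for all of them can be prepared in one step. The following is a minimal sketch, assuming it is run from the directory that contains `code_script`. The helper `prepare_results` and the constants `DATASETS` and `SPLITS` are hypothetical names introduced here for illustration and do not appear in the repository. The run scripts themselves still need to be launched as shown above.

```python
from pathlib import Path

# Dataset and split names taken from the commands in this README.
DATASETS = ("tacred", "tacrev", "retacred")
SPLITS = ("train", "val", "test")


def prepare_results(root=".", datasets=DATASETS, splits=SPLITS):
    """Create results/<dataset>/<split> under root and return the paths."""
    created = []
    for dataset in datasets:
        for split in splits:
            path = Path(root) / "results" / dataset / split
            # exist_ok=True makes the call safe to repeat on later runs.
            path.mkdir(parents=True, exist_ok=True)
            created.append(path)
    return created


if __name__ == "__main__":
    for path in prepare_results():
        print(path)
```

Because existing directories are left untouched, the helper can be called again before each experiment without removing earlier results.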

Owner

THUNLP
