Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Last update: Dec 12, 2022

Related tags

Deep Learning SimCLS

Overview

SimCLS

Code for our paper: "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

1. How to Install

Requirements

python3
conda create --name env --file spec-file.txt
pip3 install -r requirements.txt

Description of Codes

main.py -> training and evaluation procedure
model.py -> models
data_utils.py -> dataloader
utils.py -> utility functions
preprocess.py -> data preprocessing

Workspace

Following directories should be created for our experiments.

./cache -> storing model checkpoints
./result -> storing evaluation results

2. Preprocessing

We use the following datasets for our experiments.

CNN/DailyMail -> https://github.com/abisee/cnn-dailymail
XSum -> https://github.com/EdinburghNLP/XSum

For data preprocessing, please run

python preprocess.py --src_dir [path of the raw data] --tgt_dir [output path] --split [train/val/test] --cand_num [number of candidate summaries]

src_dir should contain the following files (using test split as an example):

test.source
test.source.tokenized
test.target
test.target.tokenized
test.out
test.out.tokenized

Each line of these files should contain a sample. In particular, you should put the candidate summaries for one data sample at neighboring lines in test.out and test.out.tokenized.

The preprocessing precedure will store the processed data as seperate json files in tgt_dir.

We have provided an example file in ./example.

3. How to Run

Hyper-parameter Setting

You may specify the hyper-parameters in main.py.

Train

python main.py --cuda --gpuid [list of gpuid] -l

Fine-tune

python main.py --cuda --gpuid [list of gpuid] -l --model_pt [model path]

Evaluate

python main.py --cuda --gpuid [single gpu] -e --model_pt [model path]

4. Results

CNNDM

	ROUGE-1	ROUGE-2	ROUGE-L
BART	44.39	21.21	41.28
Ours	46.67	22.15	43.54

XSum

	ROUGE-1	ROUGE-2	ROUGE-L
Pegasus	47.10	24.53	39.23
Ours	47.61	24.57	39.44

Our model outputs on these datasets can be found in ./output.

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Related tags

Overview

SimCLS

1. How to Install

Requirements

Description of Codes

Workspace

2. Preprocessing

3. How to Run

Hyper-parameter Setting

Train

Fine-tune

Evaluate

4. Results

CNNDM

XSum

Owner

Yixin Liu

Points2Surf: Learning Implicit Surfaces from Point Clouds (ECCV 2020 Spotlight)

novel deep learning research works with PaddlePaddle

From Perceptron model to Deep Neural Network from scratch in Python.

History Aware Multimodal Transformer for Vision-and-Language Navigation

Multi-Scale Geometric Consistency Guided Multi-View Stereo

Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

tensorflow code for inverse face rendering

Over-the-Air Ensemble Inference with Model Privacy

This is the official implementation for the paper "Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization" in NeurIPS 2021.

A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.

Python SDK for building, training, and deploying ML models

Neural-fractal - Create Fractals Using Complex-Valued Neural Networks!

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

PyTorch implementation of PP-LCNet

A faster pytorch implementation of faster r-cnn

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Multi-robot collaborative exploration and mapping through Voronoi partition and DRL in unknown environment

Distributed DataLoader For Pytorch Based On Ray