Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Last update: Dec 25, 2022

Requirements

torch==1.6.0
cudatoolkit==10.0.103
cudnn==7.6.5
sentence-transformers==0.3.9
transformers==3.4.0
tensorboardX==2.1
pandas==1.1.5
sentencepiece==0.1.85
matplotlib==3.4.1
apex==0.1.0

Get Started

Download a pre-trained language model (e.g. bert-base-uncased) from the HuggingFace model hub.
Download the STS datasets to the ./data folder using the SentEval toolkit.
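
One possible way to carry out the first step is sketched below. It is not part of this repository: the helper names download_model and missing_model_files are hypothetical, and the list of expected files assumes a PyTorch BERT checkpoint saved with the pinned transformers==3.4.0 (other versions or model types may write different files).

```python
import os

# Files that save_pretrained() is assumed to write for a PyTorch BERT
# checkpoint under transformers==3.4.0 (an assumption, not a repo requirement).
EXPECTED_FILES = ("config.json", "vocab.txt", "pytorch_model.bin")


def download_model(model_name, target_dir):
    """Fetch a model and tokenizer from the HuggingFace hub into target_dir."""
    # Imported lazily so the folder check below works without transformers.
    from transformers import AutoModel, AutoTokenizer

    os.makedirs(target_dir, exist_ok=True)
    AutoTokenizer.from_pretrained(model_name).save_pretrained(target_dir)
    AutoModel.from_pretrained(model_name).save_pretrained(target_dir)


def missing_model_files(folder):
    """Return the expected files that are absent from folder."""
    return [name for name in EXPECTED_FILES
            if not os.path.isfile(os.path.join(folder, name))]
```

For example, call download_model("bert-base-uncased", "./bert-base-uncased") once, then check that missing_model_files("./bert-base-uncased") returns an empty list before passing that folder to main.py.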

Run the following command to launch the unsupervised experiment:

python3 main.py \
  --no_pair --seed 1 \
  --use_apex_amp --apex_amp_opt_level O1 \
  --batch_size 96 --max_seq_length 64 --evaluation_steps 200 \
  --add_cl --cl_loss_only --cl_rate 0.15 --temperature 0.1 \
  --learning_rate 0.0000005 --train_data stssick --num_epochs 10 \
  --da_final_1 feature_cutoff --da_final_2 shuffle --cutoff_rate_final_1 0.2 \
  --model_name_or_path [PRETRAINED_BERT_FOLDER] \
  --model_save_path ./output/unsup-base-feature_cutoff-shuffle \
  --force_del --no_dropout --patience 10

where [PRETRAINED_BERT_FOLDER] should be replaced with the path to the folder that contains the downloaded pre-trained language model.
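
To give a rough idea of what the --temperature, --da_final_1 feature_cutoff, --da_final_2 shuffle and --cutoff_rate_final_1 options refer to, here is a minimal pure-Python sketch of an NT-Xent contrastive loss and of the two augmentations as the paper describes them. It is an illustration under assumptions, not the code in main.py (which runs in PyTorch on batched tensors); all function names are hypothetical.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def nt_xent_loss(views_1, views_2, temperature=0.1):
    """Average NT-Xent loss; views_1[i] and views_2[i] embed the same sentence."""
    reps = views_1 + views_2  # 2N embeddings
    n = len(views_1)
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)  # the other view of the same sentence
        logits = [cosine(reps[i], reps[k]) / temperature
                  for k in range(2 * n) if k != i]
        pos_logit = cosine(reps[i], reps[positive]) / temperature
        # log-sum-exp with the maximum subtracted for numerical stability
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))
        total += log_denominator - pos_logit
    return total / (2 * n)


def feature_cutoff(token_embeddings, rate=0.2, rng=random):
    """Zero out the same random subset of feature dimensions for every token."""
    dim = len(token_embeddings[0])
    dropped = set(rng.sample(range(dim), int(dim * rate)))
    return [[0.0 if j in dropped else x for j, x in enumerate(tok)]
            for tok in token_embeddings]


def shuffle_positions(seq_length, rng=random):
    """Return a random permutation of position ids (token shuffling)."""
    position_ids = list(range(seq_length))
    rng.shuffle(position_ids)
    return position_ids
```

In this sketch a lower temperature sharpens the softmax over similarities, so the loss penalizes hard negatives more strongly; the loss is small when the two views of each sentence are closer to each other than to the views of other sentences.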

Citation

@article{yan2021consert,
  title={ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer},
  author={Yan, Yuanmeng and Li, Rumei and Wang, Sirui and Zhang, Fuzheng and Wu, Wei and Xu, Weiran},
  journal={arXiv preprint arXiv:2105.11741},
  year={2021}
}

Owner

Yan Yuanmeng
