Black-Box-Tuning

Source code for paper "Black-Box Tuning for Language-Model-as-a-Service".

Being busy recently, the code in this repo and this tutorial will be very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple, you can check our code and easily implement it in your own environment. Or you can create a new environment to run our implementation, which is based on Nevergrad, Transformers and FastNLP. Optionally, we use fitlog to monitor experimental results. You can uncomment the fitlog-related lines in our code to use it.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install sklearn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results will be saved in a directory named results/. In general, you will obtain the following results:

SST-2 split	Best Accuracy
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments of bbt.py, for example,

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-as-Service}, 
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Related tags

Overview

Black-Box-Tuning

Prepare your environment

Optimize your prompt without gradients

Cite

Owner

Tianxiang Sun

Code for Efficient Visual Pretraining with Contrastive Detection

"Neural Turing Machine" in Tensorflow

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme (NeurIPS2021)

Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

(ICCV 2021 Oral) Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.

CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Datasets, tools, and benchmarks for representation learning of code.

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Prompt-BERT: Prompt makes BERT Better at Sentence Embeddings

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening

End-to-end speech secognition toolkit

Pytorch implementation of our paper under review — Lottery Jackpots Exist in Pre-trained Models

Credit fraud detection in Python using a Jupyter Notebook

Implementation of the paper "Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning"