This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

Towards the D-Optimal Online Experiment Design for Recommender Selection (KDD 2021)

Revisiting Temporal Alignment for Video Restoration

Implementation of the GBST block from the Charformer paper, in Pytorch

Pytorch code for our paper Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains)

LineBoard - Python+React+MySQL-白板即時系統改善人群行為

Streamlit component for TensorBoard, TensorFlow's visualization toolkit

Doods2 - API for detecting objects in images and video streams using Tensorflow

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

TiP-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling

DPC: Unsupervised Deep Point Correspondence via Cross and Self Construction (3DV 2021)

This repository contains the map content ontology used in narrative cartography

AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation

PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.

PyTorch(Geometric) implementation of G^2GNN in "Imbalanced Graph Classification via Graph-of-Graph Neural Networks"

Sleep staging from ECG, assisted with EEG

Solution of Kaggle competition: Sartorius - Cell Instance Segmentation

PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)

Projecting interval uncertainty through the discrete Fourier transform

ARAE-Tensorflow for Discrete Sequences (Adversarially Regularized Autoencoder)

PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT