RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

Last update: Dec 22, 2022

Overview

RITA: a Study on Scaling Up Generative Protein Sequence Models

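RITA is a family of autoregressive protein models, developed in collaboration by LightOn, the OATML group at Oxford, and the Debora Marks Lab at Harvard. The family spans four sizes, from 85M to 1.2B parameters, listed here with their language-modeling loss on UniRef-100.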

Model	#Params	d_model	Layers	LM loss (UniRef-100)
Small	85M	768	12	2.31
Medium	300M	1024	24	2.01
Large	680M	1536	24	1.82
XLarge	1.2B	2048	24	1.70
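
As a rough sanity check on the table, the parameter counts can be approximated from d_model and the number of layers. The sketch below assumes a standard decoder-only transformer block (about 4·d² attention weights plus 8·d² feed-forward weights with a 4× hidden size) and ignores embeddings, biases, and layer norms; the preprint should be consulted for the exact architecture. It also converts the reported loss to perplexity, assuming the loss is a per-token cross-entropy in nats. The function names are illustrative and not part of the RITA code.

```python
import math

# (name, d_model, layers, reported params, reported LM loss), copied from the table.
RITA_CONFIGS = [
    ("Small", 768, 12, 85e6, 2.31),
    ("Medium", 1024, 24, 300e6, 2.01),
    ("Large", 1536, 24, 680e6, 1.82),
    ("XLarge", 2048, 24, 1.2e9, 1.70),
]


def estimate_params(d_model: int, layers: int) -> int:
    """Approximate non-embedding parameters as 12 * layers * d_model^2.

    Assumption: each block holds ~4*d^2 attention weights and ~8*d^2
    feed-forward weights; embeddings, biases, and layer norms are ignored.
    """
    return 12 * layers * d_model ** 2


def perplexity(loss_nats: float) -> float:
    """Convert a cross-entropy loss to perplexity, assuming the loss is in nats."""
    return math.exp(loss_nats)


if __name__ == "__main__":
    for name, d_model, layers, reported, loss in RITA_CONFIGS:
        est = estimate_params(d_model, layers)
        print(
            f"{name:7s} estimated {est / 1e6:6.0f}M  "
            f"reported {reported / 1e6:6.0f}M  "
            f"perplexity {perplexity(loss):.2f}"
        )
```

Under these assumptions the estimate lands within about 1% of every reported size, which suggests the table counts are dominated by the transformer blocks rather than the embeddings.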

Results

For full results see our preprint: https://arxiv.org/abs/2205.05789

Usage

Instantiate a model like so:

from transformers import AutoModelForCausalLM, AutoTokenizer
# trust_remote_code=True lets transformers load the custom model code from the Hub repository
model = AutoModelForCausalLM.from_pretrained("lightonai/RITA_s", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("lightonai/RITA_s")

For generation, we support pipelines:

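from transformers import pipeline
# Build a text-generation pipeline from the model and tokenizer loaded above
rita_gen = pipeline('text-generation', model=model, tokenizer=tokenizer)
# Sample two continuations of the prompt "MAB"; max_length counts the prompt tokens too
sequences = rita_gen("MAB", max_length=20, do_sample=True, top_k=950, repetition_penalty=1.2,
                     num_return_sequences=2, eos_token_id=2)
for seq in sequences:
    # Strip the spaces from the decoded text before printing
    print(f"seq: {seq['generated_text'].replace(' ', '')}")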

Or see example.py
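
Sampled sequences may contain ambiguous residue codes (the prompt "MAB" itself contains B, which denotes Asx). A minimal post-processing sketch is shown here, assuming the pipeline returns a list of dicts with a 'generated_text' key as in the loop above. The helper names and the choice to keep only the 20 canonical amino acids are illustrative assumptions, not part of the RITA code.

```python
# The 20 canonical amino acids in one-letter code.
CANONICAL_AMINO_ACIDS = frozenset("ACDEFGHIKLMNPQRSTVWY")


def clean_generated(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from decoded text."""
    return "".join(text.split())


def is_canonical(sequence: str) -> bool:
    """True if the sequence is non-empty and uses only canonical residues."""
    return bool(sequence) and all(r in CANONICAL_AMINO_ACIDS for r in sequence)


def filter_canonical(outputs):
    """Clean each pipeline output and keep only fully canonical sequences.

    `outputs` is assumed to be a list of dicts with a 'generated_text' key.
    """
    cleaned = (clean_generated(o["generated_text"]) for o in outputs)
    return [s for s in cleaned if is_canonical(s)]


if __name__ == "__main__":
    # Hand-written stand-ins for pipeline output, not real model samples.
    fake_outputs = [
        {"generated_text": "M A K L V"},
        {"generated_text": "M A B X T"},
    ]
    print(filter_canonical(fake_outputs))
```

Whether to discard or keep sequences with ambiguous codes depends on the downstream task; dropping them is only the simplest option.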

How to cite

@article{hesslow2022rita,
  title={RITA: a Study on Scaling Up Generative Protein Sequence Models},
  author={Hesslow, Daniel and Zanichelli, Niccol{\'o} and Notin, Pascal and Poli, Iacopo and Marks, Debora},
  journal={arXiv preprint arXiv:2205.05789},
  year={2022}
}
