mkultra

mkultra is a prompt tuning toolkit for GPT-2 and GPT-Neo.

Prompt tuning injects a string of 20-100 special tokens into the context to influence text generation. These tokens are trained on a corpus much like a finetune, but take up a fraction of the space. The Neuromancer example is only 401 KB for 100 tokens.
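
To put "a fraction of the space" in numbers, here is a back-of-the-envelope sketch. It assumes a soft prompt is stored as one float32 embedding vector per token, and it uses GPT-2 small's hidden size (768) and approximate parameter count (124M). The function names are illustrative and not part of mkultra, and the real file size depends on how the prompt is serialized.

```python
# Rough storage estimate: soft prompt versus a full GPT-2 small checkpoint.
# Assumptions: float32 weights, hidden size 768, about 124M model parameters.

BYTES_PER_FLOAT32 = 4


def soft_prompt_bytes(n_tokens: int, hidden_size: int) -> int:
    """Raw size of an (n_tokens x hidden_size) float32 embedding matrix."""
    return n_tokens * hidden_size * BYTES_PER_FLOAT32


def full_model_bytes(n_params: int) -> int:
    """Raw size of a model with n_params float32 parameters."""
    return n_params * BYTES_PER_FLOAT32


prompt_size = soft_prompt_bytes(n_tokens=100, hidden_size=768)
model_size = full_model_bytes(n_params=124_000_000)

print(f"soft prompt: {prompt_size / 1024:.0f} KiB")
print(f"full model:  {model_size / 1024**2:.0f} MiB")
print(f"ratio:       {prompt_size / model_size:.4%}")
```

Under these assumptions the raw prompt weights come to 300 KiB, which is the same order of magnitude as the 401 KB sample file.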

Read the original paper: https://arxiv.org/abs/2104.08691

Text Generation

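from transformers import pipeline
# GPT2SoftPromptLM, GPT2SPTokenizerFast and SoftPrompt are provided by mkultra
# and must be imported from it as well.

# Load a soft-prompt-aware model and its matching tokenizer
model = GPT2SoftPromptLM.from_pretrained("gpt2")
tokenizer = GPT2SPTokenizerFast.from_pretrained("gpt2")
generator = pipeline('text-generation', model=model, tokenizer=tokenizer)

# Load a trained soft prompt and prepend it to the text context
sp = SoftPrompt.from_file("sample_sps/finetune/neuromancer_gpt2.json")
prompt = sp + "The sky over the port"
output = generator(prompt)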

SoftPrompts can be concatenated at any point into your context as if they were strings. When the context is printed, SoftPrompts show up as human-readable tags for debugging. They also tokenize to the underlying number of tokens for easy budgeting.
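
Because a soft prompt occupies context like any other tokens, budgeting comes down to subtraction. The sketch below is a minimal illustration, assuming the standard maximum sequence lengths of 1024 for GPT-2 and 2048 for GPT-Neo. The helper `remaining_tokens` and the `CONTEXT_SIZE` table are hypothetical names and not part of mkultra.

```python
# Token budgeting sketch (hypothetical helper, not part of mkultra).
# Context sizes are the models' standard maximum sequence lengths.
CONTEXT_SIZE = {"gpt2": 1024, "gpt-neo": 2048}


def remaining_tokens(model: str, soft_prompt_tokens: int, text_tokens: int) -> int:
    """Tokens left for generation after the soft prompt and the text context."""
    remaining = CONTEXT_SIZE[model] - soft_prompt_tokens - text_tokens
    if remaining < 0:
        raise ValueError("context exceeds the model's maximum sequence length")
    return remaining


# A 100-token soft prompt plus 600 tokens of story text on GPT-2
print(remaining_tokens("gpt2", soft_prompt_tokens=100, text_tokens=600))  # 324
```

In practice the two token counts would come from tokenizing the soft prompt and the text with the mkultra tokenizer rather than being hard-coded.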

See the text generation notebook for pointers on adding mkultra to your generator.

Training

For finetune-like soft prompts, the finetune notebook demonstrates training on a corpus.

For AI text adventures or writing, the World Info notebook demonstrates tuning a soft prompt to describe a character or setting. This is highly experimental.

Limitations (for now)

The Huggingface Trainer class should work as long as you set params=[model.get_soft_params()] on the optimizer, but it will still save full model checkpoints.
mkultra syncs a set of special tokens between its tokenizers behind the scenes. Adding your own tokens may result in unexpected behaviour.
