Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right". We provide scripts for downloading and processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility: differences in library versions and GPUs may cause small discrepancies, but these should be extremely minor.

Dependencies

We use Python 3 and PyTorch 1.7.0. We do not rely on cutting-edge features of either, so we expect the code to be largely forward and backward compatible, though we make no guarantee of this.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use the OpenAI Beta, which is limited access. You can apply for access here. Once you have access, point score.py to your API key with the --key argument, or put your key in api.key, which is the default path.
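As a rough illustration of the key-file convention described above, the following sketch reads a key from a file whose path defaults to api.key. The helper name read_api_key and its error handling are hypothetical, and this is not necessarily how score.py loads the key.

```python
from pathlib import Path


def read_api_key(key_path: str = "api.key") -> str:
    """Read an API key from a text file (hypothetical helper, not from score.py).

    Assumes the file contains only the key, possibly followed by a newline.
    """
    path = Path(key_path)
    if not path.is_file():
        raise FileNotFoundError(
            f"No key file at {path}; pass a different path or create api.key"
        )
    key = path.read_text(encoding="utf-8").strip()
    if not key:
        raise ValueError(f"Key file {path} is empty")
    return key
```

Keeping the key in a separate file rather than on the command line keeps it out of shell history; if you do this, make sure the file is not committed to version control.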

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. We provide automatic downloaders and processors for datasets where possible in data_downloaders/, but see DATA_README.md for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py <dataset> --model <model>

where <dataset> is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. <model> is the name of either a GPT-2 or GPT-3 model, e.g. xl, davinci, etc. To speed things up, you can use a larger --batch if you have enough GPU memory.
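If you want to launch the scorer from Python, for example to loop over several datasets or models, a minimal sketch might look like the following. It only assembles the command line described above (a positional dataset abbreviation plus the --model, --batch and --key options). The helper names are hypothetical, and "copa" is used purely as a placeholder dataset abbreviation; check score.py for the abbreviations it actually accepts.

```python
import subprocess
import sys
from typing import List, Optional


def build_score_command(
    dataset: str,
    model: str,
    batch: Optional[int] = None,
    key: Optional[str] = None,
) -> List[str]:
    """Assemble the argument list for one score.py run (hypothetical helper)."""
    cmd = [sys.executable, "score.py", dataset, "--model", model]
    if batch is not None:
        cmd += ["--batch", str(batch)]  # larger batches need more GPU memory
    if key is not None:
        cmd += ["--key", key]  # path to the API key file, only needed for GPT-3
    return cmd


def run_score(dataset: str, model: str, **options) -> None:
    """Run score.py from the repository root and raise if it exits with an error."""
    subprocess.run(build_score_command(dataset, model, **options), check=True)


if __name__ == "__main__":
    # "copa" is a placeholder abbreviation; "xl" is one of the model names
    # mentioned in the text above.
    print(" ".join(build_score_command("copa", "xl", batch=16)))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems if a path contains spaces.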

Owner

Peter West
