The SVO-Probes Dataset for Verb Understanding

This repository contains the SVO-Probes benchmark, designed to probe for Subject, Verb, and Object understanding in image–language models. For a given sentence, the benchmark provides a positive image and a negative image. The negative image differs from the positive one with respect to either the subject, the verb, or the object. Given a sentence, we test whether a model can correctly classify both the positive and the negative image.
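As a rough illustration of the evaluation described above, the sketch below scores a model from its binary match decisions. The function name `svo_accuracy`, the input format, and the "both correct" aggregate are assumptions made for this example; they are not taken from the paper or the Colab.

```python
def svo_accuracy(pos_preds, neg_preds):
    """Score binary match predictions on sentence/image pairs.

    pos_preds[i] is True if the model says sentence i matches its positive
    image (correct answer: True). neg_preds[i] is True if the model says
    sentence i matches its negative image (correct answer: False).
    """
    if len(pos_preds) != len(neg_preds) or not pos_preds:
        raise ValueError("expected two non-empty lists of equal length")
    n = len(pos_preds)
    # Fraction of positive images accepted.
    pos_acc = sum(1 for p in pos_preds if p) / n
    # Fraction of negative images rejected.
    neg_acc = sum(1 for q in neg_preds if not q) / n
    # Fraction of sentences where both images are classified correctly
    # (an illustrative aggregate, assumed for this sketch).
    both_acc = sum(1 for p, q in zip(pos_preds, neg_preds) if p and not q) / n
    return {"positive": pos_acc, "negative": neg_acc, "both": both_acc}


# Hypothetical predictions for four sentences.
scores = svo_accuracy(
    pos_preds=[True, True, False, True],
    neg_preds=[False, True, False, False],
)
print(scores)
```

Reporting positive and negative accuracy separately makes it visible when a model simply accepts every image, which would score well on positives and poorly on negatives.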

For a detailed description of our benchmark, please see the paper Probing Image–Language Transformers for Verb Understanding. Please cite this paper if you use the SVO-Probes benchmark in your work.

Files

svo_probes.csv: our raw data. Each row consists of two pairs, <sentence, positive-image> and <sentence, negative-image>. Each image is identified by a unique ID and a URL: pos_image_id and pos_url for the positive image, neg_image_id and neg_url for the negative image. Each image is also associated with the subject-verb-object triplet (pos_triplet or neg_triplet) that can be seen in it. The subj_neg, verb_neg, and obj_neg columns specify the type of the negative: for example, subj_neg is True if the negative example is a subject negative.
image_urls.txt: a list of image urls used in our benchmark.
A Colab to analyze pre-trained models on SVO-Probes.
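The column layout described above can be read with Python's standard csv module. The following is a minimal sketch that counts rows per negative type; it runs on a made-up two-row sample rather than the real file, and the `sentence` column name, the column order, the triplet formatting, and the example URLs are assumptions for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical sample using the column names described above. The
# "sentence" column name, triplet format, and URLs are assumptions.
SAMPLE_CSV = """sentence,pos_triplet,neg_triplet,pos_image_id,pos_url,neg_image_id,neg_url,subj_neg,verb_neg,obj_neg
A dog runs on the beach,"dog,run,beach","dog,sit,beach",1,http://example.com/1.jpg,2,http://example.com/2.jpg,False,True,False
A girl rides a horse,"girl,ride,horse","boy,ride,horse",3,http://example.com/3.jpg,4,http://example.com/4.jpg,True,False,False
"""


def negative_type(row):
    """Return the name of the first *_neg column set to True in this row."""
    for name in ("subj_neg", "verb_neg", "obj_neg"):
        if row[name].strip().lower() == "true":
            return name
    return "unknown"


def count_negative_types(csv_file):
    """Count rows per negative type from an open CSV file object."""
    reader = csv.DictReader(csv_file)
    return Counter(negative_type(row) for row in reader)


counts = count_negative_types(io.StringIO(SAMPLE_CSV))
print(counts)
```

To run this on the real data, one would pass `open("svo_probes.csv", newline="")` instead of the in-memory sample, after checking that the header matches the names used here.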

Disclaimer

This is not an official Google product. The SVO-Probes benchmark was created solely for research purposes and is not intended to be used in products. The images in our benchmark were retrieved from Google Image Search; we expect them to reflect distributional properties and biases similar to those of images returned by the Google Image Search API. Furthermore, our dataset is designed to have a vocabulary similar to that of the Conceptual Captions dataset, so we expect our <Subject, Verb, Object> triplets to reflect biases in Conceptual Captions.

License

The data is made available under the terms of the Creative Commons Attribution 4.0 International Public License (CC BY 4.0). You can find details at: https://creativecommons.org/licenses/by/4.0/legalcode

If you have concerns or comments about the benchmark, please contact [email protected] and [email protected].

Owner

DeepMind
