Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Last update: Oct 21, 2022

Related tags

Overview

Can NLI Models Verify QA Systems' Predictions?

This repository contains the data and code for the following paper:

**Can NLI Models Verify QA Systems' Predictions? **
Jifan Chen, Eunsol Choi, Greg Durrett
EMNLP 2021 Findings

@article{chen2021can,
  title={Can NLI Models Verify QA Systems' Predictions?},
  author={Chen, Jifan and Choi, Eunsol and Durrett, Greg},
  journal={EMNLP Findings},
  year={2021}
}

Datasets

The NLI data converted from QA datasets through our pipeline described in the paper can be found here

Data Format

The data files are formatted as jsonlines; each example is described as the following:

Field	Description
`example_id`	Example ID
`title_text`	Title of the Wikipedia page of the example, could be NONE
`paragraph_text`	Paragraph containing the answer
`question_text`	Question
`answer_text`	Answer of the question
`answer_sent_text`	Sentence containing the answer
`decontext_answer_sent_text`	Decontextualized sentence containing the answer
`question_statement_text`	Declarative version of the question by combining the answer
`answer_scores`	Top 5 Answer score computed by the QA(BERT-joint) model
`is_correct`	Whether the answer is correct
`answer_sent_text`	Sentence containing the answer

Models

Getting started

git clone https://github.com/jifan-chen/QA-Verification-Via-NLI.git

Install the dependencies by running pip install -r requirements.txt

Question Converter & Decontextualizer

See README in seq2seq_converter.

NQ-NLI

coming soon

Contact

Please contact at [email protected] if you have any questions.

Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Related tags

Overview

Can NLI Models Verify QA Systems' Predictions?

Datasets

Data Format

Models

Getting started

Question Converter & Decontextualizer

NQ-NLI

Contact

Owner

Jifan Chen

Subtitle Workshop (subshop): tools to download and synchronize subtitles

Chinese version of GPT2 training code, using BERT tokenizer.

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

Collection of useful (to me) python scripts for interacting with napari

A framework for cleaning Chinese dialog data

Traditional Chinese Text Recognition Dataset: Synthetic Dataset and Labeled Data

Open-source offline translation library written in Python. Uses OpenNMT for translations

Auto_code_complete is a auto word-completetion program which allows you to customize it on your needs

使用Mask LM预训练任务来预训练Bert模型。训练垂直领域语料的模型表征，提升下游任务的表现。

Code voor mijn Master project omtrent VideoBERT

TextFlint is a multilingual robustness evaluation platform for natural language processing tasks,

GCRC: A Gaokao Chinese Reading Comprehension dataset for interpretable Evaluation

Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)

NL. The natural language programming language.

Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.

Artificial Conversational Entity for queries in Eulogio "Amang" Rodriguez Institute of Science and Technology (EARIST)

Concept Modeling: Topic Modeling on Images and Text

Let Xiao Ai speakers control third-party devices

Count the frequency of letters or words in a text file and show a graph.

Optimal Transport Tools (OTT), A toolbox for all things Wasserstein.