The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".

Last update: Oct 30, 2022

Overview

Code for "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval" (ACL 2021, Long)

This repository contains the baseline models and annotated data for the paper: Akari Asai and Eunsol Choi. Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval. In: Proceedings of ACL. 2021.

In the paper, we carefully analyze unanswerable questions in information-seeking QA datasets (i.e., Natural Questions and TyDi QA) and attempt to identify the remaining headroom. We conduct both a range of controlled experiments and intensive human annotation of around 800 examples across 6 languages.

Annotated data

In human_annotated_data, we provide human-annotated data from TyDi QA and Natural Questions.

Dataset	Language	# of annotated questions	File name
Natural Questions	English	450	NQ.tsv
TyDi QA	Bengali	50	TyDi-Bn.tsv
TyDi QA	Japanese	100	TyDi-Ja.tsv
TyDi QA	Korean	100	TyDi-Ko.tsv
TyDi QA	Russian	50	TyDi-Ru.tsv
TyDi QA	Telugu	50	TyDi-Te.tsv
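
The annotation files listed above can be read with a few lines of Python. The following is a minimal sketch, not part of the original codebase: it assumes each .tsv file is tab-separated with a header row, and `load_annotations` is a hypothetical helper name. The column names are not documented here, so the example prints whatever header it finds.

```python
import csv
from pathlib import Path

# Directory name taken from this README; adjust to your checkout location.
DATA_DIR = Path("human_annotated_data")


def load_annotations(path):
    """Read a tab-separated annotation file into a list of row dicts.

    Assumption: the first line of the file is a header row. If the files
    have no header, use csv.reader instead of csv.DictReader.
    """
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f, delimiter="\t"))


if __name__ == "__main__":
    # Two of the file names listed in the table above.
    for name in ["NQ.tsv", "TyDi-Ja.tsv"]:
        path = DATA_DIR / name
        if path.exists():
            rows = load_annotations(path)
            columns = list(rows[0].keys()) if rows else []
            print(f"{name}: {len(rows)} rows, columns: {columns}")
        else:
            print(f"{name}: not found under {DATA_DIR}")
```

If the files contain unescaped quotation marks inside fields, passing `quoting=csv.QUOTE_NONE` to `csv.DictReader` may be needed.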

Baselines

In this work, we conduct several baseline experiments to identify the remaining headroom in information-seeking QA. This repository includes the question-only baseline. See README.md for training and evaluation details. We thank the authors of RikiNet, Retro-Reader, and ETC for providing their models' predictions, which we used to analyze the behavior of these state-of-the-art models.

Citation and Contact

If you find this codebase useful or use it in your work, please cite our paper.

@inproceedings{
asai2020learning,
title={Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval},
author={Akari Asai and Eunsol Choi},
booktitle={ACL-IJCNLP},
year={2021}
}

Please contact Akari Asai (@AkariAsai, akari[at]cs.washington.edu) for questions and suggestions.
