PyTorch reimplementation of REALM and ORQA

Last update: Aug 20, 2022

Related tags

Overview

PyTorch Reimplementation of REALM and ORQA

This is PyTorch reimplementation of REALM (paper, codebase) and ORQA (paper, codebase).

Some features have not been implemented yet, currently the predictor and finetuning script are available.

The term retriever and searcher in the code are basically interchangeable, their difference is that retriever is for REALM pretraining, and searcher is for ORQA finetuning.

Prerequisite

cd transformers && pip install -U -e ".[dev]"
pip install -U scann, apache_beam

Data

To download pretrained checkpoints and preprocessed data, please follow the instructions below:

cd data
pip install -U -r requirements.txt
sh download.sh

Finetune (Experimental)

The default finetuning dataset is Natural Question(NQ). To laod your custom dataset, please change the loading function in data.py.

Training:

python run_finetune.py --is_train \
    --model_dir "./" \
    --num_epochs 2 \
    --device cuda

Evaluation:

python run_finetune.py \
    --retriever_pretrained_name "retriever" \
    --checkpoint_pretrained_name "reader" \
    --model_dir "./" \
    --device cuda

Predict

The default checkpoints of retriever and reader are orqa_nq_model_from_realm. To change them, kindly specify --retriever_path and --checkpoint_path.

python predictor.py --question "Who is the pioneer in modern computer science?"

Output: alan mathison turing

License

Apache License 2.0

PyTorch reimplementation of REALM and ORQA

Related tags

Overview

PyTorch Reimplementation of REALM and ORQA

Prerequisite

Data

Finetune (Experimental)

Predict

License

Owner

Li-Huai (Allan) Lin

Narya API allows you track soccer player from camera inputs, and evaluate them with an Expected Discounted Goal (EDG) Agent

Activating More Pixels in Image Super-Resolution Transformer

The official repository for BaMBNet

Code for the Convolutional Vision Transformer (ConViT)

A small library for creating and manipulating custom JAX Pytree classes

COCO Style Dataset Generator GUI

AutoVideo: An Automated Video Action Recognition System

Artifacts for paper "MMO: Meta Multi-Objectivization for Software Configuration Tuning"

Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning “螺旋桨”生物计算工具集

A clean and scalable template to kickstart your deep learning project 🚀 ⚡ 🔥

[TNNLS 2021] The official code for the paper "Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement"

Official PyTorch implementation of Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval.

Moon-patrol - A faithful recreation of the 1983 hit classic Moon Patrol for the Atari 2600 created using the Pygame library for Python

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).

RODD: A Self-Supervised Approach for Robust Out-of-Distribution Detection

【CVPR 2021, Variational Inference Framework, PyTorch】 From Rain Generation to Rain Removal

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Combinatorially Hard Games where the levels are procedurally generated