Fine-Grained R2R

Overview

Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2020 paper Sub-Instruction Aware Vision-and-Language Navigation.

Code of the navigator will be released soon.

This dataset enriches the benchmark Room-to-Room (R2R) dataset by dividing the instructions into sub-instructions and pairing each of those with their corresponding viewpoints in the path.

  • The copyright resides with the authors of the paper Sub-Instruction Aware Vision-and-Language Navigation.
  • This dataset is built upon the Room-to-Room (R2R) dataset; we refer readers to its repository for more details.

Data

The Fine-Grained R2R data enriches the R2R dataset with sub-instructions and their corresponding sub-paths. The overall instruction and trajectory of each sample remain the same.

  • For paths in the train, validation seen, and validation unseen splits, we add two new entries:

    • new_instructions: A list of sub-instructions produced by the Chunking Function from the complete instructions. It is stored as a string; use ast.literal_eval() to read it back as a list (see the loading sketch after this list).
    • chunk_view: A list of sub-paths corresponding to the sub-instructions, where each number in the list is an index of a viewpoint in the ground-truth path. The index starts at 1.
  • Some sub-instructions that refer to a camera rotation or a STOP action may map to a single viewpoint.

  • For the test unseen split, we only provide the sub-instructions but not the sub-paths.
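
A minimal loading sketch is shown below. The file name FGR2R_train.json and the exact nesting of the fields are assumptions based on the description above, so adjust them to the released files you actually use:

```python
import ast
import json

# Assumed file name for the training split; point this at the released FGR2R file.
with open('FGR2R_train.json') as f:
    data = json.load(f)

item = data[0]

# 'new_instructions' is stored as a string, so ast.literal_eval() recovers the
# nested list of sub-instructions produced by the Chunking Function.
sub_instructions = ast.literal_eval(item['new_instructions'])
print(sub_instructions)

# 'chunk_view' holds the matching sub-paths as 1-indexed viewpoint indices
# into the ground-truth path.
print(item['chunk_view'])
```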

Source

The code of the proposed Chunking Function for generating sub-instructions.

  • Install the StanfordNLP package (v0.1.2 in our experiments) and download the English models for the neural pipeline (see the setup sketch after this list).

  • Run make_subinstr.py to generate data with sub-instructions from the original R2R data.

  • The generated files were then sent to Amazon Mechanical Turk (AMT) for annotating the sub-paths.
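
A short setup sketch for the steps above. The pipeline calls follow the standard stanfordnlp API; the chunking logic itself lives in make_subinstr.py, and the example sentence is only illustrative:

```python
# pip install stanfordnlp==0.1.2
import stanfordnlp

# Download the English models used by the neural pipeline (one-time step).
stanfordnlp.download('en')

# Build a default English pipeline; make_subinstr.py uses the same package
# to tag and parse the instructions before chunking them.
nlp = stanfordnlp.Pipeline(lang='en')
doc = nlp("Walk past the sofa and stop at the kitchen door.")
print([(word.text, word.upos) for sent in doc.sentences for word in sent.words])
```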

Reference

If you use or discuss the Fine-Grained R2R dataset in your work, please cite our paper:

@article{hong2020sub,
  title={Sub-Instruction Aware Vision-and-Language Navigation},
  author={Hong, Yicong and Rodriguez-Opazo, Cristian and Wu, Qi and Gould, Stephen},
  journal={arXiv preprint arXiv:2004.02707},
  year={2020}
}

Contact

If you have any questions regarding the dataset or the publication, please create an issue in this repository or email [email protected].
