Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Last update: Dec 11, 2022

Overview

2017 VQA Challenge Winner (CVPR'17 Workshop)

pytorch implementation of Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge by Teney et al.

Prerequisites

python 3.6+
numpy
pytorch 0.4
tqdm
nltk
pandas

Data

Preparation

To download and extract vqav2, glove, and pretrained visual features:
```
bash scripts/download_extract.sh
```
To prepare data for training:
```
python scripts/preproc.py
```

The structure of data/ directory should look like this:

- data/
  - zips/
    - v2_XXX...zip
    - ...
    - glove...zip
    - trainval_36.zip
  - glove/
    - glove...txt
    - ...
  - v2_XXX.json
  - ...
  - trainval_resnet...tsv
  (The above are files created after executing scripts/download_extract.sh)
  - tokenizers/
    - ...
  - dict_ans.pkl
  - dict_q.pkl
  - glove_pretrained_300.npy
  - train_qa.pkl
  - val_qa.pkl
  - train_vfeats.pkl
  - val_vfeats.pkl
  (The above are files created after executing scripts/preproc.py)

Train

Use default parameters:

bash scripts/train.sh

Notes

Huge re-factor (especially data preprocessing), tested based on pytorch 0.4.1 and python 3.6
Training for 20 epochs reach around 50% training accuracy. (model seems buggy in my implementation)
After all the preprocessing, data/ directory may be up to 38G+
Some of preproc.py and utils.py are based on this repo

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Related tags

Overview

2017 VQA Challenge Winner (CVPR'17 Workshop)

Prerequisites

Data

Preparation

Train

Notes

Resources

Owner

Mark Dong

Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)

多语言降噪预训练模型MBart的中文生成任务

Wikipedia-Utils: Preprocessing Wikipedia Texts for NLP

Automated question generation and question answering from Turkish texts using text-to-text transformers

:mag: Transformers at scale for question answering & neural search. Using NLP via a modular Retriever-Reader-Pipeline. Supporting DPR, Elasticsearch, HuggingFace's Modelhub...

PRAnCER is a web platform that enables the rapid annotation of medical terms within clinical notes.

뉴스 도메인 질의응답 시스템 (21-1학기 졸업 프로젝트)

Text editor on python tkinter to convert english text to other languages with the help of ployglot.

Neural-Machine-Translation - Implementation of revolutionary machine translation models

Reformer, the efficient Transformer, in Pytorch

MMDA - multimodal document analysis

nlp基础任务

SimCSE: Simple Contrastive Learning of Sentence Embeddings

Pretrained Japanese BERT models

PG-19 Language Modelling Benchmark

Random-Word-Generator - Generates meaningful words from dictionary with given no. of letters and words.

:P Some basic stuff I'm gonna use for my upcoming Agile Software Development and Devops

Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch

Mlcode - Continuous ML API Integrations

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"