Multi-query Video Retrieval

This repository contains the code for the paper:

@misc{wang2022multiquery,
      title={Multi-query Video Retrieval}, 
      author={Zeyu Wang and Yu Wu and Karthik Narasimhan and Olga Russakovsky},
      year={2022},
      eprint={2201.03639},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Data Preparation

Download the raw videos for MSR-VTT, MSVD and VATEX, and put them into the data/{dataset}/raw_videos folder, where {dataset} is msrvtt, msvd or vatex.
Run the script data/extract_frames.sh to extract frames from raw videos.

The resulting data folder is structured like this:

├── data
    ├── msrvtt
        ├── msrvtt_train.json
        ├── msrvtt_test.json
        ├── msrvtt_test_varying_query_sample_1-20.json
        ├── raw_videos
            ├── video0.mp4
            ├── ...
        ├── extracted_frames
            ├── video0.mp4
                ├── 0.jpg
                ├── ...
            ├── ...
    ├── msvd
        ├── ...
    ├── vatex
        ├── ...
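
To confirm that frame extraction covered every video, a small check along the following lines may help. This is a minimal sketch, not part of the repository: the function name `find_videos_missing_frames` is hypothetical, and it assumes only the layout shown above (one folder per video under extracted_frames, named after the video file and containing .jpg frames).

```python
from pathlib import Path


def find_videos_missing_frames(dataset_dir):
    """Return names of raw videos that have no extracted .jpg frames.

    Hypothetical helper: assumes the data/{dataset} layout shown above.
    """
    dataset_dir = Path(dataset_dir)
    raw_dir = dataset_dir / "raw_videos"
    frames_dir = dataset_dir / "extracted_frames"
    missing = []
    for video in sorted(raw_dir.iterdir()):
        if not video.is_file():
            continue
        # Frames for video0.mp4 are expected under extracted_frames/video0.mp4/
        frame_folder = frames_dir / video.name
        has_frames = frame_folder.is_dir() and any(frame_folder.glob("*.jpg"))
        if not has_frames:
            missing.append(video.name)
    return missing
```

For example, `find_videos_missing_frames("data/msrvtt")` would list the videos whose frames still need to be extracted.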

For the Frozen model, download the pretrained checkpoint provided by the original authors and put it into the record/pretrained folder.

Training

Run command: python train.py -c configs/{config_path}

Evaluation

Run command: python evaluate.py -c configs/{config_path}
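
Training and evaluation take the same config argument, so the two commands can be chained. The following is a minimal sketch under that assumption; the helper names `build_command` and `train_then_evaluate` are hypothetical and not part of the repository, and only the `-c configs/{config_path}` form shown above is used.

```python
import subprocess
import sys


def build_command(script, config_path):
    """Build the argv list for `python <script> -c configs/<config_path>`."""
    return [sys.executable, script, "-c", f"configs/{config_path}"]


def train_then_evaluate(config_path):
    """Run train.py, then evaluate.py, with the same config.

    check=True raises CalledProcessError if either script exits non-zero,
    so evaluation is skipped when training fails.
    """
    for script in ("train.py", "evaluate.py"):
        subprocess.run(build_command(script, config_path), check=True)
```

Call `train_then_evaluate("{config_path}")` from the repository root, substituting the path of a config file under configs/.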

Acknowledgements

The structure of this repository is based on https://github.com/victoresque/pytorch-template. Some of the code is adapted from https://github.com/m-bain/frozen-in-time and https://github.com/ArrowLuo/CLIP4Clip.

Owner

Princeton Visual AI Lab
