Activity image-based video retrieval

Last update: Oct 21, 2021

Cross-modal-retrieval

Our approach focuses on the Activity Image-to-Video Retrieval (AIVR) task. The compared methods are state-of-the-art single modality hashing methods, multiple modalities hashing methods, and cross-modal retrieval methods.

Single modality hashing methods

Some hashing baselines for image retrieval can be found at https://github.com/willard-yuan/hashing-baseline-for-image-retrieval.
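The linked baselines are not reproduced here. As a rough illustration of how hashing-based retrieval generally works (features are mapped to binary codes, and database items are ranked by Hamming distance to the query code), the following is a minimal sketch using random-hyperplane hashing. This is a generic technique chosen for illustration, not one of the specific compared methods, and all function names (make_projections, hash_code, hamming, rank_by_hamming) are hypothetical.

```python
import random


def make_projections(dim, n_bits, seed=0):
    """Draw n_bits random Gaussian hyperplanes of dimension dim (hypothetical helper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def hash_code(feature, projections):
    """Binarize a real-valued feature: one bit per hyperplane, set by the sign of the dot product."""
    return tuple(
        1 if sum(w * x for w, x in zip(row, feature)) >= 0 else 0
        for row in projections
    )


def hamming(a, b):
    """Number of positions at which two equal-length binary codes differ."""
    return sum(x != y for x, y in zip(a, b))


def rank_by_hamming(query_code, database_codes):
    """Return database indices sorted by Hamming distance to the query (ties broken by index)."""
    return sorted(
        range(len(database_codes)),
        key=lambda i: (hamming(query_code, database_codes[i]), i),
    )


if __name__ == "__main__":
    # Toy 4-dimensional features; real baselines would use image or video descriptors.
    projections = make_projections(dim=4, n_bits=16)
    database = [[0.9, 0.1, 0.0, 0.2], [-0.5, 0.8, 0.3, -0.1], [0.1, -0.7, 0.6, 0.4]]
    codes = [hash_code(f, projections) for f in database]
    query_code = hash_code([0.8, 0.2, 0.1, 0.1], projections)
    print(rank_by_hamming(query_code, codes))
```

Learned hashing methods replace the random hyperplanes with projections fitted to data, but the retrieval step (compare binary codes, sort by Hamming distance) has the same shape.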

Multiple modalities hashing methods

For more details, see https://github.com/czxxjtu/Hash-Learning.github.io. Some details about the hashing methods are in the hashing-baseline-for-image-retrieval-master folder.

Cross-modal retrieval methods

The compared cross-modal retrieval methods follow the paper:

Datasets

THUMOS'14 Dataset:

https://pan.baidu.com/s/1H6c8nh_Hs7gVkhESpxtvAg (extraction code: qp26)

ActivityNet Dataset:

https://pan.baidu.com/s/1P0jRecEmplCPaTPwFoOpVQ (extraction code: pnw9)

Bibtex

When using images from our dataset, please cite our paper using the following BibTeX [PDF]:

@article{pba2020,
author    = {Ruicong Xu and Li Niu and Jianfu Zhang and Liqing Zhang},
title     = {A Proposal-based Approach for Activity Image-to-Video Retrieval},
journal   = {AAAI},
year      = {2020}}

Owner

BCMI
