HOI Transformer

Code for CVPR 2021 accepted paper End-to-End Human Object Interaction Detection with HOI Transformer.

Reproduction

We recomend you to setup in the following steps:

1.Clone the repo.

git clone https://github.com/bbepoch/HoiTransformer.git

2.Download the MS-COCO pretrained DETR model.

cd data/detr_coco && bash download_model.sh

3.You are supposed to make a soft link named 'images' in 'data/hico/' to refer to your HICO-DET path, or your will have to modify the data path manually in hico.py.

ln -s /path-to-your-hico-det-dataset/hico_20160224_det/images images

4.Train a model.

python3 -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --epochs=250 --lr_drop=200 --dataset_file=hico --batch_size=2 --backbone=resnet50

5.Test a model.

python3 test.py --dataset_file=hico --batch_size=1 --log_dir=./ --model_path=your_model_path

Citation

@inproceedings{zou2021_hoitrans,
author = {Zou, Cheng and Wang, Bohan and Hu, Yue and Liu, Junqi and Wu, Qian and Zhao, Yu and Li, Boxun and Zhang, Chenguang and Zhang, Chi and Wei, Yichen and Sun, Jian},
title = {End-to-End Human Object Interaction Detection with HOI Transformer},
booktitle={CVPR},
year = {2021},
}

Acknowledgement

We sincerely thank all previous works, especially DETR, PPDM, iCAN, for some of the codes are built upon them.

This is the code for HOI Transformer

Related tags

Overview

HOI Transformer

Reproduction

Citation

Acknowledgement

Owner

BigBangEpoch

Pytorch GUI(demo) for iVOS(interactive VOS) and GIS (Guided iVOS)

Reading list for research topics in Masked Image Modeling

scikit-learn: machine learning in Python

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

A Python library for common tasks on 3D point clouds

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Cognate Detection Repository

Google Brain - Ventilator Pressure Prediction

Code for HodgeNet: Learning Spectral Geometry on Triangle Meshes, in SIGGRAPH 2021.

To provide 100 JAX exercises over different sections structured as a course or tutorials to teach and learn for beginners, intermediates as well as experts

Autoencoder - Reducing the Dimensionality of Data with Neural Network

YoloV3 Implemented in Tensorflow 2.0

Hide screen when boss is approaching.

LightningFSL: Pytorch-Lightning implementations of Few-Shot Learning models.

Turn based roguelike in python

Official Pytorch implementation of C3-GAN

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

PyTorch source code for Distilling Knowledge by Mimicking Features

Label-Free Model Evaluation with Semi-Structured Dataset Representations