Multiview 3D object detection on MultiviewC dataset through moft3d.

Last update: Dec 21, 2022

Overview

Voxelized 3D Feature Aggregation for Multiview Detection [arXiv]

Multiview 3D object detection on MultiviewC dataset through VFA.

Introduction

We propose a novel method, VFA, for multiview 3D object detection and MultiviewC, a synthetic dataset, for multi-view detection in occlusion scenarios.

Content

MultiviewC dataset
- Download MultivewC
- Build your own version
VFA Code

MultiviewC dataset

The MultiviewC dataset mainly contributes to multiview cattle action recognition, 3D objection detection and tracking. We build a novel synthetic dataset MultiviewC through UE4 based on real cattle video dataset which is offered by CISRO.

The MultiviewC dataset is generated on a 37.5 meter by 37.5 meter square field. It contains 7 cameras monitoring cattle activities. The images in MultiviewC are of high resolution, 1280x720 and synthetic animals in our dataset are highly realistic.

Download MultiviewC

download dataset and copy the annotations, images and calibrations folder into this repo.

Build your own version

Please refer to this repo for MultiviewC dataset toolkits.

VFA

This repo is contributed to the code for VFA.

Data Preparation

In this project, we use MultiviewC, MultiviewX and Wildtrack. Download and unzip the dataset in the ~/Data folder. Your ~/Data/ folder should look like this

Data
├── MultiviewC/
│   └── ...
|
├── MultiviewX/
│   └── ...
|
└── Wildtrack/ 
    └── ...

Training and Inference

Training from scratch.

# For MultiviewC
python .\train.py --data MultiviewC

# For MultiviewX
python .\train.py --data MultiviewX

# For Wildtrack
python .\train.py --data Wildtrack

We provide the training documents contains the checkpoints of model, optimizer and scheduler and tensorboard containing the training details. Download the latest training documents to ~/experiments folder from BaiduDrivepwd:6666 or GoogleDrive and unzip them. Your ~/experiments/ folder should look like this

experiments
└── MultiviewC/
    ├── checkpoints
    |   └── ...
    └── evaluation
    |   └── ...
    └── tensorboard
        └── ...

Evaluation

There are two metrics to evaluate the performance of model. MODA, MODP, Precission and Recall are used to evaluate detection performance such as the detection in occlusion scenes. These metrics need to successfully run in matlab environment. Please refer to here for more details. Even though, the python implementation of these metrics mentioned above is also provided, it need to select the distance threshould to detemine to positive samples，which is not objective enough. Thus, it is recommended to select the official implementation of matlab.

When it comes to the AP, AOS, OS metrics, we need to install cuda environment and build the toolkit for 3D rotated IoUs calculation. Please refer to this repo for more details.

Multiview 3D object detection on MultiviewC dataset through moft3d.

Related tags

Overview

Voxelized 3D Feature Aggregation for Multiview Detection [arXiv]

Introduction

Content

MultiviewC dataset

Download MultiviewC

Build your own version

VFA

Data Preparation

Training and Inference

Evaluation

Owner

Jiahao Ma

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

On the adaptation of recurrent neural networks for system identification

High-fidelity 3D Model Compression based on Key Spheres

Python suite to construct benchmark machine learning datasets from the MIMIC-III clinical database.

[ICCV 2021] Our work presents a novel neural rendering approach that can efficiently reconstruct geometric and neural radiance fields for view synthesis.

RCT-ART is an NLP pipeline built with spaCy for converting clinical trial result sentences into tables through jointly extracting intervention, outcome and outcome measure entities and their relations.

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

This is code to fit per-pixel environment map with spherical Gaussian lobes, using LBFGS optimization

a Pytorch easy re-implement of "YOLOX: Exceeding YOLO Series in 2021"

Model Zoo for AI Model Efficiency Toolkit

A scikit-learn-compatible module for estimating prediction intervals.

Title: Graduate-Admissions-Predictor

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

CTF Challenge for CSAW Finals 2021

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"

Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Honours project, on creating a depth estimation map from two stereo images of featureless regions

PyArmadillo: an alternative approach to linear algebra in Python

exponential adaptive pooling for PyTorch