Codes_APN

Official codes of CVPR21 paper: Normal Learning in Videos with Attention Prototype Network (https://arxiv.org/abs/2108.11055)

Overview of our approach based on APU and CAU model:

Introduction

Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection. With models trained on the normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods introduced the memory bank into AE, for encoding diverse normal patterns across the training videos. However, they are memory consuming and cannot cope with unseen new scenarios in the testing data. In this work, we propose a self-attention prototype unit (APU) to encode the normal latent space as prototypes in real time, free from extra memory cost. In addition, we introduce circulative attention mechanism to our backbone to form a novel feature extracting learner, namely Circulative Attention Unit(CAU). It enables the fast adaption capability on new scenes by only consuming a few iterations of update. Extensive experiments are conducted on various benchmarks. The superior performance over the state-of-the-art demonstrates the effectiveness of our method.

Performance

We achieved SOTA on many video anomaly detection datasets.

Unsupervised Anomaly Detection Model Training

bash train.sh

Unsupervised Anomaly Detection Model Testing

bash test.sh

If you find this work helpful, please cite:

@inproceedings{Nv2021APN,
  author    = {Chao Hu and
	       Fan Wu and
               Weijie Wu and
               Weibin Qiu and
               Shengxin Lai},
  title     = {Normal Learning in Videos with Attention Prototype Network},
  booktitle = {Computer Vision and Pattern Recognition},
  year      = {2021}
}

Normal Learning in Videos with Attention Prototype Network

Related tags

Overview

Codes_APN

Introduction

Performance

Unsupervised Anomaly Detection Model Training

Unsupervised Anomaly Detection Model Testing

Owner

AI Face Mesh: This is a simple face mesh detection program based on Artificial intelligence.

FL-WBC: Enhancing Robustness against Model Poisoning Attacks in Federated Learning from a Client Perspective

Exemplo de implementação do padrão circuit breaker em python

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

Voice of Pajlada with model and weights.

BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Official code of paper "PGT: A Progressive Method for Training Models on Long Videos" on CVPR2021

Code for "Unsupervised State Representation Learning in Atari"

Improving Calibration for Long-Tailed Recognition (CVPR2021)

The AugNet Python module contains functions for the fast computation of image similarity.

Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Turi Create simplifies the development of custom machine learning models.

A really easy-to-use and powerful sudoku solver.

Process text, including tokenizing and representing sentences as vectors and Applying some concepts like RNN, LSTM and GRU to create a classifier can detect the language in which a sentence is written from among 17 languages.

Pretty Tensor - Fluent Neural Networks in TensorFlow

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Occlusion robust 3D face reconstruction model in CFR-GAN (WACV 2022)

On-device speech-to-intent engine powered by deep learning