[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Last update: Dec 12, 2022

Overview

Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Code for the paper "Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion". To acquire the dataset, please contact [email protected].

Introduction

We propose a unified network, CorrFusionNet, for scene change detection. CorrFusionNet first extracts features from the bi-temporal inputs with deep convolutional networks. The extracted features are then projected into a lower-dimensional space to compute the instance-level canonical correlation. The CorrFusion module performs cross-temporal fusion based on the computed correlation. The final scene classification and scene change results are obtained with softmax activation layers. In the objective function, we introduce a new formulation for calculating the temporal correlation. Visual results and quantitative assessments both show that CorrFusionNet outperforms other scene change detection methods and some state-of-the-art image classification methods.
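The pipeline described above (project both feature sets, compute a per-instance correlation, fuse across time) can be illustrated with a small NumPy sketch. This is not the implementation in this repository and not the exact formulation from the paper: the projection matrices `w1` and `w2`, the use of a per-sample Pearson correlation as a stand-in for the instance-level canonical correlation, and the additive fusion rule are all assumptions made for illustration.

```python
import numpy as np


def instance_correlation(z1, z2, eps=1e-8):
    """Per-sample Pearson correlation between two (batch, dim) arrays.

    Assumption: used here as a simple stand-in for the instance-level
    canonical correlation mentioned in the text.
    """
    z1c = z1 - z1.mean(axis=1, keepdims=True)
    z2c = z2 - z2.mean(axis=1, keepdims=True)
    num = np.sum(z1c * z2c, axis=1)
    den = np.linalg.norm(z1c, axis=1) * np.linalg.norm(z2c, axis=1) + eps
    return num / den


def corr_fuse(f1, f2, w1, w2):
    """Fuse bi-temporal features weighted by their per-sample correlation.

    f1, f2: (batch, feat_dim) features of the two dates.
    w1, w2: (feat_dim, proj_dim) hypothetical projection matrices.
    Returns the two fused feature arrays and the correlation per sample.
    """
    z1 = f1 @ w1  # project into a lower-dimensional space
    z2 = f2 @ w2
    rho = instance_correlation(z1, z2)[:, None]
    # Hypothetical fusion rule: add the other date's features,
    # scaled by how correlated the two projections are.
    fused1 = f1 + rho * f2
    fused2 = f2 + rho * f1
    return fused1, fused2, rho[:, 0]


rs = np.random.RandomState(0)
feat_t1 = rs.randn(4, 16)
feat_t2 = rs.randn(4, 16)
proj_1 = rs.randn(16, 8)
proj_2 = rs.randn(16, 8)
out1, out2, corr = corr_fuse(feat_t1, feat_t2, proj_1, proj_2)
print(out1.shape, out2.shape, corr.shape)
```

In this sketch, unchanged scene pairs (high correlation) exchange more information than changed pairs (low or negative correlation), which is the intuition behind correlation-based fusion.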

CorrFusion Module

The proposed CorrFusion module:

The proposed CorrFusionNet:

Requirements

scipy==1.1.0
matplotlib==3.0.3
h5py==2.8.0
numpy==1.16.3
tensorflow_gpu==1.8.0
Pillow==6.2.1
scikit_learn==0.21.3

Data

Overview of our Wuhan dataset

The images are stored in npz format.

├─trn
│      0-5000.npz
│      10000-15000.npz
│      15000-16488.npz
│      5000-10000.npz
│
├─tst
│      0-4712.npz
│
└─val
       0-2355.npz
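Because the array names stored inside each `.npz` archive are not documented here, a quick way to get oriented is to list them. The following is a minimal sketch that only relies on `numpy.load`; the `data/trn`, `data/tst`, and `data/val` paths and the helper name `summarize_split` are assumptions, so adjust them to wherever the dataset is unpacked.

```python
import glob
import os

import numpy as np


def summarize_split(split_dir):
    """Map each .npz file in split_dir to {array_name: shape}."""
    summary = {}
    for path in sorted(glob.glob(os.path.join(split_dir, "*.npz"))):
        # NpzFile loads arrays lazily; the context manager closes the file.
        with np.load(path) as archive:
            summary[os.path.basename(path)] = {
                name: archive[name].shape for name in archive.files
            }
    return summary


if __name__ == "__main__":
    # Hypothetical dataset root; missing directories simply yield no files.
    for split in ("trn", "tst", "val"):
        for fname, arrays in summarize_split(os.path.join("data", split)).items():
            print(split, fname, arrays)
```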

Usage

Install the requirements

pip install -r requirements.txt

Run the training code

python train_cnn.py [-h] [-g GPU] [-b BATCH_SIZE] [-e EPOCHES]
                    [-n NUM_CLASSES] [-tb USE_TFBOARD] [-sm SAVE_MODEL]
                    [-log SAVE_LOG] [-trn TRN_DIR] [-tst TST_DIR]
                    [-val VAL_DIR] [-lpath LOG_PATH] [-mpath MODEL_PATH]
                    [-tbpath TB_PATH] [-rpath RESULT_PATH]

(see parser.py)
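As a usage illustration, the training script can also be launched from Python. The sketch below only uses flags that appear in the usage line above; the concrete values (GPU 0, batch size 32, 50 epochs, `data/...` directories) and the helper names are assumptions, and the accepted value formats should be checked against parser.py.

```python
import subprocess
import sys


def build_train_command(gpu, batch_size, epochs, trn_dir, tst_dir, val_dir):
    """Build the argument list for train_cnn.py from the flags shown above."""
    return [
        sys.executable, "train_cnn.py",
        "-g", str(gpu),
        "-b", str(batch_size),
        "-e", str(epochs),
        "-trn", trn_dir,
        "-tst", tst_dir,
        "-val", val_dir,
    ]


def launch(command):
    """Run the command and raise if the script exits with an error."""
    subprocess.check_call(command)


if __name__ == "__main__":
    # Hypothetical values; call launch(cmd) from inside the repository
    # to actually start training.
    cmd = build_train_command(0, 32, 50, "data/trn", "data/tst", "data/val")
    print(" ".join(cmd))
```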

Evaluate on a trained model:

Download a trained model here.

Evaluation

python evaluate_model.py [-h] [-g GPU] [-m MODEL_DIR] [-tst TST_DIR]
                         [-val VAL_DIR]

optional arguments:
  -h, --help            show this help message and exit
  -g GPU, --gpu GPU     gpu device ID
  -m MODEL_DIR, --model_dir MODEL_DIR
                        model directory
  -tst TST_DIR, --tst_dir TST_DIR
                        testing file dir
  -val VAL_DIR, --val_dir VAL_DIR
                        validation file dir
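For readers who want to reuse the same command-line interface in their own scripts, the help text above corresponds to an `argparse` parser along the following lines. This is a reconstruction from the help output only: the option names and help strings come from the text above, while the absence of types and defaults and the function name `build_eval_parser` are assumptions, and the repository's own parser may differ.

```python
import argparse


def build_eval_parser():
    """Recreate the evaluate_model.py options listed in the help text."""
    parser = argparse.ArgumentParser(prog="evaluate_model.py")
    # Types and defaults are not shown in the help output, so none are set.
    parser.add_argument("-g", "--gpu", help="gpu device ID")
    parser.add_argument("-m", "--model_dir", help="model directory")
    parser.add_argument("-tst", "--tst_dir", help="testing file dir")
    parser.add_argument("-val", "--val_dir", help="validation file dir")
    return parser


if __name__ == "__main__":
    # Hypothetical argument values for illustration.
    args = build_eval_parser().parse_args(
        ["-g", "0", "-m", "models", "-tst", "data/tst", "-val", "data/val"]
    )
    print(args)
```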

Results

The results of quantitative assessments:

Predictions on our dataset:

Contact

For any questions, you are welcome to contact Lixiang Ru.

Owner

Lixiang Ru
