Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Last update: Aug 03, 2022

Overview

SSWS-loss_function_based_on_MS-TCN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Abstract

Recently, more and more videos have been uploaded to the network, so that video analysis task has been one of the most important applications in various fields. At present, video analysis methods can be divided into two kinds: weakly supervised video action segmentation and supervised video action segmentation. The former uses a sliding window or Markov model, while the latter uses the TCN model. In this paper, we introduce the Supervised Sliding Window Smooth Loss Function (SSWS) into the TCN baseline, which is a complement to MS-TCN smoothing loss function TMSE. In this method, three discriminant frames are selected from the video prediction sequence and combined into an adaptive sliding window to selectively smooth the whole prediction sequence. In particular, it doubles the penalty when it slides to the wrong place in the category. Compared to TMSE, our method effectively increases the receptive field of smoothing loss function. And, the proposed new supervised loss function only penalizes error frames. The experiment shows that compared with the Smoothing loss function TMSE of MS-TCN, SSWS has significantly improved in the three datasets: 50Salads, GTEA and the Breakfast Dataset.

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Related tags

Overview

SSWS-loss_function_based_on_MS-TCN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Abstract

Owner

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

ReSSL: Relational Self-Supervised Learning with Weak Augmentation

Pseudo-mask Matters in Weakly-supervised Semantic Segmentation

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Sequence to Sequence (seq2seq) Recurrent Neural Network (RNN) for Time Series Forecasting

Make Watson Assistant send messages to your Discord Server

Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)

FPSAutomaticAiming——基于YOLOV5的FPS类游戏自动瞄准AI

Generalized Proximal Policy Optimization with Sample Reuse (GePPO)

Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"

Implementation of the final project of the course DDA6309 Probabilistic Graphical Model

This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".

Analysing poker data from home games with friends

A set of tools for Namebase and HNS

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Distributed Asynchronous Hyperparameter Optimization better than HyperOpt.

Repo for EMNLP 2021 paper "Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression"