Contrastively Disentangled Sequential Variational Audoencoder

Last update: Dec 24, 2022

Related tags

Overview

Contrastively Disentangled Sequential Variational Audoencoder (C-DSVAE)

Overview

This is the implementation for our C-DSVAE, a novel self-supervised disentangled sequential representation learning method.

Requirements

Python 3
PyTorch 1.7
Numpy 1.18.5

Dataset

Sprites

We provide the raw Sprites .npy files. One can also find the dataset on a third-party repo.

For each split (train/test), we expect the following components for each sequence sample

x: raw sample of shape [8, 3, 64, 64]
c_aug: content augmentation of shape [8, 3, 64, 64]
m_aug: motion augmentation of shape [8, 3, 64, 64]
motion factors: action (3 classes), direction (3 classes)
content factors: skin, tops, pants, hair (each with 6 classes)

Running

Train

./run_cdsvae.sh

Test

./run_test_sprite.sh

Classification Judge

The judge classifiers are pretrained with full supervision separately.

Sprites judge

C-DSVAE Checkpoints

We provide a sample Sprites checkpoint. Checkpoint parameters can be found in ./run_test_sprite.sh.

Paper

If you are inspired by our work, please cite the following paper:

@inproceedings{bai2021contrastively,
  title={Contrastively Disentangled Sequential Variational Autoencoder},
  author={Bai, Junwen and Wang, Weiran and Gomes, Carla},
  booktitle={Advances in Neural Information Processing Systems},
  volume={},
  year={2021}
}

Contrastively Disentangled Sequential Variational Audoencoder

Related tags

Overview

Contrastively Disentangled Sequential Variational Audoencoder (C-DSVAE)

Overview

Requirements

Dataset

Sprites

Running

Train

Test

Classification Judge

C-DSVAE Checkpoints

Paper

Owner

Junwen Bai

This is the paddle code for SeBoW(Self-Born wiring for neural trees), a kind of neural tree born form a large search space

Python implementation of ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images, AAAI2022.

A user-friendly research and development tool built to standardize RL competency assessment for custom agents and environments.

Supervised forecasting of sequential data in Python.

[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (Pytorch implementation & checkpoints)

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

LightningFSL: Pytorch-Lightning implementations of Few-Shot Learning models.

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP2021)

RCDNet: A Model-driven Deep Neural Network for Single Image Rain Removal (CVPR2020)

The coda and data for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

Search Youtube Video and Get Video info

A Quick and Dirty Progressive Neural Network written in TensorFlow.

Double pendulum simulator using a symplectic Euler's method and Hamiltonian mechanics

An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.

Code for "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds", CVPR 2021

Reproduced Code for Image Forgery Detection papers.

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

DeepFaceLive - Live Deep Fake in python, Real-time face swap for PC streaming or video calls