[ICCV2021] Official PyTorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Last update: Dec 06, 2022

Overview

Semantics Disentangling for Generalized Zero-shot Learning

This is the official implementation of the paper:

Zhi Chen, Yadan Luo, Ruihong Qiu, Zi Huang, Jingjing Li, Zheng Zhang.
Semantics Disentangling for Generalized Zero-shot Learning
International Conference on Computer Vision (ICCV) 2021.

Abstract: Generalized zero-shot learning (GZSL) aims to classify samples under the assumption that some classes are not observable during training. To bridge the gap between the seen and unseen classes, most GZSL methods attempt to associate the visual features of seen classes with attributes or to generate unseen samples directly. Nevertheless, the visual features used in the prior approaches do not necessarily encode semantically related information that the shared attributes refer to, which degrades the model generalization to unseen classes. To address this issue, in this paper, we propose a novel semantics disentangling framework for the generalized zero-shot learning task (SDGZSL), where the visual features of unseen classes are firstly estimated by a conditional VAE and then factorized into semantic-consistent and semantic-unrelated latent vectors. In particular, a total correlation penalty is applied to guarantee the independence between the two factorized representations, and the semantic consistency of which is measured by the derived relation network. Extensive experiments conducted on four GZSL benchmark datasets have evidenced that the semantic-consistent features disentangled by the proposed SDGZSL are more generalizable in tasks of canonical and generalized zero-shot learning.
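For reference, the total correlation between two latent vectors is the KL divergence between their joint distribution and the product of their marginals. The notation below ($h_s$ for the semantic-consistent vector, $h_n$ for the semantic-unrelated vector, $q$ for the encoder's distribution) is assumed for illustration and may differ from the paper's; how the penalty is estimated in practice is not covered here.

```latex
\mathrm{TC}(h_s, h_n)
  = D_{\mathrm{KL}}\big( q(h_s, h_n) \,\|\, q(h_s)\, q(h_n) \big)
  = \mathbb{E}_{q(h_s, h_n)}\left[ \log \frac{q(h_s, h_n)}{q(h_s)\, q(h_n)} \right]
```

This quantity is zero exactly when the two vectors are independent, which is why penalizing it encourages the two factorized representations to carry separate information.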

Requirements

The implementation runs on:

Python 3.6
torch 1.3.1
NumPy
scikit-learn
SciPy
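
Before running anything, it can help to confirm that the listed packages are importable. The following is a minimal sketch and not part of the official repository: the helper name `missing_packages` is hypothetical, and `torch`, `numpy`, `sklearn`, and `scipy` are assumed to be the import names of the packages listed above.

```python
import importlib.util


def missing_packages(import_names):
    """Return the names from import_names that cannot be found in this environment."""
    return [name for name in import_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Import names assumed for the requirements listed above.
    required = ["torch", "numpy", "sklearn", "scipy"]
    missing = missing_packages(required)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

The check only looks packages up without importing them, so it does not verify versions; compare installed versions against the list above separately if needed.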

Usage

Put your datasets in the SDGZSL_data folder and run the scripts.

The extracted features for the APY and AWA datasets are from [1]; those for the FLO and CUB datasets are from [2]. The fine-tuned features for AWA, FLO, and CUB are from [3]. The APY fine-tuned features were extracted by us.

[1] Xian, Yongqin, et al. "Feature generating networks for zero-shot learning." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.

[2] Yu, Yunlong, et al. "Episode-based prototype generating network for zero-shot learning." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.

[3] Narayan, Sanath, et al. "Latent embedding feedback and discriminative features for zero-shot classification." European Conference on Computer Vision (ECCV). 2020.
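
As a quick sanity check before running the scripts, one might confirm which datasets are present in the SDGZSL_data folder. This is only a sketch under stated assumptions: the helper `find_datasets` is hypothetical, and the one-subfolder-per-dataset layout (named APY, AWA, FLO, CUB) is assumed for illustration rather than taken from the repository.

```python
from pathlib import Path


def find_datasets(root, names=("APY", "AWA", "FLO", "CUB")):
    """Split dataset names into (present, absent) by whether root/<name> is a directory.

    The one-subfolder-per-dataset layout is an assumption for illustration;
    check the repository scripts for the layout they actually expect.
    """
    root = Path(root)
    present = [n for n in names if (root / n).is_dir()]
    absent = [n for n in names if n not in present]
    return present, absent


if __name__ == "__main__":
    present, absent = find_datasets("SDGZSL_data")
    print("Found:", present)
    print("Not found:", absent)
```

If the repository expects a different layout or different folder names, pass them through the `names` argument or adapt the path check accordingly.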

Citation:

If you find this useful, please cite our work as follows:

@inproceedings{chen2021semantics,
	title={Semantics Disentangling for Generalized Zero-shot Learning},
	author={Chen, Zhi and Luo, Yadan and Qiu, Ruihong and Huang, Zi and Li, Jingjing and Zhang, Zheng},
	booktitle={ICCV},
	year={2021}
}
