Official repository of Semantic Image Matting

Last update: Dec 29, 2022

Related tags

Overview

Semantic Image Matting

This is the official repository of Semantic Image Matting (CVPR2021).

Overview

Natural image matting separates the foreground from background in fractional occupancy which can be caused by highly transparent objects, complex foreground (e.g., net or tree), and/or objects containing very fine details (e.g., hairs). Although conventional matting formulation can be applied to all of the above cases, no previous work has attempted to reason the underlying causes of matting due to various foreground semantics.

We show how to obtain better alpha mattes by incorporating into our framework semantic classification of matting regions. Specifically, we consider and learn 20 classes of matting patterns, and propose to extend the conventional trimap to semantic trimap. The proposed semantic trimap can be obtained automatically through patch structure analysis within trimap regions. Meanwhile, we learn a multi-class discriminator to regularize the alpha prediction at semantic level, and content-sensitive weights to balance different regularization losses.

Dataset

Download our semantic image matting dataset (SIMD) here. SIMD is composed self-collected images and a subset of adobe images. To obtain the complete dataset, please contact Brian Price ([email protected]) for the Adobe Image Matting dataset first and follow the instructions within SIMD.zip.

Requirements

The codes are tested in the following environment:

Python 3.7
Pytorch 1.9.0
CUDA 10.2 & CuDNN 7.6.5

Performance

Some pretrained models are listed below with their performance.

Methods	SAD	MSE	Grad	Conn	Link
SIMD	27.9	4.7	11.6	20.8	model
Composition-1K (paper)	28.0	5.8	10.8	24.8
Composition-1K (repo)	27.7	5.6	10.7	24.4	model

Run

Download the model and put it under checkpoints/DIM or checkpoints/Adobe in the root directory. Download the classifier here and put it under checkpoints. Run the inference and evaluation by

python scripts/main.py -c config/CONFIG.yaml

Results

Reference

If you find our work useful in your research, please consider citing:

@inproceedings{sun2021sim,
  author    = {Yanan Sun and Chi-Keung Tang and Yu-Wing Tai}
  title     = {Semantic Image Matting},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
}

Acknowledgment

This repo borrows code from several repos, like GCA and FBA.

Official repository of Semantic Image Matting

Related tags

Overview

Semantic Image Matting

Overview

Dataset

Requirements

Performance

Run

Results

Reference

Acknowledgment

Owner

HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks

AutoML library for deep learning

Predictive AI layer for existing databases.

Faster Convex Lipschitz Regression

Text to image synthesis using thought vectors

Code for the paper "A Study of Face Obfuscation in ImageNet"

This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.

official Pytorch implementation of ICCV 2021 paper FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Replication package for the manuscript "Using Personality Detection Tools for Software Engineering Research: How Far Can We Go?" submitted to TOSEM

This is the repository for CVPR2021 Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales

Official implementation for “Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior”

Plug and play transformer you can find network structure and official complete code by clicking List

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

Supporting code for short YouTube series Neural Networks Demystified.

Source code and Dataset creation for the paper "Neural Symbolic Regression That Scales"

A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks

A new framework, collaborative cascade prediction based on graph neural networks (CCasGNN) to jointly utilize the structural characteristics, sequence features, and user profiles.

The code for Expectation-Maximization Attention Networks for Semantic Segmentation (ICCV'2019 Oral)

Neural Scene Flow Fields using pytorch-lightning, with potential improvements

GazeScroller - Using Facial Movements to perform Hands-free Gesture on the system