A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open-source semantic segmentation toolbox built on PyTorch. It grew out of the code we wrote for a Tianchi competition in 2021 (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
Our team placed third in that competition (see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+. The project supports popular, contemporary semantic segmentation frameworks such as UNet, DeepLabV3+, and HR-Net.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SENet (CVPR'2018)
IBN-Net (ECCV'2018)
EfficientNet (ICML'2019)

Methods

Tricks

Tools

Large image inference (cut and merge)
Post-processing (CRF/superpixels)
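
As a rough illustration of superpixel-based post-processing, the sketch below gives every pixel the majority class of the superpixel it falls in. This is one common way to use superpixels and is an assumption here, not necessarily what this repository does. The function name `smooth_with_superpixels` and the toy inputs are hypothetical; in practice the superpixel ids would come from a superpixel algorithm and the labels from the model.

```python
from collections import Counter


def smooth_with_superpixels(labels, segments):
    """Give every pixel the majority class of the superpixel it belongs to.

    labels:   2D list of predicted class ids (hypothetical model output).
    segments: 2D list of superpixel ids with the same shape.
    """
    votes = {}
    for label_row, seg_row in zip(labels, segments):
        for label, seg in zip(label_row, seg_row):
            votes.setdefault(seg, Counter())[label] += 1
    # Pick the most frequent class inside each superpixel.
    majority = {seg: counts.most_common(1)[0][0] for seg, counts in votes.items()}
    return [[majority[seg] for seg in seg_row] for seg_row in segments]


# Toy prediction with one stray pixel in each of the two superpixels.
labels = [[0, 0, 1, 1],
          [0, 1, 1, 1],
          [0, 0, 1, 0]]
segments = [[0, 0, 1, 1],
            [0, 0, 1, 1],
            [0, 0, 1, 1]]
smoothed = smooth_with_superpixels(labels, segments)
```

The two stray pixels are overruled by their neighbours, so small isolated errors disappear while boundaries follow the superpixel edges.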

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: Path to the training config file for the model

Examples:
We trained our Tianchi competition model in two stages with the following commands:
Stage 1 (160e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (swa 24e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml
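
Assuming "swa" in the Stage 2 config stands for stochastic weight averaging, the core idea can be sketched as a running mean over the weights seen at the end of successive epochs. The helper `update_swa` and the toy checkpoints are hypothetical and use plain floats instead of tensors; this is not the code in train.py. PyTorch 1.6+ ships a ready-made version of this averaging as `torch.optim.swa_utils.AveragedModel`.

```python
def update_swa(swa_weights, new_weights, num_averaged):
    """Fold one more set of weights into the running average.

    num_averaged is how many weight sets are already in swa_weights.
    """
    if num_averaged == 0:
        return dict(new_weights)
    return {
        name: swa_weights[name]
        + (new_weights[name] - swa_weights[name]) / (num_averaged + 1)
        for name in new_weights
    }


# Hypothetical weights captured after three consecutive epochs.
checkpoints = [
    {"w": 1.0, "b": 0.0},
    {"w": 2.0, "b": 0.5},
    {"w": 3.0, "b": 1.0},
]

swa = {}
for n, checkpoint in enumerate(checkpoints):
    swa = update_swa(swa, checkpoint, n)
```

After the loop, `swa` holds the element-wise mean of the three checkpoints. The incremental form avoids keeping every checkpoint in memory.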

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: Path to the inference config file for the model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} --temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: Path to the inference config file for the model
IMAGE_FILE_PATH: Path to the large input image to predict
TEMP_CUT_PATH: Temporary folder for the small tiles cut from the input image
TEMP_SAVE_PATH: Temporary folder for the predictions on those tiles
SAVE_SEG_FILE: Output path for the merged segmentation map of the large image
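
The cut-and-merge flow behind these parameters can be sketched roughly as follows. This is a minimal in-memory illustration and not the code in predict_demo.py, which (judging by its arguments) writes tiles and tile predictions to temporary folders. All names here (`tile_starts`, `cut`, `merge`, `predict_tile`) are hypothetical, and `predict_tile` is a stand-in threshold instead of a real model.

```python
def tile_starts(size, tile, stride):
    """Start offsets so that tiles of length `tile` cover [0, size).

    Assumes stride <= tile, otherwise gaps would appear between tiles.
    """
    if size <= tile:
        return [0]
    starts = list(range(0, size - tile, stride))
    starts.append(size - tile)  # last tile sits flush with the border
    return starts


def cut(image, tile, stride):
    """Yield (top, left, patch) for every tile of a 2D list-of-lists image."""
    height, width = len(image), len(image[0])
    for top in tile_starts(height, tile, stride):
        for left in tile_starts(width, tile, stride):
            patch = [row[left:left + tile] for row in image[top:top + tile]]
            yield top, left, patch


def merge(patches, height, width):
    """Paste per-tile predictions back; later tiles overwrite overlaps."""
    canvas = [[None] * width for _ in range(height)]
    for top, left, patch in patches:
        for dy, row in enumerate(patch):
            canvas[top + dy][left:left + len(row)] = row
    return canvas


def predict_tile(patch):
    # Stand-in for the segmentation model: thresholds each pixel.
    return [[1 if value > 4 else 0 for value in row] for row in patch]


image = [[(x + y) % 10 for x in range(7)] for y in range(5)]
predicted = [(top, left, predict_tile(patch))
             for top, left, patch in cut(image, tile=4, stride=3)]
seg_map = merge(predicted, height=5, width=7)
```

Because the stand-in predictor works pixel by pixel, the merged map equals a prediction on the whole image. A real model depends on context, so overlapping tiles are often blended or cropped at the borders instead of simply overwritten.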

Owner

Clark He
