Official PyTorch implementation of the paper "Inception Convolution with Efficient Dilation Search" (CVPR 2021 Oral).

Last update: Dec 31, 2022

Overview

IC-Conv

This repository is the official implementation of the paper "Inception Convolution with Efficient Dilation Search".

Getting Started

Download the ImageNet pre-trained checkpoints.

Extract the file to get the following directory tree:

|-- README.md
|-- ckpt
|   |-- detection
|   |-- human_pose
|   |-- segmentation
|-- config
|-- model
|-- pattern_zoo
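To confirm that the archive was extracted into the right place, the entries shown in the tree above can be checked programmatically. The following is a minimal sketch: the helper name `missing_entries` and the `EXPECTED` list are illustrative and not part of the repository, and only the names visible in the tree are checked.

```python
from pathlib import Path

# Entries taken from the directory tree above, relative to the repository root.
EXPECTED = [
    "README.md",
    "ckpt/detection",
    "ckpt/human_pose",
    "ckpt/segmentation",
    "config",
    "model",
    "pattern_zoo",
]


def missing_entries(repo_root):
    """Return the expected entries that do not exist under repo_root."""
    root = Path(repo_root)
    return [entry for entry in EXPECTED if not (root / entry).exists()]
```

Calling `missing_entries(".")` from the repository root should return an empty list once the checkpoints are in place; any names it returns are the ones still missing.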

Easy Use

The current implementation is coupled to specific downstream tasks. OpenMMLab users can use IC-Conv as follows.

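import torch
from models import IC_ResNet

# Build IC-ResNet-50 using a dilation pattern from pattern_zoo/detection.
net = IC_ResNet(depth=50, pattern_path='pattern_zoo/detection/ic_r50_k9.json')
net.eval()

# Run a forward pass on a random input; gradients are not needed for inference.
inputs = torch.rand(1, 3, 32, 32)
with torch.no_grad():
    outputs = net(inputs)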

For 2d Human Pose Estimation using MMPose

Copy the config files to the config path of MMPose, for example:

cp config/human_pose/ic_res50_k13_coco_640x640.py your_mmpose_path/mmpose/configs/bottom_up/resnet/coco/ic_res50_k13_coco_640x640.py

Copy the inception conv files to the model path of MMPose:

cp model/ic_conv2d.py your_mmpose_path/mmpose/mmpose/models/backbones/ic_conv2d.py
cp model/ic_resnet.py your_mmpose_path/mmpose/mmpose/models/backbones/ic_resnet.py

Then run it directly, as with any other MMPose config.
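The copy steps above can also be scripted. The sketch below mirrors the three `cp` commands using only the Python standard library; the function name `install_into_mmpose` and the `FILES_TO_COPY` list are illustrative and not part of this repository or of MMPose. It assumes `mmpose_root` points at the MMPose checkout itself (`your_mmpose_path/mmpose` in the commands above).

```python
import shutil
from pathlib import Path

# (source relative to this repository, destination relative to the MMPose checkout),
# mirroring the cp commands above.
FILES_TO_COPY = [
    ("config/human_pose/ic_res50_k13_coco_640x640.py",
     "configs/bottom_up/resnet/coco/ic_res50_k13_coco_640x640.py"),
    ("model/ic_conv2d.py", "mmpose/models/backbones/ic_conv2d.py"),
    ("model/ic_resnet.py", "mmpose/models/backbones/ic_resnet.py"),
]


def install_into_mmpose(repo_root, mmpose_root):
    """Copy the IC-Conv config and backbone files into an MMPose checkout."""
    mmpose_root = Path(mmpose_root)
    if not mmpose_root.is_dir():
        raise NotADirectoryError(f"MMPose checkout not found: {mmpose_root}")
    copied = []
    for src_rel, dst_rel in FILES_TO_COPY:
        src = Path(repo_root) / src_rel
        dst = mmpose_root / dst_rel
        if not src.is_file():
            raise FileNotFoundError(f"Expected file not found: {src}")
        # Unlike plain cp, create the destination folder if it is missing.
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dst)
        copied.append(dst)
    return copied
```

For example, `install_into_mmpose(".", "your_mmpose_path/mmpose")` run from this repository's root returns the three destination paths. Failing early on a missing source file or checkout is a deliberate choice here, so that a mistyped path does not go unnoticed.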

Model Zoo

We provide the pre-trained weights of IC-ResNet-50, IC-ResNet-101, and IC-ResNeXt-101 (32x4d) on ImageNet, as well as the weights trained on specific tasks.

Users with limited computing power can directly reuse our provided IC-Conv and ImageNet pre-trained weights for detection, segmentation, and 2D human pose estimation tasks on other datasets.

Note: The links in the tables below are relative paths, so you need to clone the repository and download the checkpoints for them to resolve.

Object Detection

Detector	Backbone	Lr schedule	AP	dilation_pattern	checkpoint
Faster-RCNN-FPN	IC-R50	1x	38.9	pattern	ckpt/imagenet_retrain_ckpt
Faster-RCNN-FPN	IC-R101	1x	41.9	pattern	ckpt/imagenet_retrain_ckpt
Faster-RCNN-FPN	IC-X101-32x4d	1x	42.1	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R50	1x	42.4	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R101	1x	45.0	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-X101-32x4d	1x	45.7	pattern	ckpt/imagenet_retrain_ckpt

Instance Segmentation

Detector	Backbone	Lr schedule	box AP	mask AP	dilation_pattern	checkpoint
Mask-RCNN-FPN	IC-R50	1x	40.0	35.9	pattern	ckpt/imagenet_retrain_ckpt
Mask-RCNN-FPN	IC-R101	1x	42.6	37.9	pattern	ckpt/imagenet_retrain_ckpt
Mask-RCNN-FPN	IC-X101-32x4d	1x	43.4	38.4	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R50	1x	43.4	36.8	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R101	1x	45.7	38.7	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-X101-32x4d	1x	46.4	39.1	pattern	ckpt/imagenet_retrain_ckpt

2d Human Pose Estimation

We adjusted the learning rate of the ResNet backbone in MMPose and obtained better baseline results. Please see the specific config files in config/human_pose/.

Results on COCO val2017 without multi-scale test

Backbone	Input Size	AP	dilation_pattern	checkpoint
R50(mmpose)	640x640	47.9	~	~
R50	640x640	51.0	~	~
IC-R50	640x640	62.2	pattern	ckpt/imagenet_retrain_ckpt
R101	640x640	55.5	~	~
IC-R101	640x640	63.3	pattern	ckpt/imagenet_retrain_ckpt

Results on COCO val2017 with multi-scale test. Three default scales ([2, 1, 0.5]) are used.

Backbone	Input Size	AP
R50(mmpose)	640x640	52.5
R50	640x640	55.8
IC-R50	640x640	65.8
R101	640x640	60.2
IC-R101	640x640	68.5
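To make the comparison in the two pose tables explicit, the AP gain of each IC backbone over its retuned ResNet baseline can be computed directly from the reported numbers. This is a small worked example only; the `RESULTS` dictionary and the `ap_gains` helper are illustrative names, and the values are copied from the tables above.

```python
# AP values copied from the two tables above (COCO val2017, 640x640 input).
RESULTS = {
    "single-scale": {"R50": 51.0, "IC-R50": 62.2, "R101": 55.5, "IC-R101": 63.3},
    "multi-scale": {"R50": 55.8, "IC-R50": 65.8, "R101": 60.2, "IC-R101": 68.5},
}


def ap_gains(results):
    """Return the AP gain of each IC-* backbone over its plain counterpart."""
    gains = {}
    for setting, aps in results.items():
        for name, ap in aps.items():
            if name.startswith("IC-"):
                baseline = name[len("IC-"):]
                gains[(setting, name)] = round(ap - aps[baseline], 1)
    return gains


for (setting, name), gain in ap_gains(RESULTS).items():
    print(f"{setting:12s} {name:8s} {gain:+.1f} AP")
```

With the numbers reported above, the gains are +11.2 AP (IC-R50) and +7.8 AP (IC-R101) without multi-scale testing, and +10.0 AP and +8.3 AP with it.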

Acknowledgement

The human pose estimation experiments are built upon MMPose.

Citation

If our paper helps your research, please cite it in your publications:

@article{liu2020inception,
 title={Inception Convolution with Efficient Dilation Search},
 author={Liu, Jie and Li, Chuming and Liang, Feng and Lin, Chen and Sun, Ming and Yan, Junjie and Ouyang, Wanli and Xu, Dong},
 journal={arXiv preprint arXiv:2012.13587},
 year={2020}
}

Owner

Jie Liu
