BoxInst: High-Performance Instance Segmentation with Box Annotations

Last update: Dec 21, 2022

Introduction

This repository contains the code submitted to the OpenMMLab Algorithm Ecological Challenge. It implements the paper BoxInst: High-Performance Instance Segmentation with Box Annotations.

License

This project is released under the Apache 2.0 license.

Benchmark and model zoo

BoxInst (CVPR'2021)
CondInst (ECCV'2020)

BoxInst

Name	box AP	mask AP	log	download
BoxInst_MS_R_50_1x	0.390	0.304	log	model
BoxInst_MS_R_50_90k	0.388	0.302	log	model
BoxInst_MS_R_101_90k	0.410	0.318	-	model

Some other methods in MMDetection are also supported.

Getting Started

This project is built entirely on MMCV and MMDetection. Please see get_started.md for the basic usage of MMDetection.

Train

Please see the doc to start training. For example:

CUDA_VISIBLE_DEVICES=0,1,2,3 PORT=29500 ./tools/dist_train.sh configs/boxinst/boxinst_r50_caffe_fpn_coco_mstrain_1x.py 4

Please follow the linear scaling rule to adjust the batch size, learning rate, and number of iterations when your GPU setup differs from the one used in the config.
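As a rough illustration of the linear scaling rule, the sketch below scales the learning rate in proportion to the total batch size and the iteration count in inverse proportion, so the number of training samples seen stays roughly constant. The function name `scale_schedule` and the reference values in the usage example (learning rate 0.01, batch size 16, 90k iterations) are assumptions chosen for illustration; read the actual values from the config you are training with.

```python
def scale_schedule(base_lr, base_batch_size, base_iters, num_gpus, samples_per_gpu):
    """Apply the linear scaling rule to a reference schedule.

    All names here are hypothetical helpers, not part of MMDetection.
    """
    # Total batch size actually used for training.
    batch_size = num_gpus * samples_per_gpu
    # Ratio between the new batch size and the reference batch size.
    factor = batch_size / base_batch_size
    return {
        "batch_size": batch_size,
        # Learning rate grows or shrinks linearly with the batch size.
        "lr": base_lr * factor,
        # Iterations change inversely, keeping total samples seen about equal.
        "iters": round(base_iters / factor),
    }


if __name__ == "__main__":
    # Illustrative reference values only (assumed, not taken from this repository).
    schedule = scale_schedule(
        base_lr=0.01, base_batch_size=16, base_iters=90000,
        num_gpus=4, samples_per_gpu=2,
    )
    print(schedule)
```

With these assumed reference values, halving the batch size from 16 to 8 halves the learning rate and doubles the iteration count. Any learning-rate decay steps in the config would need to be rescaled by the same factor as the iterations.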

Inference and Eval

python tools/test.py configs/boxinst/boxinst_r50_caffe_fpn_coco_mstrain_1x.py work_dirs/boxinst_r50_caffe_fpn_coco_mstrain_1x/latest.pth --eval bbox segm

Acknowledgement

MMCV: OpenMMLab foundational library for computer vision.
MMDetection: OpenMMLab detection toolbox and benchmark.
