Context Axial Reverse Attention Network for Small Medical Objects Segmentation

Last update: Dec 23, 2022

Overview

CaraNet: Context Axial Reverse Attention Network for Small Medical Objects Segmentation

This repository contains the implementation of CaraNet, a novel attention-based network for segmenting polyps (CVC-T, CVC-ClinicDB, CVC-ColonDB, ETIS and Kvasir) and brain tumors (BraTS). CaraNet shows strong overall segmentation performance (mean Dice) on both polyps and brain tumors, and it also performs well on small medical objects (small polyps and small brain tumors).

The technical report is available here: CaraNet

Architecture of CaraNet

Backbone

We use Res2Net as our backbone.

Context module

We use our CFP module as the context module, with the dilation rate set to 8. Details of the CFP module can be found here: CFPNet. The architecture of the CFP module is shown in the following figure:
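To illustrate what the dilation rate controls, here is a minimal sketch of a 1-D dilated convolution in plain Python. It is not the CFP module itself: the function names `receptive_field` and `dilated_conv1d` are hypothetical, and only the dilation rate of 8 comes from the text above.

```python
def receptive_field(kernel_size, dilation):
    """Number of input samples spanned by one dilated kernel."""
    return dilation * (kernel_size - 1) + 1


def dilated_conv1d(signal, kernel, dilation):
    """'Valid' 1-D dilated correlation: kernel taps sit `dilation` samples apart."""
    span = receptive_field(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        acc = 0.0
        for k, weight in enumerate(kernel):
            acc += weight * signal[start + k * dilation]
        out.append(acc)
    return out


# A 3-tap kernel with dilation 8 spans 17 input samples.
span_for_rate_8 = receptive_field(kernel_size=3, dilation=8)
```

With a fixed kernel size, a larger dilation rate widens the context each output sees without adding weights, which is the usual motivation for dilated convolutions in a context module.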

Axial Reverse Attention

As shown in the architecture of CaraNet, the Axial Reverse Attention (A-RA) module contains two routes: 1) reverse attention; 2) axial attention.
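The reverse attention route can be sketched as follows, assuming the formulation commonly used in reverse-attention segmentation networks: the coarse prediction is passed through a sigmoid, subtracted from 1, and the result weights the feature map element-wise. This is only an illustration on nested lists; the function names are hypothetical, and the axial-attention route and the way the two routes are combined are not shown.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def reverse_attention_weights(coarse_logits):
    """Weights close to 1 where the coarse prediction is background, close to 0 where it is foreground."""
    return [[1.0 - sigmoid(v) for v in row] for row in coarse_logits]


def apply_reverse_attention(features, coarse_logits):
    """Element-wise product of a 2-D feature map with the reverse attention weights."""
    weights = reverse_attention_weights(coarse_logits)
    return [
        [f * w for f, w in zip(f_row, w_row)]
        for f_row, w_row in zip(features, weights)
    ]
```

Because confidently predicted foreground is suppressed, the features that remain come mostly from regions the coarse prediction missed or was unsure about, such as object boundaries.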

Installation & Usage

Environment

Environment: Python 3.6
Install the required packages:

conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch

pip install opencv-python pillow numpy matplotlib
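After installation, a quick check like the sketch below can confirm that the packages are importable. It uses only the standard library; the helper name `missing_modules` is hypothetical, and the list contains the import names (for example `cv2` for opencv-python and `PIL` for pillow), not the install names.

```python
import importlib.util

# Import names (not install names) for the packages listed above.
REQUIRED_MODULES = ["torch", "torchvision", "cv2", "PIL", "numpy", "matplotlib"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the subset of `names` that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    print("Missing:", missing if missing else "none")
```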

Clone this repository

git clone https://github.com/AngeLouCN/CaraNet

Training

Download the training and testing datasets from this link: Experiment Dataset
Change --train_path and --test_path in Train.py
Run Train.py
The testing dataset is organized as follows:

|-- TestDataset
|   |-- CVC-300
|   |   |-- images
|   |   |-- masks
|   |-- CVC-ClinicDB
|   |   |-- images
|   |   |-- masks
|   |-- CVC-ColonDB
|   |   |-- images
|   |   |-- masks
|   |-- ETIS-LaribPolypDB
|   |   |-- images
|   |   |-- masks
|   |-- Kvasir
|       |-- images
|       |-- masks
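Before running the scripts, the layout above can be verified with a small check such as the following sketch. The dataset and subfolder names are taken from the tree; the function name `missing_folders` and the example path are hypothetical.

```python
from pathlib import Path

# Dataset and subfolder names taken from the tree above.
DATASETS = ["CVC-300", "CVC-ClinicDB", "CVC-ColonDB", "ETIS-LaribPolypDB", "Kvasir"]
SUBFOLDERS = ["images", "masks"]


def missing_folders(test_root):
    """List expected `<dataset>/<subfolder>` directories that are absent under `test_root`."""
    root = Path(test_root)
    return [
        f"{dataset}/{sub}"
        for dataset in DATASETS
        for sub in SUBFOLDERS
        if not (root / dataset / sub).is_dir()
    ]


if __name__ == "__main__":
    # "./TestDataset" is a placeholder; point it at your own copy of the data.
    print(missing_folders("./TestDataset"))
```

An empty list means every expected folder was found.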

Testing

Change the data_path in Test.py

Evaluation

Change image_root and gt_root in eval_Kvasir.py
You can also run the MATLAB code in the eval folder; it reports results for four additional evaluation metrics.
You can download the segmentation maps of CaraNet from this link: CaraNet
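The mean Dice score referred to in this README is built on the Dice coefficient, 2|A∩B| / (|A| + |B|). Below is a minimal sketch on binary masks stored as nested lists. It is not the code in eval_Kvasir.py; the function names and the smoothing term `eps` are assumptions made for illustration.

```python
def dice_coefficient(pred, gt, eps=1e-8):
    """Dice = 2 * |pred AND gt| / (|pred| + |gt|) for binary masks of 0/1 values."""
    intersection = 0
    pred_sum = 0
    gt_sum = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            intersection += p * g
            pred_sum += p
            gt_sum += g
    # eps keeps the ratio defined (and equal to 1) when both masks are empty.
    return (2.0 * intersection + eps) / (pred_sum + gt_sum + eps)


def mean_dice(pairs):
    """Average Dice over a sequence of (pred, gt) mask pairs."""
    scores = [dice_coefficient(pred, gt) for pred, gt in pairs]
    return sum(scores) / len(scores)
```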

Segmentation Results

Polyp Segmentation Results

Small polyp analysis

The x-axis is the size of the polyp as a proportion of the image (%); the y-axis is the average mean Dice coefficient.

Plots are provided for: Kvasir, CVC-ClinicDB, CVC-ColonDB, ETIS, CVC-300
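One way such a size-versus-Dice curve could be computed is sketched below: the object size is the percentage of foreground pixels in the ground-truth mask, samples are grouped into size bins, and Dice scores are averaged per bin. The bin width and the function names are assumptions; the exact binning used for the plots is not stated here.

```python
import math
from collections import defaultdict


def size_proportion(gt):
    """Percentage of foreground (value 1) pixels in a binary ground-truth mask."""
    total = sum(len(row) for row in gt)
    foreground = sum(sum(row) for row in gt)
    return 100.0 * foreground / total


def mean_dice_by_size(samples, bin_width=1.0):
    """Group (size_percent, dice) pairs into bins and average Dice per bin.

    Returns {bin_lower_edge: mean_dice}, ordered by bin edge.
    """
    bins = defaultdict(list)
    for size_percent, dice in samples:
        lower_edge = math.floor(size_percent / bin_width) * bin_width
        bins[lower_edge].append(dice)
    return {edge: sum(values) / len(values) for edge, values in sorted(bins.items())}
```

Plotting the returned bin edges against the averaged scores gives a curve with the same axes as described above.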

Brain Tumor Segmentation Results

Small tumor analysis

Citation

@article{lou2021cfpnet,
  title={CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation},
  author={Lou, Ange and Loew, Murray},
  journal={arXiv preprint arXiv:2103.12212},
  year={2021}
}
