we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks.

Last update: Aug 28, 2022

Related tags

Deep Learning FARNet

Overview

Feature Aggregation and Refinement Network for 2D Anatomical Landmark Detection

Overview

Localization of anatomical landmarks is essential for clinical diagnosis, treatment planning, and research. In this paper, we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks. To alleviate the problem of limited training data in the medical domain, our network adopts a CNN pre-trained on natural images as the backbone network and several popular networks have been compared. Our FARNet also includes a multi-scale feature aggregation module for multiscale feature fusion and a feature refinement module for high-resolution heatmap regression. Coarse-to-fine supervisions are applied to the two modules to facilitate the endto-end training. We further propose a novel loss function named Exponential Weighted Center loss for more accurate heatmap regression, which focuses on the losses from the pixels near landmarks and suppresses the ones from far away. Our network has been evaluated on three publicly available anatomical landmark detection datasets, including cephalometric radiographs, hand radiographs, and spine radiographs, and achieves state-of-art performances on all three datasets.

Data

In this paper, we evaluate our landmark detection network on three public benchmark data sets, a cephalometric X-rays dataset [1], a hand X-rays dataset [2] and a Spinal AnteriorPosterior (AP) X-rays dataset [3].

How to use

Dependencies

This tutorial depends on the following libraries:

pytorch = 1.0.1
numpy = 1.18.5
python >= 3.6
xlwt

config.py

You should set the image path in config by yourself

Run main.py

Run main.py to train the model and test its performance

Some results

Reference

[1] C.-W. Wang, C.-T. Huang, J.-H. Lee, C.-H. Li, S.-W. Chang, M.-J.Siao, T.-M. Lai, B. Ibragimov, T. Vrtovec, O. Ronneberger, et al., “A benchmark for comparison of dental radiography analysis algorithms,” Medical image analysis, vol. 31, pp. 63–76, 2016.
[2] C. Payer, D. ˇStern, H. Bischof, and M. Urschler, “Integrating spatial configuration into heatmap regression based cnns for landmark localization,” Medical Image Analysis, vol. 54, pp. 207–219, 2019.
[3] H. Wu, C. Bailey, P. Rasoulinejad, and S. Li, “Automatic landmark estimation for adolescent idiopathic scoliosis assessment using boostnet,” in International Conference on Medical Image Computing and ComputerAssisted Intervention, 2017.

we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks.

Related tags

Overview

Feature Aggregation and Refinement Network for 2D Anatomical Landmark Detection

Overview

Data

How to use

Dependencies

config.py

Run main.py

Some results

Reference

Owner

aoyueyuan

Official implementation of Influence-balanced Loss for Imbalanced Visual Classification in PyTorch.

Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

this is a lite easy to use virtual keyboard project for anyone to use

codes for paper Combining Dynamic Local Context Focus and Dependency Cluster Attention for Aspect-level sentiment classification

Code for our paper "Multi-scale Guided Attention for Medical Image Segmentation"

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning

Accelerate Neural Net Training by Progressively Freezing Layers

Implementation of Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

Source code for "Roto-translated Local Coordinate Framesfor Interacting Dynamical Systems"

Project code for weakly supervised 3D object detectors using wide-baseline multi-view traffic camera data: WIBAM.

A pre-trained language model for social media text in Spanish

StyleGAN2-ADA-training-jupyter - Training custom datasets in styleGAN2-ADA by NVIDIA using Jupyter

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

Filtering variational quantum algorithms for combinatorial optimization

Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

Angular & Electron desktop UI framework. Angular components for native looking and behaving macOS desktop UI (Electron/Web)

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks