Industrial KNN-based Anomaly Detection

⭐ Now has streamlit support! ⭐ Run $ streamlit run streamlit_app.py

This repo aims to reproduce the results of the following KNN-based anomaly detection methods:

SPADE (Cohen et al. 2021) - knn in z-space and distance to feature maps
PaDiM* (Defard et al. 2020) - distance to multivariate Gaussian of feature maps
PatchCore (Roth et al. 2021) - knn distance to avgpooled feature maps

* actually does not have any knn mechanism, but shares many things implementation-wise.

Install

$ pipenv install -r requirements.txt

Note: I used torch cu11 wheels.

Usage

CLI:

$ python indad/run.py METHOD [--dataset DATASET]

Results can be found under ./results/.

Code example:

from indad.model import SPADE

model = SPADE(k=5, backbone_name="resnet18")

# feed healthy dataset
model.fit(...)

# get predictions
img_lvl_anom_score, pxl_lvl_anom_score = model.predict(...)

Custom datasets

👁️

Check out one of the downloaded MVTec datasets. Naming of images should correspond among folders. Right now there is no support for no ground truth pixel masks.

📂datasets
 ┗ 📂your_custom_dataset
  ┣ 📂 ground_truth/defective
  ┃ ┣ 📂 defect_type_1
  ┃ ┗ 📂 defect_type_2
  ┣ 📂 test
  ┃ ┣ 📂 defect_type_1
  ┃ ┣ 📂 defect_type_2
  ┃ ┗ 📂 good
  ┗ 📂 train/good

$ python indad/run.py METHOD --dataset your_custom_dataset

Results

📝 = paper, 👇 = this repo

Image-level

class	SPADE 📝	SPADE 👇	PaDiM 📝	PaDiM 👇	PatchCore 📝	PatchCore 👇
bottle	-	98.3	98.3	99.9	100.0	100.0
cable	-	88.1	96.7	87.8	99.5	96.2
capsule	-	80.4	98.5	87.6	98.1	95.3
carpet	-	62.5	99.1	99.5	98.7	98.7
grid	-	25.6	97.3	95.5	98.2	93.0
hazelnut	-	92.8	98.2	86.1	100.0	100.0
leather	-	85.6	99.2	100.0	100.0	100.0
metal_nut	-	78.6	97.2	97.6	100.0	98.3
pill	-	78.8	95.7	92.7	96.6	92.8
screw	-	66.1	98.5	79.6	98.1	96.7
tile	-	96.4	94.1	99.5	98.7	99.0
toothbrush	-	83.9	98.8	94.7	100.0	98.1
transistor	-	89.4	97.5	95.0	100.0	99.7
wood	-	85.3	94.7	99.4	99.2	98.8
zipper	-	97.1	98.5	93.8	99.4	98.4
averages	85.5	80.6	97.5	93.9	99.1	97.7

Pixel-level

class	SPADE 📝	SPADE 👇	PaDiM 📝	PaDiM 👇	PatchCore 📝	PatchCore 👇
bottle	97.5	97.7	94.8	97.6	98.6	97.8
cable	93.7	94.4	88.8	95.5	98.5	97.4
capsule	97.6	98.7	93.5	98.1	98.9	98.3
carpet	87.4	99.0	96.2	98.7	99.1	98.3
grid	88.5	96.4	94.6	96.4	98.7	96.7
hazelnut	98.4	98.4	92.6	97.3	98.7	98.1
leather	97.2	99.1	97.8	98.6	99.3	98.4
metal_nut	99.0	96.1	85.6	95.8	98.4	96.2
pill	99.1	93.5	92.7	94.4	97.6	98.7
screw	98.1	98.9	94.4	97.5	99.4	98.4
tile	96.5	93.1	86.0	92.6	95.9	94.0
toothbrush	98.9	98.9	93.1	98.5	98.7	98.1
transistor	97.9	95.8	84.5	96.9	96.4	97.5
wood	94.1	94.5	91.1	92.9	95.1	91.9
zipper	96.5	98.3	95.9	97.0	98.9	97.6
averages	96.9	96.6	92.1	96.5	98.1	97.2

PatchCore-10 was used.

Hyperparams

The following parameters were used to calculate the results. They more or less correspond to the parameters used in the papers.

spade:
  backbone: wide_resnet50_2
  k: 50
padim:
  backbone: wide_resnet50_2
  d_reduced: 250
  epsilon: 0.04
patchcore:
  backbone: wide_resnet50_2
  f_coreset: 0.1
  n_reweight: 3

Progress

Design considerations

Data is processed in single images to avoid batch statistics interference.
I decided to implement greedy kcenter from scratch and there is room for improvement.
torch.nn.AdaptiveAvgPool2d for feature map resizing, torch.nn.functional.interpolate for score map resizing.
GPU is used for backbones and coreset selection. GPU coreset selection currently runs at:
- 400-500 it/s @ float32 (RTX3080)
- 1000+ it/s @ float16 (RTX3080)

Acknowledgements

hcw-00 for tipping sklearn.random_projection.SparseRandomProjection

References

SPADE:

@misc{cohen2021subimage,
      title={Sub-Image Anomaly Detection with Deep Pyramid Correspondences}, 
      author={Niv Cohen and Yedid Hoshen},
      year={2021},
      eprint={2005.02357},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

PaDiM:

@misc{defard2020padim,
      title={PaDiM: a Patch Distribution Modeling Framework for Anomaly Detection and Localization}, 
      author={Thomas Defard and Aleksandr Setkov and Angelique Loesch and Romaric Audigier},
      year={2020},
      eprint={2011.08785},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

PatchCore:

@misc{roth2021total,
      title={Towards Total Recall in Industrial Anomaly Detection}, 
      author={Karsten Roth and Latha Pemula and Joaquin Zepeda and Bernhard Schölkopf and Thomas Brox and Peter Gehler},
      year={2021},
      eprint={2106.08265},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Industrial knn-based anomaly detection for images. Visit streamlit link to check out the demo.

Related tags

Overview

Industrial KNN-based Anomaly Detection

Install

Usage

Custom datasets

Results

Image-level

Pixel-level

Hyperparams

Progress

Design considerations

Acknowledgements

References

Owner

aventau

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech

Pytorch library for fast transformer implementations

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

Tf alloc - Simplication of GPU allocation for Tensorflow2

Contains supplementary materials for reproduce results in HMC divergence time estimation manuscript

Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Image-popularity-score - A novel deep regression method for image scoring.

Self-Learning - Books Papers, Courses & more I have to learn soon

Binary Stochastic Neurons in PyTorch

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

level1-image-classification-level1-recsys-09 created by GitHub Classroom

Moiré Attack (MA): A New Potential Risk of Screen Photos [NeurIPS 2021]

DC3: A Learning Method for Optimization with Hard Constraints

Classification of ecg datas for disease detection

Space robot - (Course Project) Using the space robot to capture the target satellite that is disabled and spinning, then stabilize and fix it up

Official implementation of the paper "Steganographer Detection via a Similarity Accumulation Graph Convolutional Network"

[CVPR 2022] CoTTA Code for our CVPR 2022 paper Continual Test-Time Domain Adaptation

Files for a tutorial to train SegNet for road scenes using the CamVid dataset

AWS provides a Python SDK, "Boto3" ,which can be used to access the AWS-account from the local.