A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Last update: Jul 12, 2022

Related tags

Deep Learning buggy-resizing-critique

Overview

A Criticism of the Paper On Buggy Resizing Libraries

This repository contains:

a Jupyter notebook for reproducing the aliased image downsampling fenomenon, as demonstrated in the On Buggy Resizing Libraries paper, which argues that the image downsampling methods of the OpenCV, Tensorflow and PyTorch libraries are "buggy", with only PIL being correct.
simple solutions for antialiasing in every framework, which solves the issue in all cases using the same functions, simply by setting parameters appropriately:
- OpenCV: change the interpolation from bilinear to area (from cv2.INTER_LINEAR to cv2.INTER_AREA)
- Tensorflow: set the antialias flag to True
- PyTorch: change the interpolation mode from bilinear to area, or simply use torchvision.transforms.Resize() instead of torch.nn.functional.interpolate()

Try it out in a Colab Notebook:

My opinion:

neither of the used image downsampling methods is "buggy", not applying antialiasing by default is an understandable design decision for both image and tensor operations.
the main figure of the paper is misleading, and it only illustrates the issues of aliasing for image resizing.
the aliasing issue with downsampling can be solved in all frameworks by simply setting a few parameters correctly. My criticism is that this is not mentioned in the paper.
torchvision.transforms.Resize() is claimed to only be a "a wrapper around the PIL library" in a note in Section 3.2 of the paper. This is true for PIL image inputs, but is incorrect for torch.Tensors, which are resized using torchvision interpolation operations.
the remaining parts of the paper provide valuable insights into the effects of interpolation methods, quantization and compression on the FID score of generative models.

Update: Just found out that there is another, very thorough investigation of the same issue. Highly recommend checking the blogpost out. They also implement an OpenCV-compatible Pillow-equivalent resizing that provides proper antialiasing for all interpolations.

Bilinear downsampling results with and without aliasing:

The main figure (Figure 1) of the paper:

A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Related tags

Overview

A Criticism of the Paper On Buggy Resizing Libraries

Owner

Just Go with the Flow: Self-Supervised Scene Flow Estimation

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

New AidForBlind - Various Libraries used like OpenCV and other mentioned in Requirements.txt

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

DiscoNet: Learning Distilled Collaboration Graph for Multi-Agent Perception [NeurIPS 2021]

[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

[NeurIPS'21] Projected GANs Converge Faster

Adaptive, interpretable wavelets across domains (NeurIPS 2021)

This repository is dedicated to developing and maintaining code for experiments with wide neural networks.

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

An alarm clock coded in Python 3 with Tkinter

Joint Versus Independent Multiview Hashing for Cross-View Retrieval[J] (IEEE TCYB 2021, PyTorch Code)

ICCV2021 Oral SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Springer Link Download Module for Python

Deep Sea Treasure Environment for Multi-Objective Optimization Research

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Related tags

Overview

A Criticism of the Paper On Buggy Resizing Libraries

Owner

Just Go with the Flow: Self-Supervised Scene Flow Estimation

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

New AidForBlind - Various Libraries used like OpenCV and other mentioned in Requirements.txt

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

DiscoNet: Learning Distilled Collaboration Graph for Multi-Agent Perception [NeurIPS 2021]

[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

[NeurIPS'21] Projected GANs Converge Faster

Adaptive, interpretable wavelets across domains (NeurIPS 2021)

This repository is dedicated to developing and maintaining code for experiments with wide neural networks.

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

An alarm clock coded in Python 3 with Tkinter

Joint Versus Independent Multiview Hashing for Cross-View Retrieval[J] (IEEE TCYB 2021, PyTorch Code)

ICCV2021 Oral SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen*, Kaixiong Zhou*, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Springer Link Download Module for Python

Deep Sea Treasure Environment for Multi-Objective Optimization Research

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang