edge-SR: Super-Resolution For The Masses

Last update: Nov 10, 2022

Related tags

Overview

edge-SR: Super Resolution For The Masses

Citation

Pablo Navarrete Michelini, Yunhua Lu and Xingqun Jiang. "edge-SR: Super-Resolution For The Masses", in IEEE Winter conference on Applications of Computer Vision (WACV), 2022.

BibTeX

@inproceedings{eSR,
    title     = {edge--{SR}: Super--Resolution For The Masses},
    author    = {Navarrete~Michelini, Pablo and Lu, Yunhua and Jiang, Xingqun},
    booktitle = {Proceedings of the {IEEE/CVF} Winter Conference on Applications of Computer Vision ({WACV})},
    month     = {January},
    year      = {2022},
    pages     = {1078--1087},
    url       = {https://arxiv.org/abs/2108.10335}
}

Instructions:

Place input images in input directory (provided as empty directory). Color images will be converted to grayscale.
To upscale images run: python run.py.

Output images will come out in output directory.
The GPU number and model file can be changed in run.py (in comment "CHANGE HERE").

Requirements:

Python 3, PyTorch, NumPy, Pillow, OpenCV

Experiment results

The data directory contains the file tests.pkl that has the Python dictionary with all our test results on different devices. The following sample code shows how to read the file:

>>> import pickle
>>> test = pickle.load(open('tests.pkl', 'rb'))
>>> test['Bicubic_s2']
    {'psnr_Set5': 33.72849620514912,
     'ssim_Set5': 0.9283912810369976,
     'lpips_Set5': 0.14221979230642318,
     'psnr_Set14': 30.286027790636204,
     'ssim_Set14': 0.8694934108301432,
     'lpips_Set14': 0.19383049915943826,
     'psnr_BSDS100': 29.571233006609656,
     'ssim_BSDS100': 0.8418117904964167,
     'lpips_BSDS100': 0.26246454380452633,
     'psnr_Urban100': 26.89378248655882,
     'ssim_Urban100': 0.8407461069831571,
     'lpips_Urban100': 0.21186692919582129,
     'psnr_Manga109': 30.850672809780587,
     'ssim_Manga109': 0.9340133711400112,
     'lpips_Manga109': 0.102985977955641,
     'parameters': 104,
     'speed_AGX': 18.72132628065749,
     'power_AGX': 1550,
     'speed_MaxQ': 632.5429857814075,
     'power_MaxQ': 50,
     'temperature_MaxQ': 76,
     'memory_MaxQ': 2961,
     'speed_RPI': 11.361346064182795,
     'usage_RPI': 372.8714285714285}

The keys of the dictionary identify the name of each model and its hyper--parameters using the following format:

Bicubic_s#,
eSR-MAX_s#_K#_C#,
eSR-TM_s#_K#_C#,
eSR-TR_s#_K#_C#,
eSR-CNN_s#_C#_D#_S#,
ESPCN_s#_D#_S#, or
FSRCNN_s#_D#_S#_M#,

where # represents an integer number with the value of the correspondent hyper-parameter. For each model the data of the dictionary contains a second dictionary with the information displayed above. This includes: number of model parameters; image quality metrics PSNR, SSIM and LPIPS measured in 5 different datasets; as well as power, speed, CPU usage, temperature and memory usage for devices AGX (Jetson AGX Xavier), MaxQ (GTX 1080 MaxQ) and RPI (Raspberry Pi 400).

edge-SR: Super-Resolution For The Masses

Related tags

Overview

edge-SR: Super Resolution For The Masses

Citation

BibTeX

Instructions:

Requirements:

Experiment results

Owner

Pablo

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

Image2pcl - Enter the metaverse with 2D image to 3D projections

The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

It analyze the sentiment of the user, whether it is postive or negative.

Code for "Finetuning Pretrained Transformers into Variational Autoencoders"

Intent parsing and slot filling in PyTorch with seq2seq + attention

华为商城抢购手机的Python脚本 Python script of Huawei Store snapping up mobile phones

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Extracting Summary Knowledge Graphs from Long Documents

Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS)

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Neural-Machine-Translation - Implementation of revolutionary machine translation models

Toward a Visual Concept Vocabulary for GAN Latent Space, ICCV 2021

Legal text retrieval for python

edge-SR: Super-Resolution For The Masses

Related tags

Overview

edge-SR: Super Resolution For The Masses

Citation

BibTeX

Instructions:

Requirements:

Experiment results

Owner

Pablo

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

Image2pcl - Enter the metaverse with 2D image to 3D projections

The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

It analyze the sentiment of the user, whether it is postive or negative.

Code for "Finetuning Pretrained Transformers into Variational Autoencoders"

Intent parsing and slot filling in PyTorch with seq2seq + attention

华为商城抢购手机的Python脚本 Python script of Huawei Store snapping up mobile phones

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Extracting Summary Knowledge Graphs from Long Documents

Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS)

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Neural-Machine-Translation - Implementation of revolutionary machine translation models

Toward a Visual Concept Vocabulary for GAN Latent Space, ICCV 2021

Legal text retrieval for python

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。