Computer Vision Paper Reviews with Key Summary of paper, End to End Code Practice and Jupyter Notebook converted papers

Overview

Computer-Vision-Paper-Reviews

Computer Vision Paper Reviews with Key Summary along Papers & Codes.

Jonathan Choi 2021

The repository provides 100+ Papers across Computer Vision fields converted as Jupyter Notebook, with the Key Summary and End to End Code Practice.


Contents

The goal of the repository is providing an end to end study scripts of most read and important papers.

The prefered readers are not limited for researchers, but also for students and engieeners from rookies to the professions in computer vision fields .

To provide the perfect and rich understanding, each paper contains following three main contents.

Key Summary

Providing key summaries and terminologies of the paper so that even rookies can study as perfectly and easily as possible.

Code Practice

Providing an end to end study script of codes for the paper so that even rookies can study as easily and perfectly as possible.

Jupyter Notebook edited Original Paper

Providing the Original Paper converted into Jupyter notbook for easy and fast modification and understanding.


Category/Paper/

Paper_Review_Practice.ipynb includes

Key Summary according to the flow of Original Paper (Jupyter Notebook Edited) with the End to End Code Practice

Paper.ipynb includes

Original Paper (Jupyter Notebook Edited)

Review.ipynb includes

Key Summary

Practice.ipynb includes

End to End Code Practice


Index


Working Papers

If you want to see Road Map and the process, please visit here.


Implicit Neural Representation

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Instance Normalization: The Missing Ingredient for Fast Stylization

Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization

Semantic Image Synthesis with Spatially-Adaptive Normalization

Universal Style Transfer via Feature Transforms

A Neural Algorithm of Artistic Style

Convolutional neural network architecture for geometric matching

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Geometric Style Transfer


Image to Image Translation

Image-to-Image Translation with Conditional Adversarial Networks (CVPR 2017)

Bi-level Feature Alignment for Versatile Image Translation and Manipulation


Transformer

[DETR] End-to-End Object Detection with Transformers

[Vision Transformer] An Image Is Worth 16x16 Words: Transformers For Image Recognition at Scale

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

[Transformer] Attention Is All You Need

Vision Transformers for Dense Prediction


Object Detection

Feature Pyramid Networks for Object Detection

Selective Search for Object Recognition

R-CNN

Fast R-CNN

Faster R-CNN

Sparse R-CNN

YOLOv4: Optimal Speed and Accuracy of Object Detection**


Segmentation

Panoptic Feature Pyramid Networks

Mask R-CNN

PointRend: Image Segmentation as Rendering

Cost Aggregation Is All You Need for Few-Shot Segmentation


Convolutional Neural Network

Deep Residual Learning for Image Recognition

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks


Representation Learning

Unsupervised Feature Learning via Non-Parametric Instance Discrimination

Momentum Contrast for Unsupervised Visual Representation Learning.

A Simple Framework for Contrastive Learning of Visual Representations

Bootstrap Your Own Latent- A New Approach to Self-supervised Learning

Exploring Simple Siamese Representation Learning


Image Generation

Generative Adversarial Networks

A Style-Based Generator Architecture for Generative Adversarial Networks

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

Semantic Image Synthesis with Spatially-Adaptive Normalization

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks


Vision and Language


Depth Estimation


Correspondence


Implicit Field

Owner
Jonathan Choi
Researching Computer Vision @ Korea University. To The Infinity, And Beyond!
Jonathan Choi
Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Restormer: Efficient Transformer for High-Resolution Image Restoration Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan,

Syed Waqas Zamir 906 Dec 30, 2022
Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021) This repository is the official P

Jingyun Liang 159 Dec 30, 2022
Learning Representations that Support Robust Transfer of Predictors

Transfer Risk Minimization (TRM) Code for Learning Representations that Support Robust Transfer of Predictors Prepare the Datasets Preprocess the Scen

Yilun Xu 15 Dec 07, 2022
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation A pytorch-version implementation codes of paper:

11 Dec 13, 2022
Use of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation

Attention Gated Networks (Image Classification & Segmentation) Pytorch implementation of attention gates used in U-Net and VGG-16 models. The framewor

Ozan Oktay 1.6k Dec 30, 2022
Official implementation of Deep Convolutional Dictionary Learning for Image Denoising.

DCDicL for Image Denoising Hongyi Zheng*, Hongwei Yong*, Lei Zhang, "Deep Convolutional Dictionary Learning for Image Denoising," in CVPR 2021. (* Equ

Z80 91 Dec 21, 2022
Code for generating the figures in the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?"

Code for running simulations for the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Lin

Matthew Farrell 1 Nov 22, 2022
An executor that performs image segmentation on fashion items

ClothingSegmenter U2NET fashion image/clothing segmenter based on https://github.com/levindabhi/cloth-segmentation Overview The ClothingSegmenter exec

Jina AI 5 Mar 30, 2022
Image Super-Resolution by Neural Texture Transfer

SRNTT: Image Super-Resolution by Neural Texture Transfer Tensorflow implementation of the paper Image Super-Resolution by Neural Texture Transfer acce

Zhifei Zhang 413 Nov 30, 2022
PyTorch Implementation of Vector Quantized Variational AutoEncoders.

Pytorch implementation of VQVAE. This paper combines 2 tricks: Vector Quantization (check out this amazing blog for better understanding.) Straight-Th

Vrushank Changawala 2 Oct 06, 2021
Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

PortraitNet Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device". @ CAD&Graphics 2019 Introduction We propose a

265 Dec 01, 2022
FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control by Dimitri von Rütte, Luca Biggio, Yannic Kilcher, Thomas Hofmann FIGARO: Generat

Dimitri 83 Jan 07, 2023
D2Go is a toolkit for efficient deep learning

D2Go D2Go is a production ready software system from FacebookResearch, which supports end-to-end model training and deployment for mobile platforms. W

Facebook Research 744 Jan 04, 2023
A python3 tool to take a 360 degree survey of the RF spectrum (hamlib + rotctld + RTL-SDR/HackRF)

RF Light House (rflh) A python script to use a rotor and a SDR device (RTL-SDR or HackRF One) to measure the RF level around and get a data set and be

Pavel Milanes (CO7WT) 11 Dec 13, 2022
Source code for CVPR 2021 paper "Riggable 3D Face Reconstruction via In-Network Optimization"

Riggable 3D Face Reconstruction via In-Network Optimization Source code for CVPR 2021 paper "Riggable 3D Face Reconstruction via In-Network Optimizati

130 Jan 02, 2023
A tensorflow implementation of an HMM layer

tensorflow_hmm Tensorflow and numpy implementations of the HMM viterbi and forward/backward algorithms. See Keras example for an example of how to use

Zach Dwiel 283 Oct 19, 2022
The story of Chicken for Club Bing

Chicken Story tl;dr: The time when Microsoft banned my entire country for cheating at Club Bing. (A lot of the details are from memory so I've recreat

Eyal 142 May 16, 2022
PyTorchMemTracer - Depict GPU memory footprint during DNN training of PyTorch

A Memory Tracer For PyTorch OOM is a nightmare for PyTorch users. However, most

Jiarui Fang 9 Nov 14, 2022
Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Using fully convolutional networks for semantic segmentation (Shelhamer et al.) with caffe for the cityscapes dataset How to get started Download the

Simon Guist 27 Jun 06, 2022
Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Deep Vision and Graphics This repo supplements course "Deep Vision and Graphics" taught at YSDA @fall'21. The course is the successor of "Deep Learnin

Yandex School of Data Analysis 160 Jan 02, 2023