In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

Last update: Nov 23, 2022

Related tags

Overview

cdf_att_classification

classes = {0: 'cat', 1: 'dog', 2: 'flower'}

In this project we use both Resnet and Self-attention layer for cdf-Classification. Specifically, For Resnet, we extract low level features from Convolutional Neural Network (CNN) trained on Dogcatflower_2 dataset(details show later).
We take inspiration from the Self-attention mechanism which is a prominent method in cv domain. We also use Grad-CAM algorithm to Visualize the gradient of the back propagation of the pretrain model to understand this network. The code is released for academic research use only. For commercial use, please contact [[email protected]].

Installation

Clone this repo.

git clone https://github.com/Alan-lab/cdf_classification
cd cdf_classification/

This code requires pytorch, python3.7, cv2, d2l. Please install it.

Dataset Preparation

For cdf_classification, the datasets must be downloaded beforehand. Please download them on the respective webpages. Please cite them if you use the data.

Preparing Cat and Dog Dataset. The dataset can be downloaded here.

Preparing flower Dataset. The dataset can be downloaded here.

You can also download Dogcatflower_2 dataset(made from above datasets) use the following link:

Link:https://pan.baidu.com/s/1ZcP_isbbRQBq9BHU6p_VtQ

key:oz7z

Training New Models

Prepare your own dataset like this (https://github.com/Alan-lab/data/Dogcatflower_2).
Training:

python main.py

model.pth will be extrated in the folder ./cdf_classification.

If av_test_acc < 0.75, model.pth will not save(d2l.train_ch6).

3.Predict

Prepare your valid dataset like this (https://github.com/Alan-lab/data/catsdogsflowers/valid1).

python Predict/predict.py

4.Class Activation Map The response size of the feature map is mapped to the original image, allowing readers to understand the effect of the model more intuitively. Prepare your picture like this (https://github.com/Alan-lab/data/Dogcatflower/test/flower/flower.1501.jpg).

python Viewer/Grad_CAM.py

More details can be found in folder.

The Experimental Result

Preformance

dataset	Cat-acc	Dog-acc	flower-acc
Dogcatflower_2_train	96.2	88.7	93.6
Dogcatflower_2_test	72.7	69.2	89.7
catsdogsflowers_valid1	75.1	76.9	91.4
catsdogsflowers_valid2	75.5	73.5	92.9

2.Visualization

Postive sample

Negative sample

Multi-attention

Acknowledgments

This work is mainly supported by (https://courses.d2l.ai/zh-v2/) and CSDN.

Contributions

If you have any questions/comments/bug reports, feel free to open a github issue or pull a request or e-mail to the author Lailanqing ([email protected]).

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

Related tags

Overview

cdf_att_classification

Installation

Dataset Preparation

Training New Models

The Experimental Result

Acknowledgments

Contributions

Owner

CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection

なりすまし検出(anti-spoof-mn3)のWebカメラ向けデモ

Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW 2021

PyTorch implementation of hand mesh reconstruction described in CMR and MobRecon.

SAAVN - Sound Adversarial Audio-Visual Navigation,ICLR2022 (In PyTorch)

Apache Spark - A unified analytics engine for large-scale data processing

Learning Multiresolution Matrix Factorization and its Wavelet Networks on Graphs

使用深度学习框架提取视频硬字幕；docker容器免安装深度学习库，使用本地api接口使得界面和后端识别分离；

A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results

A simple, unofficial implementation of MAE using pytorch-lightning

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Real-time 3D multi-person detection made easy with OpenPose and the ZED

Code for WSDM 2022 paper, Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation.

LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

An image classification app boilerplate to serve your deep learning models asap!

An open-source outlier detection package by Getcontact Data Team

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

KIDA: Knowledge Inheritance in Data Aggregation

A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization