CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

Last update: Mar 07, 2022

Related tags

Deep Learning CvT-ASSD

Overview

CvT-ASSD

including extra CvT, CvT-SSD, VGG-ASSD models

original-code-website:

https://github.com/albert-jin/CvT-SSD

new-code-website:

https://github.com/albert-jin/CvT-ASSD

为了符合开源号召,本项目于2021-7-12 正式开源...

project architecture:

Mentions

You may probably need to install an anaconda environment which contains all packages followed.
- pytorch 1.9.0 py3.7_cuda10.2_cudnn7_0 pytorch
- cudatoolkit 10.2.89 h74a9793_1
- opencv-python 4.5.2.54 pypi_0 pypi
- visdom 0.1.8.9 pypi_0 pypi
- yacs 0.1.8 pypi_0 pypi
- jupyter 1.0.0 pypi_0 pypi
For training, an NVIDIA GPU is strongly recommended for speed. we use two NVIDIA GTX-1080TI, but we recommend GPUs like Tesla-V100 /RTX-3090 for more memory
Before you run the codes for self-study or reappearance the performance in this paper "CvT-ASSD", please add the CvT_SSD/model/ directory into sources Root caused by the reference of many codes inside of model directory
you should download the pytorch parameters file postfix by ".pth" and move into models/CvT/weights like 项目结构.PNG
图像物体检测benchmark(参照论文native-SSD)一般是将VOC2007—TEST的数据作为模型的测试集,训练集可有以下搭配:
- 1. 07:VOC2007 trainval 训练集验证集
- 1. 02+12 VOC2007 trainval + VOC2007 trainval 训练集验证集
- 1. 07+12+COCO 在 COCO trainval35k上预训练,然后在07+12上微调
评价指标maP使用mxnet提供的VOC07MApMetric,将recall分成10等分,继而对所有precision取平均,在对类别去平均,具体参见 https://blog.csdn.net/u014203453/article/details/77598997

CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

Related tags

Overview

CvT-ASSD

including extra CvT, CvT-SSD, VGG-ASSD models

original-code-website:

new-code-website:

为了符合开源号召,本项目于2021-7-12 正式开源...

project architecture:

Mentions

Owner

金伟强 -上海大学人工智能小渣渣~

Deep Inertial Prediction (DIPr)

PyTorch Implement of Context Encoders: Feature Learning by Inpainting

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

STRIVE: Scene Text Replacement In Videos

CountDown to New Year and shoot fireworks

Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"

Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer

J.A.R.V.I.S is an AI virtual assistant made in python.

Cookiecutter PyTorch Lightning

A collection of awesome resources image-to-image translation.

Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification

The official implementation of EIGNN: Efficient Infinite-Depth Graph Neural Networks (NeurIPS 2021)

U-Net implementation in PyTorch for FLAIR abnormality segmentation in brain MRI

Vit-ImageClassification - Pytorch ViT for Image classification on the CIFAR10 dataset

LoFTR:Detector-Free Local Feature Matching with Transformers CVPR 2021

3D-Reconstruction 基于深度学习方法的单目多视图三维重建

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

Learning Temporal Consistency for Low Light Video Enhancement from Single Images (CVPR2021)

PoseViz – Multi-person, multi-camera 3D human pose visualization tool built using Mayavi.

DeepGNN is a framework for training machine learning models on large scale graph data.