Implementation of Convolutional enhanced image Transformer

Last update: Dec 13, 2022

Overview

CeiT : Convolutional enhanced image Transformer

This is an unofficial PyTorch implementation of Incorporating Convolution Designs into Visual Transformers .

Training :

python train.py -c configs/default.yaml --name "name_of_exp"

Usage :

import torch
from ceit import CeiT

img = torch.ones([1, 3, 224, 224])
    
model = CeiT(image_size = 224, patch_size = 4, num_classes = 100)
out = model(img)

print("Shape of out :", out.shape)      # [B, num_classes]

model = CeiT(image_size = 224, patch_size = 4, num_classes = 100, with_lca = True)
out = model(img)

print("Shape of out :", out.shape)      # [B, num_classes]

Note :

LCA might not be properly implemented.

Citation :

@misc{yuan2021incorporating,
      title={Incorporating Convolution Designs into Visual Transformers}, 
      author={Kun Yuan and Shaopeng Guo and Ziwei Liu and Aojun Zhou and Fengwei Yu and Wei Wu},
      year={2021},
      eprint={2103.11816},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgement :

Base ViT code is borrowed from @lucidrains repo : https://github.com/lucidrains/vit-pytorch
Training and dataloader code is borrowed from @jeonsworld repo : https://github.com/jeonsworld/ViT-pytorch

Implementation of Convolutional enhanced image Transformer

Related tags

Overview

CeiT : Convolutional enhanced image Transformer

Training :

Usage :

Note :

Citation :

Acknowledgement :

Owner

Rishikesh (ऋषिकेश)

Official implementation of the paper Momentum Capsule Networks (MoCapsNet)

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

ReLoss - Official implementation for paper "Relational Surrogate Loss Learning" ICLR 2022

Must-read Papers on Physics-Informed Neural Networks.

Pytorch GUI(demo) for iVOS(interactive VOS) and GIS (Guided iVOS)

Rendering color and depth images for ShapeNet models.

Shitty gaze mouse controller

Introduction to AI assignment 1 HCM University of Technology, term 211

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

A medical imaging framework for Pytorch

Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery (ICCV 2021)

The Official Repository for "Generalized OOD Detection: A Survey"

Self-Supervised Deep Blind Video Super-Resolution

Volumetric parameterization of the placenta to a flattened template

An offline deep reinforcement learning library

PyTorch implementation of SCAFFOLD (Stochastic Controlled Averaging for Federated Learning, ICML 2020).

TalkingHead-1KH is a talking-head dataset consisting of YouTube videos

Dynamic hair modeling from monocular videos using deep neural networks

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Image-to-image translation with conditional adversarial nets