This repository contains a content-based image retrieval (CBIR) system that uses Swin Transformer to extract image features.

Last update: Nov 17, 2022

Overview

Swin-transformer based CBIR

This repository contains a content-based image retrieval (CBIR) system. It uses Swin Transformer to extract features from a query image and retrieves similar images from an image database. The program also supports interactive use: you can select an image through a file explorer dialog and crop the region of interest by dragging the mouse.
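As a rough illustration of the cropping step, the sketch below turns the two end points of a mouse drag into a crop box and cuts that region out of an image array. The function names `drag_to_box` and `crop` are hypothetical, and the image is a blank NumPy array standing in for a loaded picture; the GUI code in main.py may handle this differently.

```python
import numpy as np


def drag_to_box(start, end, width, height):
    """Convert two drag points (x, y) into a clamped (left, top, right, bottom) box.

    The drag may go in any direction, so the coordinates are sorted first,
    then clamped to the image bounds.
    """
    (x0, y0), (x1, y1) = start, end
    left, right = sorted((x0, x1))
    top, bottom = sorted((y0, y1))
    left = max(0, min(left, width))
    right = max(0, min(right, width))
    top = max(0, min(top, height))
    bottom = max(0, min(bottom, height))
    return left, top, right, bottom


def crop(image, box):
    """Crop an H x W x C array to the given (left, top, right, bottom) box."""
    left, top, right, bottom = box
    return image[top:bottom, left:right]


# Placeholder image: height 100, width 200, 3 channels (not real data).
image = np.zeros((100, 200, 3), dtype=np.uint8)

# A drag from bottom-right to above the top edge of the image.
box = drag_to_box((150, 80), (50, -10), width=200, height=100)
region = crop(image, box)
```

Sorting the coordinates lets the user drag in any direction, and clamping keeps the box valid when the pointer leaves the image area.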

Structure

SWIN_CBIR/
|-- checkpoints/
|
|-- database/
|   |-- data/
|   |   |-- 1.jpg
|   |   |-- 2.jpg
|   |  
|   |-- DB.npz
|   |-- index.txt
|
|-- models/
|   |-- __init__.py
|   |-- build.py
|   |-- swin_transformer.py
|
|-- scripts/
|   |-- generate_DB.sh
|
|-- test/
|
|-- config.py
|-- database.py
|-- generate_DB.py
|-- main.py
|-- requirements.txt
|-- README

Getting Started

Prepare the image database

Collect some images and put them into database/data/.
Run ./scripts/generate_DB.sh on a Linux machine to extract the features of all images and package them into DB.npz.
Run main.py, open an image, and select the region of interest; the program then finds similar images in the database automatically.
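The idea of packaging features into an .npz archive and ranking database images against a query can be sketched as follows. This is a minimal illustration, not the repository's code: the archive keys `features` and `names`, the helper names `build_db` and `retrieve`, and the use of cosine similarity are all assumptions, and the tiny hand-written vectors stand in for real Swin Transformer features.

```python
import io

import numpy as np


def build_db(features, names):
    """Pack feature vectors and image names into an in-memory .npz archive.

    The keys "features" and "names" are assumed for this sketch; the real
    DB.npz layout may differ.
    """
    buf = io.BytesIO()
    np.savez(
        buf,
        features=np.asarray(features, dtype=np.float32),
        names=np.asarray(names),
    )
    buf.seek(0)
    return buf


def retrieve(query, db_file, top_k=3):
    """Return the top_k (name, similarity) pairs ranked by cosine similarity."""
    db = np.load(db_file)
    feats = db["features"]
    names = db["names"]
    # L2-normalise so that the dot product equals the cosine similarity.
    feats = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    q = np.asarray(query, dtype=np.float32)
    q = q / np.linalg.norm(q)
    sims = feats @ q
    order = np.argsort(-sims)[:top_k]
    return [(str(names[i]), float(sims[i])) for i in order]


# Placeholder 2-D "features" for three images (illustrative values only).
db_file = build_db([[1, 0], [0, 1], [1, 1]], ["1.jpg", "2.jpg", "3.jpg"])
results = retrieve([1.0, 0.1], db_file, top_k=3)
ranked_names = [name for name, _ in results]
```

In practice the archive would be written to a path such as database/DB.npz rather than an in-memory buffer, and the query vector would come from the same feature extractor that was used to build the database.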

Results

Here we show two image retrieval results. The two images in the first row are the original image and the cropped image, respectively; the others are the retrieval results, sorted by similarity.

Note: all images are resized to squares for display, so some of them appear distorted.
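To see where the distortion comes from, the short sketch below computes the horizontal and vertical scale factors applied when a non-square image is resized to a square. The function name and the 640x480 and 224 values are hypothetical examples, not settings taken from this repository.

```python
def square_resize_scales(width, height, side):
    """Return (scale_x, scale_y, stretch) for resizing width x height to side x side.

    stretch is scale_y / scale_x, which equals width / height; a value other
    than 1.0 means the image is stretched along one axis.
    """
    scale_x = side / width
    scale_y = side / height
    return scale_x, scale_y, scale_y / scale_x


# Hypothetical example: a 640x480 image shown as a 224x224 square.
scale_x, scale_y, stretch = square_resize_scales(640, 480, 224)
```

Only images that are already square have a stretch of 1.0; the further the original aspect ratio is from 1:1, the stronger the visible distortion.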

Acknowledgments

Part of the code in this repository is copied from Swin-transformer; thanks to the authors for their exquisite code.

Owner

JsHou
