A Simple Framework for CV Pre-training Models (SOCO, VirTex, BEiT)

Last update: Jul 07, 2022

Related tags

Deep Learning BigPretrain

Overview

BigPretrain

A Simple Framework for CV Pre-training Models based on prototype.

Models

Click on the hyperlink to view details.

SOCO: Aligning Pretraining for Detection via Object-Level Contrastive Learning
- Paper, Official Code
- New Features:

VirTex

License

For academic use, this project is licensed under the 2-clause BSD License. For commercial use, please contact the authors.

Owner

Sense-GVT

A curated list and survey of awesome Vision Transformers.

A curated list and survey of awesome Vision Transformers. You can use mind mapping software to open the mind mapping source file. You c

281 Dec 21, 2022

StyleGAN2 Webtoon / Anime Style Toonify

StyleGAN2 Webtoon / Anime Style Toonify Korea Webtoon or Japanese Anime Character Stylegan2 base high Quality 1024x1024 / 512x512 Generate and Transfe

121 Dec 21, 2022

Python Actor concurrency library

Thespian Actor Library This library provides the framework of an Actor model for use by applications implementing Actors. Thespian Site with Documenta

177 Dec 11, 2022

Code and data for ImageCoDe, a contextual vision-and-language benchmark

ImageCoDe This repository contains code and data for ImageCoDe: Image Retrieval from Contextual Descriptions. Data All collected descriptions for the

27 Dec 02, 2022

Framework for estimating the structures and parameters of Bayesian networks (DAGs) at per-sample resolution

Sample-specific Bayesian Networks A framework for estimating the structures and parameters of Bayesian networks (DAGs) at per-sample or per-patient re

1 Sep 23, 2022

The Official PyTorch Implementation of DiscoBox.

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision Paper | Project page | Demo (Youtube) | Demo (Bilib

89 Jan 09, 2023

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

OpenPose has represented the first real-time multi-person system to jointly detect human body, hand, facia

25.7k Jan 09, 2023

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-l

4.6k Jan 09, 2023

Books, Presentations, Workshops, Notebook Labs, and Model Zoo for Software Engineers and Data Scientists wanting to learn the TF.Keras Machine Learning framework

792 Dec 28, 2022

a general-purpose Transformer based vision backbone

Swin Transformer By Ze Liu*, Yutong Lin*, Yue Cao*, Han Hu*, Yixuan Wei, Zheng Zhang, Stephen Lin and Baining Guo. This repo is the official implement

9.9k Jan 08, 2023

Code and model benchmarks for "SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology"

NeurIPS 2020 SEVIR Code for paper: SEVIR : A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology Requirement

46 Dec 15, 2022

This is an unofficial PyTorch implementation of Meta Pseudo Labels

L-Verse: Bidirectional Generation Between Image and Text

PyTorch reimplementation of Diffusion Models

Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation (ICCV2021)

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

Code for paper Novel View Synthesis via Depth-guided Skip Connections

Fake videos detection by tracing the source using video hashing retrieval.

ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure

Empowering journalists and whistleblowers