INTRODUCTION

This is a modification of the OpenAI-CLIP repo of moein-shariatnia(https://github.com/moein-shariatnia/OpenAI-CLIP).

The current training dataset supports flicker-8k or flicker-30k, and the image encoder supports Resnet50 or ViT(vit_base_patch16_384).

Text encoder supports only DistilBert like moein-shariatnia.

ENVIRONTMENT SETTING

$ virtualenv .venv --python=python3.6
$ source .venv/bin/activate
$ pip install -r requirements.txt

EXECUTTION

Pretrain

$ python3 pretrain.py

Inference

$ python3 inference.py --qeury={YOUR QUERY}

CAUTION

You must set(or check) some options in config.py before pretrain & inference

ex1) dataset("8k" or "30k"): Train dataset(flicker-8k or flicker-30k)

ex2) model_name("resnet50" or "vit_base_patch16_384"): Type of image encoder

ex3) pretrained(True or False): Decide whether to learn by loading pretrain versions of text encoder(DistilBert) and image encoder(resnet50 or ViT)

ex4) batch_size: Set according to the capacity of the machine

This is a modification of the OpenAI-CLIP repository of moein-shariatnia

Related tags

Overview

INTRODUCTION

ENVIRONTMENT SETTING

EXECUTTION

CAUTION

Owner

Sangwon Beak

Natural language Understanding Toolkit

🐍 A hyper-fast Python module for reading/writing JSON data using Rust's serde-json.

Shellcode antivirus evasion framework

I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

Natural language computational chemistry command line interface.

Source code for the paper "TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations"

A deep learning-based translation library built on Huggingface transformers

A Fast Command Analyser based on Dict and Pydantic

Repositório do trabalho de introdução a NLP

Chinese version of GPT2 training code, using BERT tokenizer.

A fast and easy implementation of Transformer with PyTorch.

This repository is home to the Optimus data transformation plugins for various data processing needs.

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

Python package for performing Entity and Text Matching using Deep Learning.

Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

BERT Attention Analysis

This code extends the neural style transfer image processing technique to video by generating smooth transitions between several reference style images

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"