MetaNLI

Meta learning algorithms to train cross-lingual NLI (multi-task) models

Train (source task)

Reptile

To train the model using Reptile algorithm, run the command below:

python reptile.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --queue_len 4 \
    --temp 5.0 \
    --epochs 1 \
    --meta_lr 1e-5 \
    --scheduler \
    --gamma 0.5 \
    --step_size 4000 \
    --shot 4 \
    --meta_iteration 8000 \
    --log_interval 300

Prototypical

To train the model using Prototypical Networks algorithm, run the command below:

python prototype.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --target_task sc_fa \
    --epochs 1 \
    --meta_lr 1e-5 \
    --lambda_1 1 \
    --lambda_2 1 \
    --scheduler \
    --gamma 0.5 \
    --step_size 1000 \
    --shot 8 \
    --query_num 0 \
    --target_shot 8 \
    --meta_iteration 2500 \
    --log_interval 50

Zero-shot Test (on target task)

To perform a zero-shot test of the trained model on the target task, run the command below:

python zeroshot.py \
    --load saved/model_sc.pt \
    --task sc_fa

Fine-tune (target task)

To fine-tune the trained model on the target task, run the command below:

python finetune.py \
    --save saved \
    --model_filename fine.pt \
    --load saved/model_sc.pt \
    --task sc_fa \
    --epochs 5 \
    --lr 1e-5

Meta learning algorithms to train cross-lingual NLI (multi-task) models

Related tags

Overview

MetaNLI

Train (source task)

Reptile

Prototypical

Zero-shot Test (on target task)

Fine-tune (target task)

Owner

M.Hassan Mojab

The ability of computer software to identify words and phrases in spoken language and convert them to human-readable text

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

:mag: Transformers at scale for question answering & neural search. Using NLP via a modular Retriever-Reader-Pipeline. Supporting DPR, Elasticsearch, HuggingFace's Modelhub...

ConvBERT: Improving BERT with Span-based Dynamic Convolution

초성 해석기 based on ko-BART

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

Text preprocessing, representation and visualization from zero to hero.

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

CCF BDCI 2020 房产行业聊天问答匹配赛道 A榜47/2985

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Just Another Telegram Ai Chat Bot Written In Python With Pyrogram.

Malaya-Speech is a Speech-Toolkit library for bahasa Malaysia, powered by Deep Learning Tensorflow.

Huggingface Transformers + Adapters = ❤️

LightSeq: A High-Performance Inference Library for Sequence Processing and Generation

Persian Bert For Long-Range Sequences

Code for the paper TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks

Dust model dichotomous performance analysis