JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Last update: Oct 26, 2022

Related tags

Deep Learning JASS

Overview

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

This the repository for this paper.

Find extensions of this work and new pre-trained models here: code, paper

Requirements

Install OpenNMT-py (1.0) and subword-nmt.

pip install OpenNMT-py
pip install subword-nmt

Pre-trained JASS models

We release JASS models on 2 language pairs: ja+en, ja+ru. For Japanese seq2seq pretraining, we use our proposed JASS methods while MASS is utilized for English and Russian.

Model	Vocabulary	BPE codes
JASS-jaen	ja-en	ja-en.bpe.codes
JASS-jaru	ja-ru	ja-ru.bpe.codes

Usage

Run the bpe precrocessing for the dataset to be finetuned. After setting up the downloaded vocabulary for src and tgt sentences during the preprocessing phase by preprocess.py of OpenNMT, use train_from argument of train.py in OpenNMT to implement the finetuning for the pretrained model.

Others

We will update the current Japanese--English pre-trained model and release pretrained models on Japanese--Chinese and Japanese--Korean. We released new models here: code

Reference

[1] Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi, JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

@inproceedings{mao-etal-2020-jass,
    title = "{JASS}: {J}apanese-specific Sequence to Sequence Pre-training for Neural Machine Translation",
    author = "Mao, Zhuoyuan  and
      Cromieres, Fabien  and
      Dabre, Raj  and
      Song, Haiyue  and
      Kurohashi, Sadao",
    booktitle = "Proceedings of The 12th Language Resources and Evaluation Conference",
    month = may,
    year = "2020",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://www.aclweb.org/anthology/2020.lrec-1.454",
    pages = "3683--3691",
    language = "English",
    ISBN = "979-10-95546-34-4",
}

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Related tags

Overview

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Requirements

Pre-trained JASS models

Usage

Others

Reference

Owner

Zhuoyuan Mao

BridgeGAN - Tensorflow implementation of Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation.

Codebase for Time-series Generative Adversarial Networks (TimeGAN)

Data, notebooks, and articles associated with the RSNA AI Deep Learning Lab at RSNA 2021

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

Emotion Recognition from Facial Images

On Evaluation Metrics for Graph Generative Models

The final project of "Applying AI to EHR Data" of "AI for Healthcare" nanodegree - Udacity.

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

A library for using chemistry in your applications

A two-stage U-Net for high-fidelity denoising of historical recordings

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

Real-time Joint Semantic Reasoning for Autonomous Driving

Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"

A graph adversarial learning toolbox based on PyTorch and DGL.

Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"

Locally Most Powerful Bayesian Test for Out-of-Distribution Detection using Deep Generative Models

Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection

Overview of architecture and implementation of TEDS-Net, as described in MICCAI 2021: "TEDS-Net: Enforcing Diffeomorphisms in Spatial Transformers to Guarantee TopologyPreservation in Segmentations"

Versatile Generative Language Model