Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Last update: Aug 02, 2021


Overview

Wietse de Vries • Martijn Bartelds • Malvina Nissim • Martijn Wieling

Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

This repository contains everything needed to replicate the results in the paper:

📝 Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

Models

The best fine-tuned models for Gronings and West Frisian are available on the Hugging Face model hub:

Lexical layers

These models are identical to BERTje, but with different lexical layers (bert.embeddings.word_embeddings).

🤗 GroNLP/bert-base-dutch-cased (Dutch; source language)
🤗 GroNLP/bert-base-dutch-cased-gronings (Gronings)
🤗 GroNLP/bert-base-dutch-cased-frisian (West Frisian)
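As a minimal sketch of what "different lexical layers" means in practice, the snippet below loads one of the checkpoints listed above with the Hugging Face `transformers` library and returns its word embedding matrix. It assumes `transformers` and `torch` are installed and that the model hub is reachable. The names `LEXICAL_MODELS` and `load_word_embeddings` are illustrative and not part of this repository.

```python
# Illustrative sketch; the helper names are hypothetical, not part of this repository.
LEXICAL_MODELS = {
    "dutch": "GroNLP/bert-base-dutch-cased",
    "gronings": "GroNLP/bert-base-dutch-cased-gronings",
    "frisian": "GroNLP/bert-base-dutch-cased-frisian",
}


def load_word_embeddings(language):
    """Download a checkpoint and return the weights of its lexical layer."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModel

    model = AutoModel.from_pretrained(LEXICAL_MODELS[language])
    # For BERT models the input embeddings are the word_embeddings module.
    return model.get_input_embeddings().weight.detach()
```

Calling `load_word_embeddings("gronings")` downloads the Gronings checkpoint and returns a tensor of shape (vocabulary size, hidden size). The same call with `"dutch"` returns the BERTje lexical layer for comparison.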

POS tagging

These models share the same fine-tuned Transformer layers and classification head, but each one uses the retrained lexical layer from the corresponding model above.

🤗 GroNLP/bert-base-dutch-cased-upos-alpino (Dutch)
🤗 GroNLP/bert-base-dutch-cased-upos-alpino-gronings (Gronings)
🤗 GroNLP/bert-base-dutch-cased-upos-alpino-frisian (West Frisian)
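The POS taggers listed above can be tried out with the `transformers` token-classification pipeline. The following is a hedged sketch rather than the repository's own evaluation code: it assumes `transformers` and `torch` are installed and that the model hub is reachable, and the names `POS_MODELS` and `tag` are illustrative.

```python
# Illustrative sketch; the helper names are hypothetical, not part of this repository.
POS_MODELS = {
    "dutch": "GroNLP/bert-base-dutch-cased-upos-alpino",
    "gronings": "GroNLP/bert-base-dutch-cased-upos-alpino-gronings",
    "frisian": "GroNLP/bert-base-dutch-cased-upos-alpino-frisian",
}


def tag(sentence, language="dutch"):
    """Return (token, label) pairs predicted by the tagger for one sentence."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import pipeline

    # Loading by model id also fetches the tokenizer stored with that checkpoint.
    tagger = pipeline("token-classification", model=POS_MODELS[language])
    # Without an aggregation strategy, predictions are per word piece,
    # so a word split into several pieces yields several pairs.
    return [(item["word"], item["entity"]) for item in tagger(sentence)]
```

For example, `tag(sentence, language="frisian")` runs the West Frisian tagger on a sentence of your choice. Because the output is per word piece, word-level tags need an extra merging step that this sketch leaves out.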

Development

Conda/mamba dependencies are listed in environment.yml. The repository contains all scripts and configs needed to replicate the results in the paper; a more extensive usage guide will be provided later.

BibTeX entry

The paper is to appear in Findings of ACL 2021. The preprint can be cited as:

@misc{devries2021adapting,
      title={{Adapting Monolingual Models: Data can be Scarce when Language Similarity is High}}, 
      author={Wietse de Vries and Martijn Bartelds and Malvina Nissim and Martijn Wieling},
      year={2021},
      eprint={2105.02855},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

