In this workshop we explore state-of-the-art NLP transformers such as T5 and BERT, then build a model using the Hugging Face Transformers framework.

Last update: Apr 13, 2022

Overview

Transformers are all you need

In this workshop we explore state-of-the-art NLP transformers such as T5 and BERT, then build a model using the Hugging Face Transformers framework.

Table of Contents

The workshop is divided into four parts:

Introduction to Transformers and the hype around them
Sneak peek at the theory behind Transformers
Quick tour (Hugging Face framework)
Lab
- Fine-tune a translation model

Note that you can always open the notebooks on Google Colab (no need to install anything); you just need a stable internet connection:

- Fine-tune a translation model
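
As a preview of what the translation lab builds on, here is a minimal sketch of running a pretrained T5 checkpoint before any fine-tuning. The "t5-small" checkpoint, the function names, and the English-to-French pair are assumptions chosen for illustration; they are not taken from the workshop notebook. T5 expects the task to be written as a text prefix in front of the input, which is what the first helper does.

```python
def add_task_prefix(text, source_lang="English", target_lang="French"):
    """Build a T5-style prompt: the task is stated in plain text before the input."""
    return f"translate {source_lang} to {target_lang}: {text}"


def translate(text, checkpoint="t5-small", max_new_tokens=40):
    """Translate `text` with a pretrained seq2seq checkpoint.

    Assumes transformers, torch and sentencepiece are installed (they are on
    Google Colab). The weights are downloaded from the Hub on the first call.
    """
    # Imported lazily so the prompt helper above works without transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

    # Tokenize the prefixed prompt and let the model generate the translation.
    inputs = tokenizer(add_task_prefix(text), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `translate("The workshop starts tomorrow.")` in a Colab cell should return a French sentence; the exact wording depends on the checkpoint.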

2. How to get started

Fork this repository
Create a branch named after you
Go through the notebook and complete all tasks
Submit a pull request

Homework exercise

Your task is to fine-tune a classification model:

Use Hugging Face transformers and datasets.
Fine-tune it on one of the classification tasks of the GLUE benchmark (CoLA, to be specific).
Use a checkpoint from the Hub ("distilbert-base-uncased", for example).
Once finished, submit a pull request to this repo. Make sure to place your .ipynb file in the submissions folder (YOUR_NAME.ipynb).

Useful resources: text_classification
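
One possible starting skeleton for the homework is sketched below. It is not a reference solution: the function names, the output directory, the single training epoch, and the default hyperparameters are all assumptions you are expected to change. CoLA is scored with the Matthews correlation coefficient, so the sketch includes a small pure-Python version of that metric and passes it to the `Trainer`.

```python
import math

CHECKPOINT = "distilbert-base-uncased"  # the example checkpoint named in the task


def matthews_corrcoef(preds, labels):
    """Matthews correlation for binary labels, the metric GLUE reports for CoLA."""
    tp = sum(p == 1 and y == 1 for p, y in zip(preds, labels))
    tn = sum(p == 0 and y == 0 for p, y in zip(preds, labels))
    fp = sum(p == 1 and y == 0 for p, y in zip(preds, labels))
    fn = sum(p == 0 and y == 1 for p, y in zip(preds, labels))
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention the score is 0 when any row or column of the confusion matrix is empty.
    return (tp * tn - fp * fn) / denom if denom else 0.0


def compute_metrics(eval_pred):
    """Turn the Trainer's (logits, labels) pair into a metrics dict."""
    logits, labels = eval_pred
    preds = logits.argmax(axis=-1)
    return {"matthews_correlation": matthews_corrcoef(preds.tolist(), labels.tolist())}


def fine_tune_cola(output_dir="cola-distilbert", epochs=1):
    """Fine-tune CHECKPOINT on GLUE/CoLA and return the validation metrics.

    output_dir and epochs are placeholder values, not workshop settings.
    """
    # Imported lazily so the metric helpers above can be used on their own.
    from datasets import load_dataset
    from transformers import (
        AutoModelForSequenceClassification,
        AutoTokenizer,
        DataCollatorWithPadding,
        Trainer,
        TrainingArguments,
    )

    raw = load_dataset("glue", "cola")  # splits: train / validation / test
    tokenizer = AutoTokenizer.from_pretrained(CHECKPOINT)
    tokenized = raw.map(
        lambda batch: tokenizer(batch["sentence"], truncation=True), batched=True
    )

    model = AutoModelForSequenceClassification.from_pretrained(CHECKPOINT, num_labels=2)
    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir=output_dir, num_train_epochs=epochs),
        train_dataset=tokenized["train"],
        eval_dataset=tokenized["validation"],
        data_collator=DataCollatorWithPadding(tokenizer=tokenizer),  # pads each batch dynamically
        compute_metrics=compute_metrics,
    )
    trainer.train()
    return trainer.evaluate()
```

Running `fine_tune_cola()` in a GPU-backed Colab session downloads the dataset and checkpoint, trains, and returns a dict of validation metrics in which the Matthews correlation appears under the key `eval_matthews_correlation`.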

Owner

Aymen Berriche
