Mastering Transformers, published by Packt

Last update: Jan 01, 2023

Mastering Transformers

This is the code repository for Mastering Transformers, published by Packt.

Build state-of-the-art models from scratch with advanced natural language processing techniques

What is this book about?

Transformer-based language models have dominated natural language processing (NLP) studies and have now become a new paradigm. With this book, you'll learn how to build various transformer-based NLP applications using the Python Transformers library.

This book covers the following exciting features:

Explore state-of-the-art NLP solutions with the Transformers library
Train a language model in any language with any transformer architecture
Fine-tune a pre-trained language model to perform several downstream tasks
Select the right framework for the training, evaluation, and production of an end-to-end solution
Get hands-on experience in using TensorBoard and Weights & Biases
Visualize the internal representation of transformer models for interpretability

If you feel this book is for you, get your copy today!

Instructions and Navigations

All of the code is organized into folders, for example, Chapter03.

The code will look like the following:

import pandas as pd

# Load the IMDB reviews and dump the review column to a plain-text corpus.
imdb_df = pd.read_csv("IMDB Dataset.csv")
reviews = imdb_df.review.to_string(index=False)
with open("corpus.txt", "w") as f:
    f.write(reviews)
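
The same corpus-building step can also be sketched with only the standard library, writing one review per line. This is a minimal sketch, not code from the book: the helper name `write_corpus` is hypothetical, and it assumes the CSV file has a header row with a `review` column, as the snippet above implies.

```python
import csv


def write_corpus(csv_path, corpus_path, column="review"):
    """Write each value of `column` on its own line; return the row count.

    Hypothetical helper for illustration. Assumes the CSV has a header row
    that contains `column`.
    """
    count = 0
    with open(csv_path, newline="", encoding="utf-8") as src, \
            open(corpus_path, "w", encoding="utf-8") as dst:
        for row in csv.DictReader(src):
            # Collapse internal whitespace (including embedded newlines) so
            # that every review occupies exactly one line of the corpus.
            text = " ".join(row[column].split())
            dst.write(text + "\n")
            count += 1
    return count
```

Under those assumptions, `write_corpus("IMDB Dataset.csv", "corpus.txt")` would produce a file with one review per line.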

Following is what you need for this book: This book is for deep learning researchers, hands-on NLP practitioners, as well as ML/NLP educators and students who want to start their journey with Transformers. Beginner-level machine learning knowledge and a good command of Python will help you get the best out of this book.

With the following software and hardware list, you can run all code files present in the book (Chapters 1-11).

Software and Hardware List

Chapter	Software required	OS required
1-11	Python 3.6x, Transformers, Google Colaboratory, Jupyter Notebook, TensorFlow	Windows, Mac OS X, and Linux (Any)
10	Docker, Locust.io	Windows, Mac OS X, and Linux (Any)

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Code in Action

Click on the following link to see the Code in Action:

https://bit.ly/3i4vFzJ

Get to Know the Authors

Savaş Yıldırım graduated from the Istanbul Technical University Department of Computer Engineering and holds a Ph.D. degree in Natural Language Processing (NLP). Currently, he is an associate professor at Istanbul Bilgi University, Turkey, and a visiting researcher at Ryerson University, Canada. He is a proactive lecturer and researcher with more than 20 years of experience teaching courses on machine learning, deep learning, and NLP.

Meysam Asgari-Chenaghlu is an AI manager at Carbon Consulting and a Ph.D. candidate at the University of Tabriz. He has been a consultant for Turkey's leading telecommunication and banking companies. He has also worked on various projects, including natural language understanding and semantic search.

