Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Last update: Jan 08, 2023

Overview

GPT2-Pytorch with Text-Generator

Better Language Models and Their Implications

Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model. As an experiment in responsible disclosure, we are instead releasing a much smaller model for researchers to experiment with, as well as a technical paper. from openAI Blog

This repository is simple implementation GPT-2 about text-generator in Pytorch with compress code

The original repertoire is openai/gpt-2. Also You can Read Paper about gpt-2, "Language Models are Unsupervised Multitask Learners". To Understand more detail concept, I recommend papers about Transformer Model.
Good implementation GPT-2 in Pytorch which I referred to, huggingface/pytorch-pretrained-BERT, You can see more detail implementation in huggingface repository.
Transformer(Self-Attention) Paper : Attention Is All You Need(2017)
First OpenAi-GPT Paper : Improving Language Understanding by Generative Pre-Training(2018)
See OpenAI Blog about GPT-2 and Paper

Quick Start

download GPT2 pre-trained model in Pytorch which huggingface/pytorch-pretrained-BERT already made! (Thanks for sharing! it's help my problem transferring tensorflow(ckpt) file to Pytorch Model!)

$ git clone https://github.com/graykode/gpt-2-Pytorch && cd gpt-2-Pytorch
# download huggingface's pytorch model 
$ curl --output gpt2-pytorch_model.bin https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-pytorch_model.bin
# setup requirements, if using mac os, then run additional setup as descibed below
$ pip install -r requirements.txt

Now, You can run like this.

Text from Book 1984, George Orwell

$ python main.py --text "It was a bright cold day in April, and the clocks were striking thirteen. Winston Smith, his chin nuzzled into his breast in an effort to escape the vile wind, slipped quickly through the glass doors of Victory Mansions, though not quickly enough to prevent a swirl of gritty dust from entering along with him."

Also You can Quick Starting in Google Colab

Option

--text : sentence to begin with.
--quiet : not print all of the extraneous stuff like the "================"
--nsamples : number of sample sampled in batch when multinomial function use
--unconditional : If true, unconditional generation.
--batch_size : number of batch size
--length : sentence length (< number of context)
--temperature: the thermodynamic temperature in distribution (default 0.7)
--top_k : Returns the top k largest elements of the given input tensor along a given dimension. (default 40)

See more detail option about temperature and top_k in here

Dependencies

Pytorch 0.41+
regex 2017.4.5

Mac OS Setup

$ python3 -m venv venv
$ source venv/bin/activate
$ pip install torch tqdm
$ brew install libomp
$ export LC_ALL=en_US.UTF-8
$ export LANG=en_US.UTF-8
$ pip install -r requirements.txt

Author

Tae Hwan Jung(Jeff Jung) @graykode
Author Email : [email protected]

License

OpenAi/GPT2 follow MIT license, huggingface/pytorch-pretrained-BERT is Apache license.
I follow MIT license with original GPT2 repository

Acknowledgement

Jeff Wu(@WuTheFWasThat), Thomas Wolf(@thomwolf) for allowing referring code.

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Related tags

Overview

GPT2-Pytorch with Text-Generator

Quick Start

Option

Dependencies

Mac OS Setup

Author

License

Acknowledgement

Owner

Tae-Hwan Jung

NSFW A chatbot based on GPT2-chitchat

BMInf (Big Model Inference) is a low-resource inference package for large-scale pretrained language models (PLMs).

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP

Code Generation using a large neural network called GPT-J

A framework for evaluating Knowledge Graph Embedding Models in a fine-grained manner.

LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

Phomber is infomation grathering tool that reverse search phone numbers and get their details, written in python3.

auto_code_complete is a auto word-completetion program which allows you to customize it on your need

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

My implementation of Safaricom Machine Learning Codility test. The code has bugs, logical I guess I made errors and any correction will be appreciated.

PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Natural Language Processing Specialization

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

Chinese version of GPT2 training code, using BERT tokenizer.

SimpleChinese2 集成了许多基本的中文NLP功能，使基于 Python 的中文文字处理和信息提取变得简单方便。

BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese

Tools to download and cleanup Common Crawl data

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Related tags

Overview

GPT2-Pytorch with Text-Generator

Quick Start

Option

Dependencies

Mac OS Setup

Author

License

Acknowledgement

Owner

Tae-Hwan Jung

**NSFW** A chatbot based on GPT2-chitchat

BMInf (Big Model Inference) is a low-resource inference package for large-scale pretrained language models (PLMs).

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP

Code Generation using a large neural network called GPT-J

A framework for evaluating Knowledge Graph Embedding Models in a fine-grained manner.

LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

Phomber is infomation grathering tool that reverse search phone numbers and get their details, written in python3.

auto_code_complete is a auto word-completetion program which allows you to customize it on your need

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

My implementation of Safaricom Machine Learning Codility test. The code has bugs, logical I guess I made errors and any correction will be appreciated.

PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Natural Language Processing Specialization

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

Chinese version of GPT2 training code, using BERT tokenizer.

SimpleChinese2 集成了许多基本的中文NLP功能，使基于 Python 的中文文字处理和信息提取变得简单方便。

BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese

Tools to download and cleanup Common Crawl data

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

NSFW A chatbot based on GPT2-chitchat