BERT-SST2-Prod

Reproduction process of BERT on SST2 dataset

Installation

Clone the repository

git clone https://github.com/JunnYu/BERT-SST2-Prod

Enter the project directory and install the requirements

pip install -r requirements.txt

Install PaddlePaddle and PyTorch

# CPU version of PaddlePaddle
pip install paddlepaddle==2.2.0 -i https://mirror.baidu.com/pypi/simple
# To install the GPU version of PaddlePaddle (CUDA 11.2 build) instead, use the command below
# pip install paddlepaddle-gpu==2.2.0.post112 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
# Install PyTorch (CUDA 11.3 build)
pip install torch==1.10.0+cu113 torchvision==0.11.1+cu113 torchaudio==0.10.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html

Note: this project depends on paddlepaddle 2.2.0, so make sure you install exactly that version.
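The version requirement can also be checked from package metadata before importing anything. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `matches_required` are hypothetical, it assumes Python 3.8+ for `importlib.metadata`, and it assumes the GPU wheel reports its version with a `.post` suffix as in the install command.

```python
from importlib import metadata

# Version pinned by the installation note.
REQUIRED_PADDLE = "2.2.0"


def installed_version(*dist_names):
    """Return the version of the first installed distribution, or None."""
    for name in dist_names:
        try:
            return metadata.version(name)
        except metadata.PackageNotFoundError:
            continue
    return None


def matches_required(version, required=REQUIRED_PADDLE):
    """True for an exact match or a build suffix such as '2.2.0.post112'."""
    if version is None:
        return False
    return version == required or version.startswith(required + ".post")


# The CPU and GPU wheels are published under different distribution names.
found = installed_version("paddlepaddle", "paddlepaddle-gpu")
if matches_required(found):
    print(f"PaddlePaddle {found} matches the required {REQUIRED_PADDLE}")
else:
    print(f"Expected PaddlePaddle {REQUIRED_PADDLE}, found: {found}")
```

Reading the metadata avoids importing the framework, so the check still gives a clear message when the package is missing or broken.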

Verify the PaddlePaddle installation

Start python and enter the following commands.

import paddle
paddle.utils.run_check()
print(paddle.__version__)

If you see the output below, PaddlePaddle is installed correctly.

PaddlePaddle is installed successfully! Let's start deep learning with PaddlePaddle now.
2.2.0

Verify the PyTorch installation

Start python and enter the following commands. If they print the expected output, torch is installed correctly.

import torch
print(torch.__version__)
# For a CPU-only install, the following confirms that torch works
# Expected output: tensor([1.])
print(torch.Tensor([1.0]))
# For a GPU install, the following confirms that torch can use CUDA
# Expected output: tensor([1.], device='cuda:0')
print(torch.Tensor([1.0]).cuda())
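Both checks can be combined into one script that does not crash when a framework is missing. This is a hedged sketch rather than something shipped with the project: `probe` is a hypothetical helper, and the only framework call used is `torch.cuda.is_available()`.

```python
import importlib


def probe(module_name):
    """Try to import a module; return (installed, version string or None)."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return False, None
    return True, getattr(module, "__version__", "unknown")


for name in ("paddle", "torch"):
    installed, version = probe(name)
    print(f"{name}: {version if installed else 'not installed'}")

# Only query CUDA when torch imported successfully.
if probe("torch")[0]:
    import torch
    print("torch CUDA available:", torch.cuda.is_available())
```

On a CPU-only machine the last line is expected to report `False`, which does not by itself indicate a failed installation.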

Owner

yujun