IMDB film review sentiment classification based on BERT's supervised learning model.

Last update: Apr 17, 2022

Overview

IMDB-sentiment-classification-based-on-BERT

Sentiment classification of IMDB film reviews using a supervised model based on BERT. The model can also be extended to other multi-class natural language classification tasks.

Documents description

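ALL_OUTPUT: results of four experimental runs.

BERT_BASE_DIR: Google's pre-trained BERT model files.

DATA_DIR: training, validation, and test set files.

output_models: an empty folder that stores the output files when the program runs.

Raw_Data: the original dataset and some data files involved in data preprocessing.

IMDB Parameters: the parameters in this file must be passed to the program when running 'run.py'.

run.py: the file to run for training, validating, and testing the model.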

Parameters

--task_name=mrpc
--do_train=true
--do_eval=true
--do_predict=true
--data_dir=D:\XXXXXXX\IMDB\DATA_DIR
--vocab_file=BERT_BASE_DIR/vocab.txt
--bert_config_file=BERT_BASE_DIR/bert_config.json
--init_checkpoint=BERT_BASE_DIR/bert_model.ckpt
--max_seq_length=30
--train_batch_size=1
--learning_rate=2e-6
--num_train_epochs=5
--output_dir=output_models/
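
As a rough sketch of how the parameters above might be passed to `run.py`, the snippet below assembles them into a command line. The `PARAMS` dictionary and the `build_command` helper are hypothetical names introduced for illustration and are not part of the repository. The `data_dir` value keeps the elided path exactly as written above and needs to be replaced with a real location. The script only prints the command rather than launching training.

```python
import sys

# Values copied from the parameter list above. The data_dir path is left
# elided exactly as in the README and must be replaced before real use.
PARAMS = {
    "task_name": "mrpc",
    "do_train": "true",
    "do_eval": "true",
    "do_predict": "true",
    "data_dir": r"D:\XXXXXXX\IMDB\DATA_DIR",
    "vocab_file": "BERT_BASE_DIR/vocab.txt",
    "bert_config_file": "BERT_BASE_DIR/bert_config.json",
    "init_checkpoint": "BERT_BASE_DIR/bert_model.ckpt",
    "max_seq_length": 30,
    "train_batch_size": 1,
    "learning_rate": "2e-6",
    "num_train_epochs": 5,
    "output_dir": "output_models/",
}


def build_command(params, script="run.py"):
    """Return an argv list: [python, script, --key=value, ...]."""
    flags = [f"--{name}={value}" for name, value in params.items()]
    return [sys.executable, script] + flags


if __name__ == "__main__":
    # Print the command instead of executing it, so nothing is trained
    # until the paths have been checked.
    print(" ".join(build_command(PARAMS)))
```

The printed command can be pasted into a shell from the project root, or the list can be handed to `subprocess.run` once the paths point at real files.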

The optimization result

Accuracy:
0.922 (validation set)
0.928 (test set)
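
For reference, accuracy here is presumably the fraction of reviews whose predicted label matches the true label. A minimal sketch, assuming binary labels encoded as 0 (negative) and 1 (positive). The `accuracy` function and the sample labels are illustrative only and do not come from the repository or its data.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of positions where the predicted label equals the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


if __name__ == "__main__":
    # Made-up labels: three of the four predictions match.
    print(accuracy([1, 0, 1, 1], [1, 0, 0, 1]))  # prints 0.75
```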

File distribution

Releases

IMDB-big-file-v1.0 (Nov 11, 2021)

Owner

Paris
