A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Last update: Nov 20, 2021

Overview

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Abstract

In this paper sentiment analysis has been performed in order to evaluate the performance of XLNet on this particular task. XLNet is rather a ground-breaking network on language understanding which uses the perks of both autoregressive models and autoencoders. While BERT uses autoencoders and Transformers use autoregression, XLNet combines the aforementioned networks’ attributes in order to achieve higher performance in many NLP tasks, such as sentiment analysis, question answering, reading comprehension, natural language understanding etc. In this work we evaluate the XLNet model in several sentiment classification tasks in terms of accuracy and efficiency. The XLNet reaches state of the art results and outperforms BERT which is the previous state of the art model on natural language processing.

This was an assignment for the course of Deep learning in PhD program of National Technical Unicersity of Athens

Team composed of 3 persons
Runs has been made on HPC-ARIS through batch scripts
Course grade 10/10 (excellent)
Full report formatted as a paper in here
Code for 2 sentiment analysis tasks out of 3 (implemented by the author of this repo) in here
Data available here

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Related tags

Overview

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Abstract

This was an assignment for the course of Deep learning in PhD program of National Technical Unicersity of Athens

Owner

James Zaridis

Words_And_Phrases - Just a repo for useful words and phrases that might come handy in some scenarios. Feel free to add yours

Natural Language Processing for Adverse Drug Reaction (ADR) Detection

Count the frequency of letters or words in a text file and show a graph.

Header-only C++ HNSW implementation with python bindings

Watson Natural Language Understanding and Knowledge Studio

The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Sorce code and datasets for "K-BERT: Enabling Language Representation with Knowledge Graph",

Source code of the "Graph-Bert: Only Attention is Needed for Learning Graph Representations" paper

Sentiment Classification using WSD, Maximum Entropy & Naive Bayes Classifiers

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

Simple, hackable offline speech to text - using the VOSK-API.

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Training RNNs as Fast as CNNs

Reformer, the efficient Transformer, in Pytorch

Higher quality textures for the Metal Gear Solid series.

Datasets of Automatic Keyphrase Extraction

Document processing using transformers

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

DLO8012: Natural Language Processing & CSL804: Computational Lab - II