A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation of the paper A Structured Self-Attentive Sentence Embedding, published at ICLR 2017: https://arxiv.org/abs/1703.03130

USAGE:

For binary sentiment classification on the IMDB dataset, run: python classification.py "binary"

For multiclass classification on the Reuters dataset, run: python classification.py "multiclass"

You can change the model parameters in the model_params.json file. Other training parameters, such as the number of attention hops, can be configured in the config.json file.

To use pretrained GloVe embeddings, set the use_embeddings parameter to "True" (the default is False). Remember to download glove.6B.50d.txt and place it in the glove folder.
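For orientation, here is a minimal sketch of how a GloVe text file can be parsed. Each line of glove.6B.50d.txt holds a word followed by its space-separated vector components. The function name load_glove and the in-memory sample are assumptions made for illustration; this is not necessarily how classification.py loads the embeddings.

```python
import io


def load_glove(lines, dim=50):
    """Parse GloVe text lines ("word v1 v2 ... v_dim") into a dict of vectors.

    Hypothetical helper for illustration; not taken from this repository.
    """
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


# Tiny in-memory stand-in for the real file (3 dimensions for brevity).
sample = io.StringIO("the 0.1 0.2 0.3\nmovie -0.5 0.0 0.25\n")
glove = load_glove(sample, dim=3)
```

With the real file, the same function could be called on an open file handle, for example open("glove/glove.6B.50d.txt", encoding="utf-8"), assuming the file sits in the glove folder as described above.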

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights
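
The first two items can be sketched in plain Python, following the formulas in the paper: A = softmax(W_s2 tanh(W_s1 H^T)), M = A H, and the penalty ||A A^T - I||_F^2. This is an illustrative sketch with toy weights, not the code from this repository; the helper names (matmul, self_attention, frobenius_penalty) are assumptions.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def softmax(row):
    peak = max(row)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(H, W_s1, W_s2):
    """H: n x 2u hidden states, W_s1: d_a x 2u, W_s2: r x d_a. Returns (A, M)."""
    hidden = [[math.tanh(v) for v in row] for row in matmul(W_s1, transpose(H))]
    A = [softmax(row) for row in matmul(W_s2, hidden)]  # r x n, rows sum to 1
    M = matmul(A, H)  # r x 2u sentence embedding matrix
    return A, M


def frobenius_penalty(A):
    """Squared Frobenius norm of (A A^T - I); it grows when hops overlap."""
    gram = matmul(A, transpose(A))
    size = len(gram)
    return sum((gram[i][j] - (1.0 if i == j else 0.0)) ** 2
               for i in range(size) for j in range(size))


# Toy example: n = 3 tokens, 2u = 2, d_a = 2, r = 2 attention hops.
H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
W_s1 = [[0.5, -0.5], [0.3, 0.8]]
W_s2 = [[1.0, 0.0], [0.0, 1.0]]
A, M = self_attention(H, W_s1, W_s2)
penalty = frobenius_penalty(A)
```

The penalty is zero only when each hop puts all of its weight on a different token, which is why adding it to the loss pushes the hops toward attending to different parts of the sentence.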

Instead of pruning, this implementation averages over the sentence embeddings.
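
One plausible reading of this averaging step, shown as a sketch: the r rows of the sentence embedding matrix M (one per attention hop) are averaged into a single vector before classification. The function name average_hops and the interpretation itself are assumptions, not confirmed details of this repository.

```python
def average_hops(M):
    """Average the r hop embeddings (rows of M) into one vector of length 2u."""
    r = len(M)
    return [sum(col) / r for col in zip(*M)]


# Two hops, each producing a 2-dimensional embedding.
sentence_vector = average_hops([[1.0, 2.0], [3.0, 4.0]])
```

Averaging keeps the classifier input at size 2u regardless of the number of hops, whereas flattening M would give an input of size r * 2u.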

Visualization:

After training, the model is tested on 100 test points. The attention weights for these 100 test points are retrieved and rendered as heatmaps over the text. A file named visualization.html is saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.

Below is a screenshot of the visualization on a few data points.

Training accuracy: 93.4%. Tested on 1000 points with 90.2% accuracy.
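
The heatmap idea described in this section can be sketched as follows: each token is wrapped in an HTML span whose background opacity follows its attention weight. This is a simplified illustration, not the visualization code by Zhouhan Lin that the repository uses; the function name attention_to_html and the red color scheme are assumptions.

```python
import html


def attention_to_html(tokens, weights):
    """Render tokens as spans whose red background opacity tracks the weight."""
    peak = max(weights) or 1.0  # avoid division by zero if all weights are 0
    spans = []
    for token, weight in zip(tokens, weights):
        alpha = weight / peak  # scale so the strongest token is fully opaque
        spans.append(
            '<span style="background-color: rgba(255, 0, 0, {:.2f})">{}</span>'.format(
                alpha, html.escape(token)))
    return " ".join(spans)


page = attention_to_html(["a", "great", "movie"], [0.1, 0.7, 0.2])
```

Writing the returned string into an .html file and opening it in a browser would show the most attended tokens with the darkest highlight.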


Owner

Kaushal Shetty
