SAINT PyTorch implementation

Last update: Dec 25, 2022

Overview

SAINT-pytorch

A Simple pyTorch implementation of "Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing" based on https://arxiv.org/abs/2002.07033.

SAINT: Separated Self-AttentIve Neural Knowledge Tracing. SAINT has an encoder-decoder structure where exercise and response embedding sequence separately enter the encoder and the decoder respectively, which allows to stack attention layers multiple times.

SAINT model architecture

Usage

import torch
import torch.nn as nn
import torch.nn.functional as F
import numpy as np
import copy

from saint import saint, random_data

seq_len = 100
total_ex = 1200
total_cat = 234
total_in = 2

in_ex, in_cat, in_de = random_data(64, 
                                seq_len , 
                                total_ex, 
                                total_cat, 
                                total_in)


model = saint(dim_model=128,
            num_en=6,
            num_de=6,
            heads_en=8,
            heads_de=8,
            total_ex=total_ex,
            total_cat=total_cat,
            total_in=total_in )

outs = model(in_ex, in_cat, in_de)

print(outs.shape)
# torch.Size([64, 100, 1])

Parameters

dim_model: int.
Dimension of model ( embeddings, attention, linear layers).
num_en: int.
Number of encoder layers.
num_de: int.
Number of decoder layers.
heads_en: int.
Number of heads in multi-head attention block in each layer of encoder.
heads_de: int.
Number of heads in multi-head attention block in each layer of decoder.
total_ex: int.
Total number of unique excercise.
total_cat: int.
Total number of unique concept categories.
total_in: int.
Total number of unique interactions.

todo

change positional embedding to sine.

Citations

@article{choi2020towards,
  title={Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing},
  author={Choi, Youngduck and Lee, Youngnam and Cho, Junghyun and Baek, Jineon and Kim, Byungsoo and Cha, Yeongmin and Shin, Dongmin and Bae, Chan and Heo, Jaewe},
  journal={arXiv preprint arXiv:2002.07033},
  year={2020}
}

@misc{vaswani2017attention,
    title   = {Attention Is All You Need},
    author  = {Ashish Vaswani and Noam Shazeer and Niki Parmar and Jakob Uszkoreit and Llion Jones and Aidan N. Gomez and Lukasz Kaiser and Illia Polosukhin},
    year    = {2017},
    eprint  = {1706.03762},
    archivePrefix = {arXiv},
    primaryClass = {cs.CL}
}

SAINT PyTorch implementation

Related tags

Overview

SAINT-pytorch

SAINT model architecture

Usage

Parameters

todo

Citations

Owner

Arshad Shaikh

Code for using and evaluating SpanBERT.

A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python

Fuzzy String Matching in Python

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Blender addon - Scrub timeline from viewport with a shortcut

Subtitle Workshop (subshop): tools to download and synchronize subtitles

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)

Rank-One Model Editing for Locating and Editing Factual Knowledge in GPT

Named Entity Recognition API used by TEI Publisher

Code for EMNLP20 paper: "ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training"

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

A flask application to predict the speech emotion of any .wav file.

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Text Classification in Turkish Texts with Bert

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.

Include MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.