Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

Last update: Nov 21, 2022

Related tags

Overview

ON-LSTM

This repository contains the code used for word-level language model and unsupervised parsing experiments in Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks paper, originally forked from the LSTM and QRNN Language Model Toolkit for PyTorch. If you use this code or our results in your research, we'd appreciate if you cite our paper as following:

@article{shen2018ordered,
  title={Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks},
  author={Shen, Yikang and Tan, Shawn and Sordoni, Alessandro and Courville, Aaron},
  journal={arXiv preprint arXiv:1810.09536},
  year={2018}
}

Software Requirements

Python 3.6, NLTK and PyTorch 0.4 are required for the current codebase.

Steps

Install PyTorch 0.4 and NLTK
Download PTB data. Note that the two tasks, i.e., language modeling and unsupervised parsing share the same model strucutre but require different formats of the PTB data. For language modeling we need the standard 10,000 word Penn Treebank corpus data, and for parsing we need Penn Treebank Parsed data.
Scripts and commands
- Train Language Modeling python main.py --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Test Unsupervised Parsing python test_phrase_grammar.py --cuda
The default setting in main.py achieves a perplexity of approximately 56.17 on PTB test set and unlabeled F1 of approximately 47.7 on WSJ test set.

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

Related tags

Overview

ON-LSTM

Software Requirements

Steps

Owner

Yikang Shen

A Tensorfflow implementation of Attend, Infer, Repeat

Einshape: DSL-based reshaping library for JAX and other frameworks.

Eye-Blink-Counter - Python based Computer Vision project which counts how many time a person blinks

A collection of differentiable SVD methods and also the official implementation of the ICCV21 paper "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?"

IA for recognising Traffic Signs using Keras [Tensorflow]

Research on Tabular Deep Learning (Python package & papers)

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

Earthquake detection via fiber optic cables using deep learning

Virtual hand gesture mouse using a webcam

Boosting Adversarial Attacks with Enhanced Momentum (BMVC 2021)

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

(CVPR 2022) A minimalistic mapless end-to-end stack for joint perception, prediction, planning and control for self driving.

Continuous Security Group Rule Change Detection & Response at scale

Bayesian dessert for Lasagne

PyTorch implementation of some learning rate schedulers for deep learning researcher.

Lava-DL, but with PyTorch-Lightning flavour

Codebase for the self-supervised goal reaching benchmark introduced in the LEXA paper

Code and data for the paper "Hearing What You Cannot See"

Open source repository for the code accompanying the paper 'Non-Rigid Neural Radiance Fields Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video'.

Prml - Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop