Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Last update: Oct 18, 2022

Overview

Graph Convolution Simulator (GCS)

Requirements:

Install PyTorch and DGL following the instructions for your system. Install the other libraries with the following command:

$ pip install -r requirements.txt

Run Knowledge Integration (KI) interpretation with GCS on the example data:

$ bash run_example.sh

Interpretation results are saved in ./example/example_data/gcs.edgelist.

If the knowledge graph is small, a visualization is available in ./example/example_data/results.pdf. Here are the results for the example data: (figure not shown)

Run Knowledge Integration interpretation with GCS for your own model

Step 1: Prepare the entity embeddings of the vanilla LM and the knowledge-enhanced LM:

Store them in PyTorch tensor (.pt) format. Make sure both files have the same number of rows and that each row index refers to the same entity in both. The default files are emb_roberta.pt and emb_kadapter.pt.
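The check described above can be sketched in a few lines of Python. This is a minimal sketch, not code from the repository: the helper names `check_entity_embeddings` and `save_entity_embeddings` are hypothetical, and it assumes each embedding is a 2-D tensor of shape (num_entities, dim). It does not require the two embedding dimensions to match, since only the number of rows is stated as a requirement.

```python
import torch


def check_entity_embeddings(emb_vlm, emb_klm):
    """Return the number of entities after checking that both tensors are row-aligned.

    Hypothetical helper: assumes 2-D tensors of shape (num_entities, dim).
    """
    if emb_vlm.dim() != 2 or emb_klm.dim() != 2:
        raise ValueError("expected 2-D tensors of shape (num_entities, dim)")
    if emb_vlm.shape[0] != emb_klm.shape[0]:
        raise ValueError(
            f"row mismatch: {emb_vlm.shape[0]} vs {emb_klm.shape[0]} entities"
        )
    return emb_vlm.shape[0]


def save_entity_embeddings(emb_vlm, emb_klm,
                           vlm_path="emb_roberta.pt",
                           klm_path="emb_kadapter.pt"):
    """Validate both embedding tensors, then store them in .pt format."""
    num_entities = check_entity_embeddings(emb_vlm, emb_klm)
    torch.save(emb_vlm, vlm_path)
    torch.save(emb_klm, klm_path)
    return num_entities
```

The check only compares shapes. Whether row i belongs to the same entity in both files cannot be verified from the tensors alone, so it depends on building both from the same entity ordering.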

Step 2: Prepare the knowledge graph:

Three files are needed to load the knowledge graph:

a) qid2idx.json: The index dictionary. The key is the entity Q-label, and the value is the index of the entity in the entity embedding.
b) qid2label.json: The label dictionary. The key is the entity Q-label, and the value is the entity label text. Note that this dictionary is only used for visualization; you can set it to {Q-label: Q-label} if you don't have the text.
c) kg.edgelist: The knowledge triples used to construct the knowledge graph. Each row describes one triple in the form: entity1_idx \t entity2_idx \t {}.
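The three files above could be generated along the following lines. This is a sketch under stated assumptions, not the repository's own tooling: the function `build_kg_files` is hypothetical, the input is assumed to be a list of (head Q-label, tail Q-label) pairs, and indexes are assigned in order of first appearance, which is only valid if the embedding rows from Step 1 follow the same order.

```python
import json
import os


def build_kg_files(edges, labels, out_dir):
    """Write qid2idx.json, qid2label.json and kg.edgelist into out_dir.

    edges:  iterable of (head_qid, tail_qid) pairs (hypothetical input format).
    labels: dict from Q-label to label text; missing entries fall back to
            the Q-label itself, i.e. {Q-label: Q-label}.
    """
    edges = list(edges)  # iterated twice below
    qid2idx = {}
    for head, tail in edges:
        for qid in (head, tail):
            # Assumption: index i must correspond to row i of both embeddings.
            qid2idx.setdefault(qid, len(qid2idx))
    qid2label = {qid: labels.get(qid, qid) for qid in qid2idx}

    os.makedirs(out_dir, exist_ok=True)
    with open(os.path.join(out_dir, "qid2idx.json"), "w", encoding="utf-8") as f:
        json.dump(qid2idx, f)
    with open(os.path.join(out_dir, "qid2label.json"), "w", encoding="utf-8") as f:
        json.dump(qid2label, f)
    with open(os.path.join(out_dir, "kg.edgelist"), "w", encoding="utf-8") as f:
        for head, tail in edges:
            # One triple per row: entity1_idx \t entity2_idx \t {}
            f.write(f"{qid2idx[head]}\t{qid2idx[tail]}\t{{}}\n")
    return qid2idx
```

For example, `build_kg_files([("Q1", "Q2"), ("Q2", "Q3")], {}, "./example_data")` writes a two-row kg.edgelist and uses the Q-labels themselves as label text.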

Step 3: Run GCS for KI interpretation:

After the two preparation steps, you can run GCS with:

$ python src/example.py --emb_vlm emb_roberta.pt --emb_klm emb_kadapter.pt --data_dir ./example_data --lr 1e-3 --loss mi_loss

The available hyperparameters are listed in ./example/src/example.py. For large knowledge graphs, we recommend using the mutual information loss (mi_loss), and we advise against visualizing the results.

Step 4: Analyze GCS interpretation results:

The interpretation results are saved in ./example/example_data/gcs.edgelist. Each row describes one triple in the form: entity1_idx \t entity2_idx \t {'a': xxxx}. Here, the value of 'a' is the attention coefficient on the triple/entity (entity1, r, entity2). Users can use these values to analyze the factual knowledge learned during knowledge integration.
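As a starting point for this analysis, the rows of gcs.edgelist can be parsed and ranked by their 'a' value. The sketch below is illustrative only: `parse_gcs_edgelist` and `top_edges` are hypothetical helpers, and the parser assumes the third field is a Python-style dict literal such as {'a': 0.73}. It splits on any whitespace, so it accepts tab-separated and space-separated rows alike.

```python
import ast


def parse_gcs_edgelist(lines):
    """Parse rows of the form: entity1_idx <ws> entity2_idx <ws> {'a': value}.

    Returns a list of (entity1_idx, entity2_idx, attention) tuples.
    """
    edges = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank rows
        # Split into at most three fields so the dict literal stays intact.
        head, tail, attrs = line.split(None, 2)
        attention = float(ast.literal_eval(attrs)["a"])
        edges.append((int(head), int(tail), attention))
    return edges


def top_edges(edges, k=10):
    """Return the k rows with the largest attention coefficient."""
    return sorted(edges, key=lambda edge: edge[2], reverse=True)[:k]
```

A typical use would be `with open("./example/example_data/gcs.edgelist") as f: edges = parse_gcs_edgelist(f)`, followed by `top_edges(edges, k=20)`. The entity indexes can then be mapped back to Q-labels by inverting qid2idx.json.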

Reproduce the results in the paper

See the ./all_exp folder for details.

Cite

If you use the code, please cite the paper:

@article{hou2022understanding,
  title={Understanding Knowledge Integration in Language Models with Graph Convolutions},
  author={Hou, Yifan and Fu, Guoji and Sachan, Mrinmaya},
  journal={arXiv preprint arXiv:2202.00964},
  year={2022}
}

Contact

Feel free to open an issue or send me ([email protected]) an email if you have any questions!

Owner

yifan
