This tutorial repository is to introduce the functionality of KGTK to first-time users

Last update: Dec 21, 2022

Related tags

Overview

Welcome to the KGTK notebook tutorial

The goal of this tutorial repository is to introduce the functionality of KGTK to first-time users. The Knowledge Graph Toolkit (KGTK) is a comprehensive framework for the creation and exploitation of large hyper-relational knowledge graphs (KGs), designed for ease of use, scalability, and speed. The tutorial consists of several notebooks that demonstrate how to perform network analysis, graph profiling, knowledge enrichment, and embedding computation over a portion of the Wikidata knowledge graph. The tutorial notebooks can be found in the tutorial folder. All notebooks require minimum configuration and can be run locally or in Google Colab in a matter of a few minutes. The input data for the notebooks is stored in the datasets folder. Basic understanding of knowledge graphs is sufficient for this tutorial.

This repository has been created for the purpose of the KGTK tutorial presented at ISWC 2021. For more information on this tutorial, see our website.

Notebooks

01-kgtk-introduction.ipynb introduction to kgtk and kypher.
02-kg-profiling.ipynb performs profiling of a Wikidata subgraph, by computing deep statistics of its classes, instances, and properties.
03-kg-graph-embeddings.ipynb computes graph embeddings of a Wikidata subgraph using kgtk, demonstrates how to use these embeddings for similarity estimation, and visualizes them.
04-kg-enrichment-with-csv.ipynb shows how structured data from IMDb can be integrated into a subset of Wikidata.
05-kg-enrichment-with-lod.ipynb shows how LOD graphs like Getty Vocabulary can be used to enrich Wikidata by using kgtk operations.
06-kg-network-analysis.ipynb analyzes the family network of Arnold Schwarzenegger (Q2685) in Wikidata by using KGTK operations.
07-kg-constraint-validation.ipynb demonstrates how to do constraint validation on one wikidata property.

Running the notebooks in Google Colab

List of steps required to be able to run the ISI Google colab Notebooks.

Make a copy of the notebooks to your Google Drive.

The following tutorial notebooks are available to run in Google Colab

Click on a link, it'll take you to the Google Colab notebook. These are readonly notebook links.

Click on Save a copy in Drive from the File menu as shown.

This will create a copy of the notebook in your Google Drive.

Install `kgtk`

Run the first cell to install kgtk.

If you see this warning,

click on Run anyway to continue

You'll see an error after the install finishes,

This is because of a conflict in Google Colab's python environment. You have to click on the Restart Runtime button.

You do not have to install kgtk again.

In some notebooks, there are a few more installation cells, in case you see the same error as above, please click on Restart Runtime

Run the cells in the notebook

Now, simply run all the cells. The notebook should run successfully.

Google Colab Caveats

The colab VM and python environment is ephemeral. The VM will reset after a while, all the installed libraries and files produced will be lost.
Google Colab File IO. Download / Upload files to Google Colab
You can connect a google drive to the colab notebook to read from and save to.
Users can run the same colab notebook by sharing it with a link. This can have unwanted complications in case multiple people run the same cell at the same time.

Contact

Amandeep Singh ([email protected])
Pedro Szekely ([email protected])
Filip Ilievski ([email protected])

This tutorial repository is to introduce the functionality of KGTK to first-time users

Related tags

Overview

Welcome to the KGTK notebook tutorial

Notebooks

Running the notebooks in Google Colab

Make a copy of the notebooks to your Google Drive.

Install `kgtk`

Run the cells in the notebook

Google Colab Caveats

Contact

Owner

USC ISI I2

Implementation of the federated dual coordinate descent (FedDCD) method.

An implementation of Geoffrey Hinton's paper "How to represent part-whole hierarchies in a neural network" in Pytorch.

Tiny-NewsRec: Efﬁcient and Effective PLM-based News Recommendation

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Newt - a Gaussian process library in JAX.

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

Automatic detection and classification of Covid severity degree in LUS (lung ultrasound) scans

Simple node deletion tool for onnx.

This is a Image aid classification software based on python TK library development

Dynamic Environments with Deformable Objects (DEDO)

Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equations

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Virtual Dance Reality Stage: a feature that offers you to share a stage with another user virtually

A PyTorch implementation of "Signed Graph Convolutional Network" (ICDM 2018).

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

Bounding Wasserstein distance with couplings

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

This tutorial repository is to introduce the functionality of KGTK to first-time users

Related tags

Overview

Welcome to the KGTK notebook tutorial

Notebooks

Running the notebooks in Google Colab

Make a copy of the notebooks to your Google Drive.

Install kgtk

Run the cells in the notebook

Google Colab Caveats

Contact

Owner

USC ISI I2

Implementation of the federated dual coordinate descent (FedDCD) method.

An implementation of Geoffrey Hinton's paper "How to represent part-whole hierarchies in a neural network" in Pytorch.

Tiny-NewsRec: Efﬁcient and Effective PLM-based News Recommendation

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Newt - a Gaussian process library in JAX.

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

Automatic detection and classification of Covid severity degree in LUS (lung ultrasound) scans

Simple node deletion tool for onnx.

This is a Image aid classification software based on python TK library development

Dynamic Environments with Deformable Objects (DEDO)

Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equations

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Virtual Dance Reality Stage: a feature that offers you to share a stage with another user virtually

A PyTorch implementation of "Signed Graph Convolutional Network" (ICDM 2018).

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

Bounding Wasserstein distance with couplings

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

Install `kgtk`