HyperLib: Deep learning in the Hyperbolic space


Background

This library implements common neural network components in hyperbolic space, using the Poincaré model. It uses TensorFlow as a backend, integrates easily with Keras, and is meant to help data scientists, machine learning engineers, researchers, and others implement hyperbolic neural networks.

You can also use this library outside of neural networks, via the mathematical functions available in the Poincare class. In the future we may implement components that can be used in models other than neural networks. You can learn more about hyperbolic networks here.

Example Usage

Install the library

pip install hyperlib

Creating a hyperbolic neural network using Keras:

import tensorflow as tf
from tensorflow import keras
from hyperlib.nn.layers.lin_hyp import LinearHyperbolic
from hyperlib.nn.optimizers.rsgd import RSGD
from hyperlib.manifold.poincare import Poincare

# Create layers
hyperbolic_layer_1 = LinearHyperbolic(32, Poincare(), 1)
hyperbolic_layer_2 = LinearHyperbolic(32, Poincare(), 1)
output_layer = LinearHyperbolic(10, Poincare(), 1)

# Create optimizer
optimizer = RSGD(learning_rate=0.1)

# Create model architecture
model = tf.keras.models.Sequential([
  hyperbolic_layer_1,
  hyperbolic_layer_2,
  output_layer
])

# Compile the model with the Riemannian optimizer
model.compile(
    optimizer=optimizer,
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=[tf.keras.metrics.SparseCategoricalAccuracy()],
)
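
Once compiled, the model trains like any other Keras model. A minimal sketch, assuming the layers accept standard Euclidean input batches and using flattened MNIST-style inputs (the dataset and preprocessing here are illustrative assumptions, not part of hyperlib):

# Illustrative only: any dataset of flat float vectors works the same way
(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()
x_train = x_train.reshape(-1, 784).astype("float64") / 255.0
x_test = x_test.reshape(-1, 784).astype("float64") / 255.0

model.fit(x_train, y_train, batch_size=64, epochs=5,
          validation_data=(x_test, y_test))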

Using math functions on the Poincare ball:

import tensorflow as tf
from hyperlib.manifold.poincare import Poincare

p = Poincare()

# Create two matrices
a = tf.constant([[5.0,9.4,3.0],[2.0,5.2,8.9],[4.0,7.2,8.9]])
b = tf.constant([[4.8,1.0,2.3]])

# Möbius matrix-vector multiplication on the Poincare ball
curvature = 1
p.mobius_matvec(a, b, curvature)
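
For reference, mobius_matvec computes the Möbius matrix-vector product; in the formulation of Ganea et al. (Hyperbolic Neural Networks), for curvature parameter c this is

M \otimes_c x = \frac{1}{\sqrt{c}} \tanh\!\left( \frac{\lVert Mx \rVert}{\lVert x \rVert} \tanh^{-1}\!\left( \sqrt{c}\, \lVert x \rVert \right) \right) \frac{Mx}{\lVert Mx \rVert}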

TODO:

  • Implement an Attention Mechanism
  • Implement a Riemannian Adam Optimizer
  • Remove casting of layer variables to tf.float64

References

[1] Chami, I., Ying, R., Ré, C., and Leskovec, J. Hyperbolic Graph Convolutional Neural Networks. NeurIPS 2019.

[2] Nickel, M. and Kiela, D. Poincaré Embeddings for Learning Hierarchical Representations. NIPS 2017.

[3] Khrulkov, V., Mirvakhabova, L., Ustinova, E., Oseledets, I., and Lempitsky, V. Hyperbolic Image Embeddings. CVPR 2020.

[4] Peng, W., Varanka, T., Mostafa, A., Shi, H., and Zhao, G. Hyperbolic Deep Neural Networks: A Survey.

Comments
  • Sarkar Embedding and CICD


    Overview

    This PR has two parts

    1. Sarkar tree embedding for 2 and 3 dimensions
    2. CICD testing and building

    Sarkar Embedding

    The two main functions are sarkar_embedding and sarkar_embedding_3D.
    These are used to embed a (weighted) tree into the 2D or 3D Poincare ball[^1]. The 3D version uses a "fibonacci coding" for distributing points on a 2D sphere[^2].

    [^1]: https://homepages.inf.ed.ac.uk/rsarkar/papers/HyperbolicDelaunayFull.pdf
    [^2]: http://extremelearning.com.au/evenly-distributing-points-on-a-sphere/

    Example usage:

    from hyperlib.util.graph import binary_tree
    from hyperlib.util.sarkar import sarkar_embedding_3D
    
    T = binary_tree(4) # a depth 4 binary tree
    root = 0 # the index to use as the root
    tau = 0.4 # the scaling factor for edges
    emb = sarkar_embedding_3D(T, root, tau=tau, precision=40)
    

    Note that both of these functions use mpmath for high precision calculations and return an mpmath.matrix. I added high precision Poincare math functions in the utils package.
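
    If you need the result as ordinary floats, the mpmath matrix can be cast down; a minimal sketch (note that this cast throws away exactly the extra precision the functions work to preserve):

    import numpy as np

    # emb is the mpmath.matrix returned by sarkar_embedding_3D above
    emb_np = np.array(emb.tolist(), dtype=np.float64)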

    Sarkar's algorithm can be extended to higher dimensions, but the "spherical coding" part is more difficult. We will address this in the future.

    CICD

    I added two basic workflows. One runs a linter and tests on push and pull requests to main.
    The other triggers when we tag a version on main. It builds wheels and uploads them to PyPI and Test PyPI. I added API tokens for nalex's PyPI account as GitHub secrets on this repo. The wheels are built for linux, windows, and mac on x86_64, amd64, i686 architectures using cibuildwheel. I tested on ubuntu and @sourface94 tested on windows.

    To build locally, run

    python setup.py build
    

    Note that you need a C++ compiler to build the treerep part. On linux you need gxx_linux-64, which can be installed with conda install -c conda-forge gxx_linux-64.

    When developing, please run the tests locally first. To run the linter:

    python -m pip install flake8
    flake8 . --count --select=E9,F63,F7,F82 --show-source --statistics
    

    and to run the tests, just run pytest -v tests/.

    Addendum

    I updated the embedding example in the readme and made an examples/ folder for future examples.
    I also added install instructions.

    opened by meiji163 1
  • TreeRep and hyperbolic functions


    • Added embedding package with TreeRep cxx source
    • Added setup.py with pybind11, tested on ARM MacOS Big Sur
    • Fixed some Poincare functions and added new ones
    opened by meiji163 1
  • Sarkar Embedding


    This PR adds Sarkar's algorithm for embedding a tree in 2 and 3 dimensional hyperbolic space, plus more high precision utility functions for the Poincare disc.

    TODO:

    • Sarkar's algorithm for >3 dimensions. We have to implement a solution to the "spherical coding" problem in high dimensions.
    • unit tests
    opened by meiji163 0
  • TreeRep


    1. Added Treerep cxx source with pybind11 wrapper. To use it, simply call the function embedding.graph.treerep on a distance matrix; a sketch follows this list. There are also functions to convert the resulting tree to a scipy.csgraph or a networkx graph.
    2. Added functions for working with distance matrices and measuring delta-hyperbolicity under embedding.metric
    3. Added utils.multiprecision for high precision calculations using mpmath (necessary for calculating large hyperbolic distances accurately)
    4. Fixed a few Poincare class functions and added some
    5. Made tests for treerep
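
    A hypothetical sketch of that call (the import path and array format are assumed from the description above):

    import numpy as np
    from hyperlib.embedding.graph import treerep

    # symmetric pairwise distance matrix with zero diagonal for 4 points
    D = np.array([
        [0.0, 2.0, 4.0, 4.0],
        [2.0, 0.0, 4.0, 4.0],
        [4.0, 4.0, 0.0, 2.0],
        [4.0, 4.0, 2.0, 0.0],
    ])
    tree = treerep(D)  # fit a weighted tree approximating the metric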

    Built and tested successfully on MacOS Big Sur and Manjaro Linux

    opened by meiji163 0
  • optimizers & hyperbolic funcs


    • Moved hyperbolic functions to utils.math. When working with the library it was inconvenient having to access functions through the "Poincare" class. The benefit of having the class isn't clear to me.
    • Fixed some errors in the hyperbolic functions and added a few functions (hyp_dist, clipped_norm, parallel_transport, gyr, lambda_x)
    • Rewrote RSGD. Optimizers should inherit from keras optimizer_v2 and implement sparse and dense updates separately (see here); a skeleton of this pattern is sketched after this list. It should be possible to add momentum next.
    • Attempted RAdam, still buggy. The problem is parallel transport of momentum (see Becigneul & Ganea pg. 5)
    • Added Jupyter notebook in /examples to demo word embedding. Not sure where to put this but it could be nice to have more demos/tutorials in the future.
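
    A minimal skeleton of the optimizer pattern described above, assuming a manifold object exposing egrad2rgrad(grad, x, c) and expmap(v, x, c) (hypothetical signatures; this is a sketch, not hyperlib's actual RSGD):

    import tensorflow as tf

    class RiemannianSGDSketch(tf.keras.optimizers.Optimizer):
        def __init__(self, manifold, learning_rate=0.01, name="RiemannianSGDSketch", **kwargs):
            super().__init__(name, **kwargs)
            self.manifold = manifold
            self._set_hyper("learning_rate", learning_rate)

        def _resource_apply_dense(self, grad, var, apply_state=None):
            lr = self._decayed_lr(var.dtype)
            # rescale the Euclidean gradient to a Riemannian one, then retract
            rgrad = self.manifold.egrad2rgrad(grad, var, c=1.0)
            return var.assign(self.manifold.expmap(-lr * rgrad, var, c=1.0))

        def _resource_apply_sparse(self, grad, var, indices, apply_state=None):
            lr = self._decayed_lr(var.dtype)
            rows = tf.gather(var, indices)
            rgrad = self.manifold.egrad2rgrad(grad, rows, c=1.0)
            new_rows = self.manifold.expmap(-lr * rgrad, rows, c=1.0)
            return self._resource_scatter_update(var, indices, new_rows)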
    opened by meiji163 0
  • Hyperlib not using GPU


    I am trying to train on google colab using the following code

    from random import choice
    import numpy as np
    import pandas as pd
    import tensorflow as tf
    from tensorflow import keras
    
    from hyperlib.manifold.lorentz import Lorentz
    from hyperlib.manifold.poincare import Poincare
    from hyperlib.models.pehr import HierarchicalEmbeddings
    
    
    def load_wordnet_data(file, negatives=20):
        noun_closure = pd.read_csv(file)
        noun_closure_np = noun_closure[["id1","id2"]].values
    
        edges = set()
        for i, j in noun_closure_np:
            edges.add((i,j))
    
        unique_nouns = list(set(
            noun_closure["id1"].tolist()+noun_closure["id2"].tolist()
        ))
    
        noun_closure["neg_pairs"] = noun_closure["id1"].apply(get_neg_pairs, args=(edges, unique_nouns, 20,))
        return noun_closure, unique_nouns
    
    def get_neg_pairs(noun, edges, unique_nouns, negatives=20):
        neg_list = []
        while len(neg_list) < negatives:
            neg_noun = choice(unique_nouns)
            if neg_noun != noun \
            and neg_noun not in neg_list \
            and not ((noun, neg_noun) in edges or (neg_noun, noun) in edges):
                neg_list.append(neg_noun)
        return neg_list
    
    
    # Make training dataset
    noun_closure, unique_nouns = load_wordnet_data("mammal_closure.csv", negatives=15)
    noun_closure_dataset = noun_closure[["id1","id2"]].values
    
    batch_size = 16
    train_dataset = tf.data.Dataset.from_tensor_slices(
            (noun_closure_dataset, noun_closure["neg_pairs"].tolist()))
    train_dataset = train_dataset.shuffle(buffer_size=1024).batch(batch_size)
    
    # Create model
    model = HierarchicalEmbeddings(vocab=unique_nouns, embedding_dim=10)
    sgd = keras.optimizers.SGD(learning_rate=1e-2, momentum=0.9)
    
    # Run custom training loop
    model.fit(train_dataset, sgd, epochs=20)
    embs = model.get_embeddings()
    
    M = Poincare()
    mammal = M.expmap0(model(tf.constant('dog.n.01')), c=1)
    dists = M.dist(mammal, embs, c=1.0)
    top = tf.math.top_k(-dists[:,0], k=20)
    for i in top.indices:
        print(unique_nouns[i],': ',-dists[i,0].numpy())
    

    I see that the GPU is not being used when I inspect the GPU usage. Kindly help

    opened by rahulsee 0
  • model.fit does not show any output or progress


    When running model.fit as per this code example, I am unable to make out whether any progress is happening or whether the training has hung. Kindly help

    This is the code I am trying to run

    from random import choice
    import numpy as np
    import pandas as pd
    import tensorflow as tf
    from tensorflow import keras
    
    from hyperlib.manifold.lorentz import Lorentz
    from hyperlib.manifold.poincare import Poincare
    from hyperlib.models.pehr import HierarchicalEmbeddings
    
    
    def load_wordnet_data(file, negatives=20):
        noun_closure = pd.read_csv(file)
        noun_closure_np = noun_closure[["id1","id2"]].values
    
        edges = set()
        for i, j in noun_closure_np:
            edges.add((i,j))
    
        unique_nouns = list(set(
            noun_closure["id1"].tolist()+noun_closure["id2"].tolist()
        ))
    
        noun_closure["neg_pairs"] = noun_closure["id1"].apply(get_neg_pairs, args=(edges, unique_nouns, 20,))
        return noun_closure, unique_nouns
    
    def get_neg_pairs(noun, edges, unique_nouns, negatives=20):
        neg_list = []
        while len(neg_list) < negatives:
            neg_noun = choice(unique_nouns)
            if neg_noun != noun \
            and neg_noun not in neg_list \
            and not ((noun, neg_noun) in edges or (neg_noun, noun) in edges):
                neg_list.append(neg_noun)
        return neg_list
    
    
    # Make training dataset
    noun_closure, unique_nouns = load_wordnet_data("mammal_closure.csv", negatives=15)
    noun_closure_dataset = noun_closure[["id1","id2"]].values
    
    batch_size = 16
    train_dataset = tf.data.Dataset.from_tensor_slices(
            (noun_closure_dataset, noun_closure["neg_pairs"].tolist()))
    train_dataset = train_dataset.shuffle(buffer_size=1024).batch(batch_size)
    
    # Create model
    model = HierarchicalEmbeddings(vocab=unique_nouns, embedding_dim=10)
    sgd = keras.optimizers.SGD(learning_rate=1e-2, momentum=0.9)
    
    # Run custom training loop
    model.fit(train_dataset, sgd, epochs=20)
    embs = model.get_embeddings()
    
    M = Poincare()
    mammal = M.expmap0(model(tf.constant('dog.n.01')), c=1)
    dists = M.dist(mammal, embs, c=1.0)
    top = tf.math.top_k(-dists[:,0], k=20)
    for i in top.indices:
        print(unique_nouns[i],': ',-dists[i,0].numpy())
    
    opened by rahulsee 1
  • RSGD Improvements


    Right now optimizers.rsgd is implemented specifically for the Poincare model.
    We should add an interface that works for any model.
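
    One possible shape for such an interface, sketched with hypothetical method names:

    from abc import ABC, abstractmethod

    class Manifold(ABC):
        # minimal surface an RSGD implementation would need from any model
        @abstractmethod
        def egrad2rgrad(self, grad, x, c):
            """Rescale a Euclidean gradient to the Riemannian gradient at x."""

        @abstractmethod
        def expmap(self, v, x, c):
            """Move from x along the tangent vector v."""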

    TODO: write other improvements here

    opened by meiji163 1
  • Solve Precision Issues


    This is a tracking issue for solving precision issues.

    Problem Overview

    Precision is one of the major obstacles to the adoption of hyperbolic geometry in machine learning.
    As shown in Representation Tradeoffs for Hyperbolic Embeddings, there is a tradeoff between precision and dimensionality when representing points in hyperbolic space with floats, independent of the model that is used.

    Hyperlib should have a solution to this in its core components. Ideally the solution will satisfy the following.

    1. reasonably efficient: it doesn't incur significant overhead compared to Euclidean methods and is GPU compatible
    2. easy to use: it's abstracted away from the API so that a casual user doesn't have to touch it
    3. general: it's general enough to be used with different models of hyperbolic space

    Approaches

    Hope for the best

    We see many papers that simply accept the precision errors and try to mitigate them, or go to higher dimensions.
    E.g. our current approach in the Poincare model is to cast to tf.float64, which only gets us 53 bits of precision.

    Multiprecision

    In the sarkar embeddings, we use the multi-precision library mpmath to represent points. As far as multiprecision arithmetic goes it is fast (assuming it is using the gmpy backend). However, its support for vector operations is poor, and it cannot easily interoperate with numpy or tensorflow. Also, we do not yet have a good method to automatically determine the precision setting (for example, sarkar_embedding uses far too much precision by default).

    Avoiding the Problem

    One common approach to avoid precision errors, especially in hyperbolic SGD, is to map from the (Euclidean) tangent space and do all operations there instead. We should definitely experiment with and support this method in Hyperlib. This will work for all models via the exponential map. However, it only solves part of the problem.
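
    Concretely, for the Poincare ball with curvature -c, the maps between the origin's tangent space and the manifold are the standard ones (e.g. Ganea et al.):

    \exp_0^c(v) = \tanh(\sqrt{c}\,\lVert v \rVert)\, \frac{v}{\sqrt{c}\,\lVert v \rVert}, \qquad \log_0^c(y) = \tanh^{-1}(\sqrt{c}\,\lVert y \rVert)\, \frac{y}{\sqrt{c}\,\lVert y \rVert}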

    Multi-Component Float

    Multi-Component Floats (MCF) are an alternate representation for floats that can be vectorized, proposed by Yu and De Sa as a way to do calculations in the upper half-space model. IMO this is the most promising approach if it can be extended to other models of hyperbolic space.
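
    To illustrate the flavor of the idea (this is the textbook error-free transformation that multi-component representations build on, not Yu and De Sa's actual construction): a value is kept as an unevaluated sum of machine floats, and each primitive preserves the rounding error exactly.

    def two_sum(a: float, b: float):
        # Knuth's error-free addition: s + err == a + b exactly
        s = a + b
        bb = s - a
        err = (a - (s - bb)) + (b - bb)
        return s, err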

    Todos

    • [ ] Spike: implementing MCF for upper-half space
    tracking 
    opened by meiji163 0
  • Tracking Issue: Documentation


    This is a tracking issue for improving documentation as hyperlib develops.
    This will be important for getting more people to use hyperbolic ML (esp. examples)

    • [ ] Standardize function doc strings
    • [ ] Generate docs with sphinx
    • [ ] Put up documentation site
    • [ ] Examples (expand on this point later)
    documentation tracking 
    opened by meiji163 0
Releases (v0.0.6)