EGNN - Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch

Last update: Jan 04, 2023

Overview

EGNN - Pytorch

Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch. May eventually be used for Alphafold2 replication. This technique relies on simple invariant features, and is reported to beat all previous methods (including SE3 Transformer and Lie Conv) in both accuracy and performance. SOTA in dynamical system models, molecular activity prediction tasks, etc.

Install

$ pip install egnn-pytorch

Usage

import torch
from egnn_pytorch import EGNN

layer1 = EGNN(dim = 512)
layer2 = EGNN(dim = 512)

feats = torch.randn(1, 16, 512)
coors = torch.randn(1, 16, 3)

feats, coors = layer1(feats, coors)
feats, coors = layer2(feats, coors) # (1, 16, 512), (1, 16, 3)

With edges

import torch
from egnn_pytorch import EGNN

layer1 = EGNN(dim = 512, edge_dim = 4)
layer2 = EGNN(dim = 512, edge_dim = 4)

feats = torch.randn(1, 16, 512)
coors = torch.randn(1, 16, 3)
edges = torch.randn(1, 16, 16, 4)

feats, coors = layer1(feats, coors, edges)
feats, coors = layer2(feats, coors, edges) # (1, 16, 512), (1, 16, 3)

Citations

@misc{satorras2021en,
    title 	= {E(n) Equivariant Graph Neural Networks}, 
    author 	= {Victor Garcia Satorras and Emiel Hoogeboom and Max Welling},
    year 	= {2021},
    eprint 	= {2102.09844},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

training batch size

Dear authors,

thanks for your great work! I saw your example, which is easy to understand. However, I notice that during training each iteration seems to support batch size > 1 only when all graphs in the batch share the same adj_mat. Do you have a better solution for that? Thanks.
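One common way to batch graphs with different connectivity, sketched here independently of the library, is to pad every graph to the largest node count and carry one adjacency matrix and one node mask per graph. The helper name `pad_batch` and the input layout are hypothetical. Whether a given EGNN version accepts a per-graph adjacency matrix and a mask should be checked against its source.

```python
def pad_batch(graphs):
    """Pad graphs of different sizes into one batch.

    graphs: list of (num_nodes, edges), where edges is a list of (i, j) pairs.
    Returns (adj, mask): adj[b][i][j] is True if graph b has an edge i-j,
    mask[b][i] is True for real (non-padding) nodes.
    """
    max_nodes = max(num_nodes for num_nodes, _ in graphs)
    adj, mask = [], []
    for num_nodes, edges in graphs:
        a = [[False] * max_nodes for _ in range(max_nodes)]
        for i, j in edges:
            a[i][j] = a[j][i] = True  # treat edges as undirected
        adj.append(a)
        mask.append([k < num_nodes for k in range(max_nodes)])
    return adj, mask


graphs = [
    (3, [(0, 1), (1, 2)]),                   # path on 3 nodes
    (4, [(0, 1), (1, 2), (2, 3), (3, 0)]),   # cycle on 4 nodes
]
adj, mask = pad_batch(graphs)
```

The nested lists can be turned into boolean tensors with `torch.tensor(adj)` and `torch.tensor(mask)`, giving shapes (batch, n, n) and (batch, n).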

opened by futianfan 6
Import Error when torch_geometric is not available

https://github.com/lucidrains/egnn-pytorch/blob/e35510e1be94ee9f540bf2ffea49cd63578fe473/egnn_pytorch/egnn_pytorch.py#L413

A small problem, this Tensor is not defined.

Thanks for your work.

opened by zrt 4
About aggregations in EGNN_sparse

Hi, thanks for your great work!

I have a question on how aggregations are computed for the node embedding and the coordinate embedding. In the paper, the aggregation for the node embedding is computed over a node's neighbors, while the aggregation for the coordinate embedding is computed over all other nodes. However, in EGNN_sparse, I didn't notice such a difference in the aggregations.

I guess it is because computing all-pair messages for coordinate embedding makes 'sparse' meaningless, but I would like to double-check to see if I get this correctly. So anyway, did you do this intentionally? Or did I miss something?
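For reference, the layer equations from the cited paper (Satorras et al., 2021) can be written as below, in the variant the question describes: messages are aggregated over the neighborhood $\mathcal{N}(i)$ for the node update and over all $j \neq i$ for the coordinate update. The notation follows the paper as best recalled and should be checked against it. $h$ are node features, $x$ coordinates, $a_{ij}$ edge attributes, and $C$ a normalization constant.

```latex
\begin{aligned}
m_{ij} &= \phi_e\!\left(h_i^{l},\, h_j^{l},\, \lVert x_i^{l} - x_j^{l} \rVert^{2},\, a_{ij}\right) \\
x_i^{l+1} &= x_i^{l} + C \sum_{j \neq i} \left(x_i^{l} - x_j^{l}\right) \phi_x\!\left(m_{ij}\right) \\
m_i &= \sum_{j \in \mathcal{N}(i)} m_{ij} \\
h_i^{l+1} &= \phi_h\!\left(h_i^{l},\, m_i\right)
\end{aligned}
```

Note that the edge function takes the squared distance, and that the node update as written has no explicit residual term.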

My appreciation.

opened by simon1727 4
Few queries on the implementation

Hi - fast work coding these things up, as usual! Looking at the paper and your code, you're not using squared distance for the edge weighting. Is that intentional? Also, it looks like you are adding the old feature vectors to the new ones rather than taking the new vectors directly from the fully connected net - is that also an intentional change from the paper?

opened by denjots 3
Fix PyG problems, add example for point cloud denoising

Fixed some small errors in the data flow of the PyG layers (mainly dimensions and slices).

Fixed EGNN_Sparse_Network so that it now works.

Added an example for point cloud denoising (from Gaussian-masked coordinates), which showcases potential issues:

unstable (could be due to the nature of the data, not sure, but GVP does well on it)

not able to beat the baseline (in contrast, GVP gets to 0.8 RMSD, while this reaches the baseline of 1 RMSD but not below it)
opened by hypnopump 2
EGNN_sparse incorrect positional encoding output
Hi, many thanks for the implementation!

I was quickly checking the code for the pytorch geometric implementation of the EGNN_sparse layer, and I noticed that it expects the first 3 columns in the features to be the coordinates. However, in the update method, features and coordinates are passed in the wrong order.

https://github.com/lucidrains/egnn-pytorch/blob/375d686c749a685886874baba8c9e0752db5f5be/egnn_pytorch/egnn_pytorch.py#L192

This may cause problems during learning (think of concatenating several of these layers), as they expect coordinate and feature order to be consistent.

One can reproduce this behaviour in the following snippet:

    # `rot` (rotation matrix from three angles) and EGNN_sparse are assumed to be in scope
    layer = EGNN_sparse(feats_dim=1, pos_dim=3, m_dim=16, fourier_features=0)

    R = rot(*torch.rand(3))    # random rotation
    T = torch.randn(1, 1, 3)   # random translation

    feats = torch.randn(16, 1)
    coors = torch.randn(16, 3)
    x1 = torch.cat([coors, feats], dim=-1)
    x2 = torch.cat([(coors @ R + T).squeeze(), feats], dim=-1)

    edge_idxs = (torch.rand(2, 20) * 16).long()
    out1 = layer(x=x1, edge_index=edge_idxs)
    out2 = layer(x=x2, edge_index=edge_idxs)

After fixing the order of these arguments in the update method, the layer behaves as expected: the output features are invariant, and the output coordinates are equivariant under SE(3) transformations.
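The same kind of check can be sketched without the library. The toy layer below is not the EGNN implementation. It is a hypothetical stand-in whose weights depend only on squared distances, which is the property that makes features invariant and coordinates equivariant. The names `toy_layer` and `transform` are illustrative.

```python
import math


def toy_layer(feats, coors):
    """One toy EGNN-style update; weights depend only on squared distances."""
    n = len(coors)
    new_feats, new_coors = [], []
    for i in range(n):
        feat_msg = 0.0
        shift = [0.0, 0.0, 0.0]
        for j in range(n):
            if i == j:
                continue
            delta = [a - b for a, b in zip(coors[i], coors[j])]
            d2 = sum(c * c for c in delta)   # unchanged by rotation and translation
            w = 1.0 / (1.0 + d2)             # stand-in for a learned edge MLP
            feat_msg += w * feats[j]
            shift = [s + w * c / (n - 1) for s, c in zip(shift, delta)]
        new_feats.append(feats[i] + feat_msg)
        new_coors.append([x + s for x, s in zip(coors[i], shift)])
    return new_feats, new_coors


def transform(coors, theta, t):
    """Rotate about the z axis by theta, then translate by t."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c * x - s * y + t[0], s * x + c * y + t[1], z + t[2]]
            for x, y, z in coors]


feats = [0.5, -1.0, 2.0, 0.25]
coors = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [1.0, 1.0, 3.0]]
theta, t = 0.7, [1.0, -2.0, 0.5]

f1, c1 = toy_layer(feats, coors)
f2, c2 = toy_layer(feats, transform(coors, theta, t))

# Features should not change; coordinates should follow the transformation.
feats_invariant = all(math.isclose(a, b, abs_tol=1e-9) for a, b in zip(f1, f2))
coors_equivariant = all(
    math.isclose(a, b, abs_tol=1e-9)
    for p, q in zip(transform(c1, theta, t), c2)
    for a, b in zip(p, q)
)
```

If the coordinate and feature columns were swapped inside the layer, a check like this would be expected to fail, which is what makes it a useful regression test.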
opened by josejimenezluna 2

NaN values after stacking multiple layers

Hi Lucid!!

I find that when stacking multiple layers, the output of the model rapidly goes to NaN. I suspect it may be related to the weights used for initialization.

Here is a minimal working example:

Make some data:

    import numpy as np
    import torch
    from egnn_pytorch import EGNN
    
    torch.set_default_dtype(torch.double)

    zline = np.arange(0, 2, 0.05)
    xline = np.sin(zline * 2 * np.pi) 
    yline = np.cos(zline * 2 * np.pi)
    points = np.array([xline, yline, zline])
    geom = torch.tensor(points.transpose())[None,:]
    feat = torch.randint(0, 20, (1, geom.shape[1],1))

Make a model:

    class ResEGNN(torch.nn.Module):
        def __init__(self, depth = 2, dims_in = 1):
            super().__init__()
            self.layers = torch.nn.ModuleList([EGNN(dim = dims_in) for i in range(depth)])
        
        def forward(self, geom, feat):
            for layer in self.layers:
                feat, geom = layer(feat, geom)
            return geom

Run model for varying depths:

    for i in range(10):
        model = ResEGNN(depth = i)
        pred = model(geom, feat)
        mean_absolute_value  = torch.abs(pred).mean()
        print("Order of predictions {:.2f}".format(np.log(mean_absolute_value.detach().numpy())))

Output:

    Order of predictions -0.29
    Order of predictions 0.05
    Order of predictions 6.65
    Order of predictions 21.38
    Order of predictions 78.25
    Order of predictions 302.71
    Order of predictions 277.38
    Order of predictions nan
    Order of predictions nan
    Order of predictions nan

opened by brennanaba 2

Edge features thrown out

Hi, thanks for this implementation!

I was wondering if the pytorch-geometric implementation of this architecture is throwing the edge features out by mistake, as seen here

https://github.com/lucidrains/egnn-pytorch/blob/1b8320ade1a89748e4042ae448626652f1c659a1/egnn_pytorch/egnn_pytorch.py#L148-L151

Or maybe my understanding is wrong? Cheers,

opened by josejimenezluna 2
solve ij -> i bottleneck in sparse version
I don't recommend normalizing the weights or the coords.

The weights are the coefficients that multiply the delta in the i->j direction.

The coords are the deltas in the i->j direction.

I can't see the advantage of normalizing them beyond a naive stabilization, which might hurt convergence by requiring more layers, since each layer can then only perform a limited transformation.

It works fine for denoising without normalization (the instability might come from huge outliers, but tuning the learning rate or clipping the gradients might help).
opened by hypnopump 0
Questions about the EGNN code

Recently, I've been trying to read the EGNN paper and study your EGNN code. I've had a hard time understanding both the paper and the code because my major is not computer science. While studying your code, I realized that hidden_out and kwargs["x"] must have the same shape for the add operation (because of the residual connection) in the forward method of the EGNN_sparse class. How can I increase or decrease the hidden dimension size of x?

I would like to get some advice.

Thanks for your consideration in this regard.

opened by Byun-jinyoung 0
Wrong edge_index size hint in class EGNN_Sparse of pyg version

Hi, I found what may be a small mistake. In the input hint of the EGNN_Sparse class in the PyG version, the size of edge_index is given as (n_edges, 2). However, it should be (2, n_edges); otherwise the distance calculation will not be correct.

    """ Inputs:
        * x: (n_points, d) where d is pos_dims + feat_dims
        * edge_index: (n_edges, 2)
        * edge_attr: tensor (n_edges, n_feats) excluding basic distance feats.
        * batch: (n_points,) long tensor. specifies xloud belonging for each point
        * angle_data: list of tensors (levels, n_edges_i, n_length_path) long tensor.
        * size: None
    """
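To illustrate why the layout matters, here is a small dependency-free sketch. With the (2, n_edges) layout, the first row holds source indices and the second row holds target indices, so unpacking the two rows gives the endpoints of each edge. The function name `squared_edge_lengths` is hypothetical and not part of the library.

```python
def squared_edge_lengths(coors, edge_index):
    """Squared distance per edge; expects edge_index in (2, n_edges) layout."""
    source, target = edge_index  # row 0: source nodes, row 1: target nodes
    return [
        sum((a - b) ** 2 for a, b in zip(coors[s], coors[t]))
        for s, t in zip(source, target)
    ]


# Edges given as (n_edges, 2) pairs must be transposed first.
edge_pairs = [(0, 1), (1, 2), (2, 0)]
edge_index = [list(row) for row in zip(*edge_pairs)]  # now (2, n_edges)

coors = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [1.0, 2.0, 0.0]]
dists = squared_edge_lengths(coors, edge_index)
```

With tensors, the same conversion is a transpose of the (n_edges, 2) tensor, for example `edge_pairs.t().contiguous()`.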

opened by Layne-Huang 2
Exploding Gradients With 4 Layers

I'm using EGNN with 4 layers (where I also do global attention after each layer), and I'm seeing exploding gradients after 90 epochs or so. I'm using techniques discussed earlier (sparse attention matrix, coor_weights_clamp_value, norm_coors), but I'm not sure if there's anything else I should be doing. I'm also not updating the coordinates, so the fix in the pull request doesn't apply.

opened by cutecows 0
Added optional tanh to coors_mlp

This removes the NaN bug completely (must also use norm_coors otherwise performance dies)

The NaN bug comes from the coors_mlp exploding, so forcing values between -1 and 1 prevents this. If coordinates are normalised then performance should not be adversely affected.
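The mechanism can be illustrated with a toy model that is independent of the library. In the sketch below, `0.5 * d2` is a hypothetical stand-in for an unbounded coors_mlp output, `squash` mimics the optional tanh, and `normalize` mimics norm_coors by using unit-length directions. None of this is the actual EGNN code.

```python
import math


def stack_layers(coors, depth, squash=False, normalize=False):
    """Apply `depth` toy coordinate updates and return the new coordinates."""
    for _ in range(depth):
        n = len(coors)
        updated = []
        for i in range(n):
            shift = [0.0, 0.0, 0.0]
            for j in range(n):
                if i == j:
                    continue
                delta = [a - b for a, b in zip(coors[i], coors[j])]
                d2 = sum(c * c for c in delta)
                w = 0.5 * d2                     # unbounded weight (toy stand-in)
                if squash:
                    w = math.tanh(w)             # force the weight into [-1, 1]
                if normalize:
                    norm = math.sqrt(d2) + 1e-8  # unit-length direction
                    delta = [c / norm for c in delta]
                shift = [s + w * c / (n - 1) for s, c in zip(shift, delta)]
            updated.append([x + s for x, s in zip(coors[i], shift)])
        coors = updated
    return coors


start = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

raw = stack_layers(start, depth=12)
stable = stack_layers(start, depth=12, squash=True, normalize=True)

raw_finite = all(math.isfinite(v) for p in raw for v in p)
stable_finite = all(math.isfinite(v) for p in stable for v in p)
stable_max = max(abs(v) for p in stable for v in p)
```

In the unbounded case the weights grow with the distances they create, so the coordinates overflow within a few layers. With both the squashed weight and the normalized direction, each layer can move a point by at most one unit, so growth is at most linear in depth.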

opened by jscant 1

Releases(0.2.6)

0.2.6 (Jun 8, 2021)
0.2.5 (Jun 5, 2021)
0.2.4 (Jun 5, 2021)
0.2.3 (Jun 5, 2021)
0.2.2 (Jun 5, 2021)
0.2.1 (Jun 5, 2021)
0.2.0 (Jun 4, 2021)
0.1.12 (May 20, 2021)
0.1.11 (May 16, 2021)
0.1.10 (May 15, 2021)
0.1.9 (May 15, 2021)
0.1.8 (May 15, 2021)
0.1.7 (May 14, 2021)
0.1.6 (May 11, 2021)
0.1.5 (May 4, 2021)
0.1.4 (Apr 20, 2021)
0.1.2 (Apr 5, 2021)
0.1.1 (Mar 28, 2021)
0.1.0 (Mar 27, 2021)
0.0.45 (Mar 27, 2021)
0.0.44 (Mar 27, 2021)
0.0.43 (Mar 27, 2021)
0.0.42 (Mar 27, 2021)
0.0.41 (Mar 27, 2021)
0.0.40 (Mar 27, 2021)
0.0.39 (Mar 27, 2021)
0.0.38 (Mar 27, 2021)
0.0.37 (Mar 27, 2021)
0.0.36 (Mar 27, 2021)
0.0.35 (Mar 24, 2021)

Owner

Phil Wang

Working with Attention. It's all we need.

