Deep generative models of 3D grids for structure-based drug discovery

Last update: Jan 03, 2023

Overview

What is liGAN?

liGAN is a research codebase for training and evaluating deep generative models for de novo drug design based on 3D atomic density grids. It is based on libmolgrid and the gnina fork of caffe.

VAE paper - 2 minute talk

CVAE paper - 15 minute talk

Dependencies

numpy
pandas
scikit-image
openbabel
rdkit
molgrid
torch
protobuf
gnina version of caffe

Usage

You can use the scripts download_data.sh and download_weights.sh to download the test data and weights that were evaluated in the above papers.

The script generate.py is used to generate atomic density grids and molecular structures from a trained generative model.

Its basic usage can be seen in the scripts generate_vae.sh:

LIG_FILE=$1 # e.g. data/molport/0/102906000_8.sdf

python3 generate.py \
  --data_model_file models/data_48_0.5_molport.model \
  --gen_model_file models/vae.model \
  --gen_weights_file weights/gen_e_0.1_1_disc_x_10_0.molportFULL_rand_.0.0_gen_iter_100000.caffemodel \
  --rec_file data/molport/10gs_rec.pdb \
  --lig_file $LIG_FILE \
  --out_prefix VAE \
  --n_samples 10 \
  --fit_atoms \
  --dkoes_make_mol \
  --output_sdf \
  --output_dx \
  --gpu

And generate_cvae.sh:

REC_FILE=$1 # e.g. data/crossdock2020/PARP1_HUMAN_775_1012_0/2rd6_A_rec.pdb
LIG_FILE=$2 # e.g. data/crossdock2020/PARP1_HUMAN_775_1012_0/2rd6_A_rec_2rd6_78p_lig_tt_min.sdf

python3 generate.py \
  --data_model_file models/data_48_0.5_crossdock.model \
  --gen_model_file models/cvae.model \
  --gen_weights_file weights/lessskip_crossdocked_increased_1.lowrmsd.0_gen_iter_1500000.caffemodel \
  --rec_file $REC_FILE \
  --lig_file $LIG_FILE \
  --out_prefix CVAE \
  --n_samples 10 \
  --fit_atoms \
  --dkoes_make_mol \
  --output_sdf \
  --output_dx \
  --gpu

Both scripts can be run from the root directory of the repository.

Deep generative models of 3D grids for structure-based drug discovery

Related tags

Overview

What is liGAN?

Dependencies

Usage

Owner

Matt Ragoza

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

Temporal Knowledge Graph Reasoning Triggered by Memories

Implementation of a Transformer, but completely in Triton

Key information extraction from invoice document with Graph Convolution Network

code from "Tensor decomposition of higher-order correlations by nonlinear Hebbian plasticity"

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

El-Gamal on Elliptic Curve (Python)

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks

Official repo for BMVC2021 paper ASFormer: Transformer for Action Segmentation

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

An implementation of Fastformer: Additive Attention Can Be All You Need in TensorFlow

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Largest list of models for Core ML (for iOS 11+)

NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.

GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

CT-Net: Channel Tensorization Network for Video Classification

ML From Scratch

FinEAS: Financial Embedding Analysis of Sentiment 📈

Urban mobility simulations with Python3, RLlib (Deep Reinforcement Learning) and Mesa (Agent-based modeling)

Fusion-in-Decoder Distilling Knowledge from Reader to Retriever for Question Answering