Create large-scale ML-driven multiscale simulation ensembles to study the interactions

Overview

MuMMI RAS v0.1

Released: Nov 16, 2021

MuMMI RAS is the application component of the MuMMI framework developed to create large-scale ML-driven multiscale simulation ensembles to study the interactions of RAS proteins and RAS-RAF protein complexes with lipid plasma membranes.

MuMMI framework was developed as part of the Pilot2 project of the Joint Design of Advanced Computing Solutions for Cancer funded jointly by the Department of Energy (DOE) and the National Cancer Institute (NCI).

The Pilot 2 project focuses on developing multiscale simulation models for understanding the interactions of the lipid plasma membrane with the RAS and RAF proteins. The broad computational tool development aims of this pilot are:

  • Developing scalable multi-scale molecular dynamics code that will automatically switch between phase field, coarse-grained and all-atom simulations.
  • Developing scalable machine learning and predictive models of molecular simulations to:
    • identify and quantify states from simulations
    • identify events from simulations that can automatically signal change of resolution between phase field, coarse-grained and all-atom simulations
    • aggregate information from the multi-resolution simulations to efficiently feedback to/from machine learning tools
  • Integrate sparse information from experiments with simulation data

MuMMI RAS defines the specific functionalities needed for the various components and scales of a target multiscale simulation. The application components need to define the scales, how to read the corresponding data, how to perform ML-based selection, how to run the simulations, how to perform analysis, and how to perform feedback. This code uses several utilities made available through "MuMMI Core".

Publications

MuMMI framework is described in the following publications.

  1. Bhatia et al. Generalizable Coordination of Large Multiscale Ensembles: Challenges and Learnings at Scale. In Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC '21, Article No. 10, November 2021. doi:10.1145/3458817.3476210.

  2. Di Natale et al. A Massively Parallel Infrastructure for Adaptive Multiscale Simulations: Modeling RAS Initiation Pathway for Cancer. In Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC '19, Article No. 57, November 2019. doi:10.1145/3295500.3356197.
    Best Paper at SC 2019.

  3. Ingólfsson et al. Machine Learning-driven Multiscale Modeling Reveals Lipid-Dependent Dynamics of RAS Signaling Protein. Proceedings of the National Academy of Sciences (PNAS), accepted, 2021. preprint.

  4. Reciprocal Coupling of Coarse-Grained and All-Atom scales. In preparation.

Installation

git clone https://github.com/mummi-framework/mummi-ras
cd mummi-ras
pip3 install .

export MUMMI_ROOT=/path/to/outputs
export MUMMI_CORE=/path/to/core/repo
export MUMMI_APP=/path/to/app/repo
export MUMMI_RESOURCES=/path/to/resources
The installaton process as described above installs the MuMMI framework. The simulation codes (gridsim2d, ddcMD, AMBER, GROMACS) are not included and are to be installed separately.
Spack installation. We are also working towards releasing the option of installing MuMMI and its dependencies through spack.

Authors and Acknowledgements

MuMMI was developed at Lawrence Livermore National Laboratory, in collaboration with Los Alamos National Laboratory, Oak Ridge National Laboratory, and International Business Machines. A list of main contributors is given below.

  • LLNL: Harsh Bhatia, Francesco Di Natale, Helgi I Ingólfsson, Joseph Y Moon, Xiaohua Zhang, Joseph R Chavez, Fikret Aydin, Tomas Oppelstrup, Timothy S Carpenter, Shiv Sundaram (previously LLNL), Gautham Dharuman (previously LLNL), Dong H Ahn, Stephen Herbein, Tom Scogland, Peer-Timo Bremer, and James N Glosli.

  • LANL: Chris Neale and Cesar Lopez

  • ORNL: Chris Stanley

  • IBM: Sara K Schumacher

MuMMI was funded by the Pilot2 project led by Dr. Fred Streitz (DOE) and Dr. Dwight Nissley (NIH). We acknowledge contributions from the entire Pilot 2 team.

This work was performed under the auspices of the U.S. Department of Energy (DOE) by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344, Los Alamos National Laboratory (LANL) under Contract DE-AC5206NA25396, and Oak Ridge National Laboratory under Contract DE-AC05-00OR22725.

Contact: Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, CA 94550.

Contributing

Contributions may be made through pull requests and/or issues on github.

License

MuMMI RAS is distributed under the terms of the MIT License.

Livermore Release Number: LLNL-CODE-827655

Comments
  • Are the trajectories in your publications publicly available?

    Are the trajectories in your publications publicly available?

    Hi, Congrats on the success, and huge thanks for making it open source. I wonder whether the trajectories in your publications are publicly available. Or are there any demo trajectories?

    I am a Ph.D. student at KAUST, using computer graphics to build and visualize mesoscale biology models, such as SARS-CoV-2 and bacteriophage T4. If possible, I (and my colleagues) would like to perform (multiscale, multi-representation, multi-granularity) visualization research on the trajectories you generated.

    Many thanks, Roden

    opened by RodenLuo 2
  • `flux` vs `slurm`

    `flux` vs `slurm`

    Hi,

    As flux is mentioned in the dependencies, is it possible to reproduce MuMMI RAS on a cluster that only has slurm?

    Workflow dependencies (e.g., python, flux, dynim, keras, etc.)

    Quoted from: https://github.com/mummi-framework/mummi-ras/blob/main/INSTALL.md

    Many thanks, Roden

    opened by RodenLuo 0
  • gridsim2d availability

    gridsim2d availability

    Hi, I wonder if the following code is available or not.

    gridsim2d: to be released shortly

    Quoted from: https://github.com/mummi-framework/mummi-ras/blob/main/INSTALL.md

    Thanks, Roden

    opened by RodenLuo 0
  • Patch for gromacs availability

    Patch for gromacs availability

    Hi, I wonder if the following patch is available or not.

    Note that we have a patch for gromacs installation for customization. To be open-sourced soon.

    Quoted from: https://github.com/mummi-framework/mummi-ras/blob/main/INSTALL.md

    Thanks, Roden

    opened by RodenLuo 0
  • Small scale test data for local deployment

    Small scale test data for local deployment

    Hi, I'm interested in deploying MuMMI on the KAUST IBEX cluster. It is mentioned in the installation doc that there is a small set of test data. Is it now publicly available? If not, is it possible for me to somehow access it so that I can perform a test run?

    Many thanks, Roden

    Again on lassen and on summit, we have created a small set of test data, which can be used to launch MuMMI at small scales. This (and the larger dataset) will be made public through NCI website. Until then, we can make this data available upon request.

    opened by RodenLuo 1
Releases(v1.0.0)
Reggy - Regressions with arbitrarily complex regularization terms

reggy Regressions with arbitrarily complex regularization terms. Currently suppo

Kim 1 Jan 20, 2022
Open source time series library for Python

PyFlux PyFlux is an open source time series library for Python. The library has a good array of modern time series models, as well as a flexible array

Ross Taylor 2k Jan 02, 2023
A library of extension and helper modules for Python's data analysis and machine learning libraries.

Mlxtend (machine learning extensions) is a Python library of useful tools for the day-to-day data science tasks. Sebastian Raschka 2014-2021 Links Doc

Sebastian Raschka 4.2k Dec 29, 2022
customer churn prediction prevention in telecom industry using machine learning and survival analysis

Telco Customer Churn Prediction - Plotly Dash Application Description This dash application allows you to predict telco customer churn using machine l

Benaissa Mohamed Fayçal 3 Nov 20, 2021
Binary Classification Problem with Machine Learning

Binary Classification Problem with Machine Learning Solving Approach: 1) Ultimate Goal of the Assignment: This assignment is about solving a binary cl

Dinesh Mali 0 Jan 20, 2022
A Streamlit demo to interactively visualize Uber pickups in New York City

Streamlit Demo: Uber Pickups in New York City A Streamlit demo written in pure Python to interactively visualize Uber pickups in New York City. View t

Streamlit 230 Dec 28, 2022
BudouX is the successor to Budou, the machine learning powered line break organizer tool.

BudouX Standalone. Small. Language-neutral. BudouX is the successor to Budou, the machine learning powered line break organizer tool. It is standalone

Google 868 Jan 05, 2023
This is a Cricket Score Predictor that predicts the first innings score of a T20 Cricket match using Machine Learning

This is a Cricket Score Predictor that predicts the first innings score of a T20 Cricket match using Machine Learning. It is a Web Application.

Developer Junaid 3 Aug 04, 2022
NumPy-based implementation of a multilayer perceptron (MLP)

My own NumPy-based implementation of a multilayer perceptron (MLP). Several of its components can be tuned and played with, such as layer depth and size, hidden and output layer activation functions,

1 Feb 10, 2022
MICOM is a Python package for metabolic modeling of microbial communities

Welcome MICOM is a Python package for metabolic modeling of microbial communities currently developed in the Gibbons Lab at the Institute for Systems

57 Dec 21, 2022
learn python in 100 days, a simple step could be follow from beginner to master of every aspect of python programming and project also include side project which you can use as demo project for your personal portfolio

learn python in 100 days, a simple step could be follow from beginner to master of every aspect of python programming and project also include side project which you can use as demo project for your

BDFD 6 Nov 05, 2022
This repo implements a Topological SLAM: Deep Visual Odometry with Long Term Place Recognition (Loop Closure Detection)

This repo implements a topological SLAM system. Deep Visual Odometry (DF-VO) and Visual Place Recognition are combined to form the topological SLAM system.

Best of Australian Centre for Robotic Vision (ACRV) 32 Jun 23, 2022
A library to generate synthetic time series data by easy-to-use factors and generator

timeseries-generator This repository consists of a python packages that generates synthetic time series dataset in a generic way (under /timeseries_ge

Nike Inc. 87 Dec 20, 2022
AutoTabular automates machine learning tasks enabling you to easily achieve strong predictive performance in your applications.

AutoTabular automates machine learning tasks enabling you to easily achieve strong predictive performance in your applications. With just a few lines of code, you can train and deploy high-accuracy m

Robin 55 Dec 27, 2022
Library for machine learning stacking generalization.

stacked_generalization Implemented machine learning *stacking technic[1]* as handy library in Python. Feature weighted linear stacking is also availab

114 Jul 19, 2022
Spark development environment for k8s

Local Spark Dev Env with Docker Development environment for k8s. Using the spark-operator image to ensure it will be the same environment. Start conta

Otacilio Filho 18 Jan 04, 2022
A machine learning toolkit dedicated to time-series data

tslearn The machine learning toolkit for time series analysis in Python Section Description Installation Installing the dependencies and tslearn Getti

2.3k Dec 29, 2022
Napari sklearn decomposition

napari-sklearn-decomposition A simple plugin to use with napari This napari plug

1 Sep 01, 2022
fMRIprep Pipeline To Machine Learning

fMRIprep Pipeline To Machine Learning(Demo) 所有配置均在config.py文件下定义 前置环境(lilab) 各个节点均安装docker,并有fmripre的镜像 可以使用conda中的base环境(相应的第三份包之后更新) 1. fmriprep scr

Alien 3 Mar 08, 2022
Predicting job salaries from ads - a Kaggle competition

Predicting job salaries from ads - a Kaggle competition

Zygmunt Zając 57 Oct 23, 2020