Milano is a tool for automating hyperparameter search for your models on a backend of your choice.

Last update: Dec 17, 2022

Overview

Milano

(This is a research project, not an official NVIDIA product.)

Documentation

https://nvidia.github.io/Milano

Milano (Machine learning autotuner and network optimizer) is a tool that enables machine learning researchers and practitioners to perform massive hyperparameter and architecture searches.

You can use it to:

Tune your model on a cloud backend of your choice
Benchmark Auto-ML algorithms (see how to add a new search algorithm)

Your script can use any framework of your choice (for example, TensorFlow, PyTorch, or Microsoft Cognitive Toolkit) or no framework at all. Milano requires only minimal changes to the arguments your script accepts on the command line and to what it prints to stdout.
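As a rough illustration of the kind of script this describes, here is a minimal sketch in Python. The flag names (--learning_rate, --hidden_size), the toy objective, and the "valid perplexity: ..." output line are all assumptions made for this example; they are not taken from Milano itself, so check the documentation for the exact format your configuration expects.

```python
import argparse


def parse_args(argv=None):
    """Read hyperparameters from the command line (flag names are hypothetical)."""
    parser = argparse.ArgumentParser(description="Toy tunable training script")
    parser.add_argument("--learning_rate", type=float, default=0.01)
    parser.add_argument("--hidden_size", type=int, default=128)
    # Unknown flags are ignored so the sketch tolerates extra arguments.
    args, _unknown = parser.parse_known_args(argv)
    return args


def train_and_evaluate(learning_rate, hidden_size):
    """Stand-in for real training: a toy objective that is lower when better."""
    return (learning_rate - 0.1) ** 2 + 1.0 / hidden_size


def format_result(value):
    """Build the single result line written to stdout (format is an assumption)."""
    return "valid perplexity: {:.6f}".format(value)


def main(argv=None):
    args = parse_args(argv)
    score = train_and_evaluate(args.learning_rate, args.hidden_size)
    # The tuner is assumed to read the benchmark value from this line.
    print(format_result(score))
    return score


if __name__ == "__main__":
    main()
```

Replacing train_and_evaluate with a real training loop leaves the command-line and stdout contract unchanged, which is the only part a tuner would need to rely on.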

Currently supported backends:

Azkaban - on a single multi-GPU machine or server with Azkaban installed
AWS - Amazon cloud using GPU instances
SLURM - any cluster that is running SLURM

Prerequisites

Linux
Python 3
Ensure you have Python version 3.5 or later with the packages listed in the requirements.txt file.
Backend with NVIDIA GPU

How to Get Started

Install all dependencies with the following command: pip install -r requirements.txt
Follow the mini-tutorial for a local machine or the mini-tutorial for AWS.

Visualize

We provide a script to convert the CSV file output into two kinds of graphs:

Graphs of each hyperparameter against the benchmark (e.g. valid perplexity)
Color graphs that show the relationship between any two hyperparameters and the benchmark

To run the script, use:

python3 visualize.py --file [the name of the results csv file] \
                     --n [the number of samples to visualize] \
                     --subplots [the number of subplots to show in a plot] \
                     --max [the max value of benchmark you care about]
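To inspect a results file before plotting it, a small standard-library sketch like the one below may help. It is not part of visualize.py: the column names in the sample data, the function name load_results, and the reading of --max as an upper cutoff and --n as a row limit are assumptions made for illustration only.

```python
import csv
import io


def load_results(csv_text, benchmark_column, max_value=None, n=None):
    """Parse results, drop rows above max_value, return the n best (lowest) rows."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        value = float(row[benchmark_column])
        if max_value is not None and value > max_value:
            continue  # ignore runs whose benchmark exceeds the cutoff
        rows.append({name: float(cell) for name, cell in row.items()})
    # Lower benchmark values are treated as better in this sketch.
    rows.sort(key=lambda r: r[benchmark_column])
    return rows[:n] if n is not None else rows


# Hypothetical results; a real file may use different column names.
sample = """learning_rate,hidden_size,valid perplexity
0.1,128,95.2
0.01,256,81.7
1.0,64,430.0
"""

best = load_results(sample, "valid perplexity", max_value=200.0, n=2)
for row in best:
    print(row)
```

With the sample data, the run with perplexity 430.0 is filtered out and the remaining two rows are returned in ascending order of the benchmark.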


Owner

NVIDIA Corporation
