slim-python is a package to learn customized scoring systems for decision-making problems.

Last update: Nov 02, 2022

Related tags

Overview

slim-python is a package to learn customized scoring systems for decision-making problems.

These are simple decision aids that let users make yes-no predictions by adding and subtracting a few small numbers.

SLIM is designed to learn the most accurate scoring system for a given dataset and set of constraints. These models are produced by solving a hard optimization problem that directly optimizes for accuracy, sparsity, and customized constraints (e.g., hard limits on model size, TPR, FPR).

Requirements

slim-python was developed using Python 2.7.11 and CPLEX 12.6.2.

CPLEX

CPLEX is cross-platform commercial optimization tool with a Pytho API. It is freely available to students and faculty members at accredited institutions as part of the IBM Academic Initiative. To get CPLEX:

Join the IBM Academic Initiative. Note that it may take up to a week to obtain approval.
Download IBM ILOG CPLEX Optimization Studio V12.6.1 (or higher) from the software catalog
Install the file on your computer. Note mac/unix users will need to install a .bin file.
Setup the CPLEX Python modules as described here here.

Please check the CPLEX user manual or the CPLEX forums if you have problems installing CPLEX.

Citation

If you use SLIM for academic research, please cite our paper!

@article{
    ustun2015slim,
    year = {2015},
    issn = {0885-6125},
    journal = {Machine Learning},
    doi = {10.1007/s10994-015-5528-6},
    title = {Supersparse linear integer models for optimized medical scoring systems},
    url = {http://dx.doi.org/10.1007/s10994-015-5528-6},
    publisher = { Springer US},
    author = {Ustun, Berk and Rudin, Cynthia},
    pages = {1-43},
    language = {English}
}

slim-python is a package to learn customized scoring systems for decision-making problems.

Related tags

Overview

Requirements

CPLEX

Citation

Owner

Berk Ustun

Python package for stacking (machine learning technique)

scikit-learn models hyperparameters tuning and feature selection, using evolutionary algorithms.

Create large-scale ML-driven multiscale simulation ensembles to study the interactions

PROTEIN EXPRESSION ANALYSIS FOR DOWN SYNDROME

Random Forest Classification for Neural Subtypes

Probabilistic programming framework that facilitates objective model selection for time-varying parameter models.

Predict the income for each percentile of the population (Python) - FRENCH

A Python package to preprocess time series

Machine learning algorithms implementation

Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores

Provide an input CSV and a target field to predict, generate a model + code to run it.

Dragonfly is an open source python library for scalable Bayesian optimisation.

Free MLOps course from DataTalks.Club

Can a machine learning project be implemented to estimate the salaries of baseball players whose salary information and career statistics for 1986 are shared?

Banpei is a Python package of the anomaly detection.

GAM timeseries modeling with auto-changepoint detection. Inspired by Facebook Prophet and implemented in PyMC3

Microsoft contributing libraries, tools, recipes, sample codes and workshop contents for machine learning & deep learning.

Distributed scikit-learn meta-estimators in PySpark

A Python toolkit for rule-based/unsupervised anomaly detection in time series

Responsible Machine Learning with Python