Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.

Last update: Dec 31, 2022

Overview

sklearn-evaluation

Supports Python 3.6 and higher.

Documentation here.

Install

pip install sklearn-evaluation

Features

Plotting (confusion matrix, feature importances, precision-recall, ROC)
Report generation (example)
Evaluate grid search results
Track experiments using a local SQLite database
Analyze notebook output

Comments

Quickstart clustering

Adds quick start for clustering. Note that I had to make some changes to the tests and the elbow curve implementation since I found minor issues: hardcoded figure size, missing n_clusters in the title and hardcoded random seed.

opened by edublancas 10
new ROC api added to plot
Describe your changes

New ROC API (inherits from Plot)

plot.ROC.__add__ added for generating overlapping curves

The old roc API is still supported

Issue ticket number and link

Closes #84

Checklist before requesting a review

[x] I have performed a self-review of my code

[x] I have added thorough tests (when necessary).

[x] I have added the right documentation (when needed). Product update? If yes, write one line about this update.
opened by yafimvo 8
minor changes to silhouette_plot

I was going to release a new version with the silhouette_plot @neelasha23 but noticed a few things.

Our convention is not to include the word plot in the function names (since they're all in the plot module). Can you rename them?

silhouette_plot -> silhouette
silhouette_plot_from_results -> silhouette_from_results

Also, please include 0.8.3 as the version when these plots became available, in case anyone is using an older version. This way they'll know they have to update. You can add a .. versionadded:: in a Notes section in the plot's docstring.

https://www.sphinx-doc.org/en/master/usage/restructuredtext/directives.html#directive-versionadded
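
As a rough illustration of the request above, a docstring with a Notes section carrying the directive could look like the following. The function body and its parameters are placeholders made up for this sketch; only the renamed function name and the 0.8.3 version come from the comment.

```python
def silhouette(X, n_clusters=None, ax=None):
    """Plot a silhouette analysis (illustrative stub, hypothetical signature).

    Notes
    -----
    .. versionadded:: 0.8.3
    """
    # Placeholder body: the real plotting logic lives in the library.
    raise NotImplementedError("illustrative stub only")


# Sphinx reads the directive from the docstring when building the docs.
print(silhouette.__doc__)
```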

FYI: @idomic

opened by edublancas 7
Inconsistency in image comparison

The results of matplotlib's @image_comparison are a bit inconsistent sometimes (behaving differently in local vs CI). Maybe we can aim to build a custom utility for comparing images from plots.
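
One possible shape for such a utility, sketched under the assumption that both images are already loaded as NumPy arrays of pixel values (0-255). The function name `images_match` and the default tolerance are hypothetical, not part of the library or of matplotlib.

```python
import numpy as np


def images_match(expected, actual, tol=2.0):
    """Return True when both images share a shape and their root-mean-square
    pixel difference is at most `tol` (on the 0-255 scale)."""
    expected = np.asarray(expected, dtype=float)
    actual = np.asarray(actual, dtype=float)
    if expected.shape != actual.shape:
        return False
    rms = float(np.sqrt(np.mean((expected - actual) ** 2)))
    return rms <= tol


baseline = np.zeros((4, 4, 3), dtype=np.uint8)
slightly_off = baseline.copy()
slightly_off[0, 0] = 1          # tiny rendering difference in one pixel
very_off = baseline + 100       # clearly different image

print(images_match(baseline, slightly_off))
print(images_match(baseline, very_off))
```

A tolerance-based check like this trades strictness for stability: small anti-aliasing differences between local and CI environments pass, while real regressions still fail.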

opened by neelasha23 7
Bug: Missing colab flag

On some of the stats calls, the colab flag is missing. This makes it difficult to tell how many users are actually running in Colab versus plain Docker.

opened by idomic 6

docs broken

looks like @neelasha23's last PR broke the documentation because of a change in sklearn:

PapermillExecutionError: 
---------------------------------------------------------------------------
Exception encountered at "In [1]":
---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
Cell In[1], line 3
      1 import importlib
----> 3 from sklearn.datasets import load_boston
      4 from sklearn.model_selection import train_test_split
      5 from sklearn import metrics

File ~/checkouts/readthedocs.org/user_builds/sklearn-evaluation/conda/latest/lib/python3.8/site-packages/sklearn/datasets/__init__.py:156, in __getattr__(name)
    105 if name == "load_boston":
    106     msg = textwrap.dedent(
    107         """
    108         `load_boston` has been removed from scikit-learn since version 1.2.
   (...)
    154         """
    155     )
--> 156     raise ImportError(msg)
    157 try:
    158     return globals()[name]

ImportError: 
`load_boston` has been removed from scikit-learn since version 1.2.

FYI @idomic

opened by edublancas 5

Installing sklearn_evaluation

I used "pip install sklearn-evaluation" to install this library in Anaconda. All requirements exist, but the package does not install. When I try to import it, the library is not found. When I run the pip command, it neither proceeds with the installation nor installs anything.

opened by AminShah69 5
Incompatibility with sklearn 0.20.0

Hi. I was trying to use this package with the latest version of scikit-learn (0.20.0), but I did not understand how to do it. In particular, I was trying to use

from sklearn_evaluation import plot
plot.grid_search(gridCV.grid_scores_, change=change, kind='bar')

but the member grid_scores_ does not exist any more (present till scikit 0.17) and has been substituted by cv_results_, which returns an object of different data type with respect to the former member. Is there an easy way to go on using this function by using the new cv_results_ in place of grid_scores_? Thank you.
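
To make the difference between the two attributes concrete, here is a small sketch that reshapes a `cv_results_`-style dictionary into per-candidate records similar to the old `grid_scores_` entries. The keys `params`, `mean_test_score` and `std_test_score` are the ones scikit-learn uses in `cv_results_`; the `GridScore` tuple, the `to_grid_scores` helper and the sample numbers are made up for illustration. Whether `plot.grid_search` accepts `cv_results_` directly depends on the installed sklearn-evaluation version, so check its documentation first.

```python
from collections import namedtuple

# Hypothetical record type for this sketch (one entry per parameter combination).
GridScore = namedtuple("GridScore", ["parameters", "mean_validation_score", "std"])


def to_grid_scores(cv_results):
    """Turn a cv_results_-style dict of parallel arrays into a list of records."""
    return [
        GridScore(params, mean, std)
        for params, mean, std in zip(
            cv_results["params"],
            cv_results["mean_test_score"],
            cv_results["std_test_score"],
        )
    ]


# Hand-written stand-in for GridSearchCV(...).cv_results_
cv_results = {
    "params": [{"n_estimators": 10}, {"n_estimators": 50}],
    "mean_test_score": [0.81, 0.86],
    "std_test_score": [0.03, 0.02],
}

scores = to_grid_scores(cv_results)
for score in scores:
    print(score.parameters, score.mean_validation_score, score.std)
```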

opened by mfaggin 5
refactor plots for better integration with tracker
In sklearn-evaluation 0.8.2, I introduced two new methods to the SQL experiment tracker: log_confusion_matrix and log_classification_report. These two methods allow users to store plots in the SQLite database and retrieve them later.

However, unlike previous versions, we're not storing the actual plot in the database, but the statistics we need to re-create the plot. For example, to re-create a confusion matrix, we can store the numbers on each quadrant. The benefit of this approach is that we can serialize and unserialize the plots as objects and allow the user to combine them for better comparison. See this example.
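
The idea of storing statistics rather than rendered plots can be sketched with the standard library alone. The table layout and helper below are assumptions made for this example and do not reflect the tracker's actual schema or API.

```python
import json
import sqlite3
from collections import Counter


def confusion_counts(y_true, y_pred, labels):
    """Count (true, predicted) pairs; rows are true labels, columns predictions."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]


y_true = [0, 1, 1, 0, 1]
y_pred = [0, 1, 0, 0, 1]
matrix = confusion_counts(y_true, y_pred, labels=[0, 1])

# Hypothetical table: one JSON blob of statistics per stored plot.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE plots (name TEXT, data TEXT)")
conn.execute(
    "INSERT INTO plots VALUES (?, ?)", ("confusion_matrix", json.dumps(matrix))
)

# Later: load the numbers back and re-create (or combine) the plot from them.
row = conn.execute(
    "SELECT data FROM plots WHERE name = ?", ("confusion_matrix",)
).fetchone()
restored = json.loads(row[0])
print(restored)
```

Because only the numbers are stored, two retrieved matrices can be compared or combined element by element, which would not be possible with a saved image.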

Enabling this involves several changes in the plotting code since we need to split the part that computes the statistics to display from the code that generates the plot, and this has to be performed for each plot (so far, only confusion matrix and classification report have been refactored)

The purpose of this issue is to start refactoring other popular plots. We still need to support the old API (e.g., plot.confusion_matrix), but it should use the object-oriented API under the hood (e.g., plot.ConfusionMatrix)

The next one we can implement is the ROC curve. All classes should behave similarly; here are some pointers:

The class constructor should take the data needed to generate the plot (fpr and tpr as returned by roc_curve).

No need to implement __sub__, since it is not applicable to ROC. Just raise a NotImplementedError with an appropriate error message.

__add__ should create a new plot with overlapping ROC curves. This translates into users being able to do roc1 + roc2 to generate the overlapping plot.

The _get_data method should return the data needed to re-create the plot (example).

The from_dump class method should re-create a plot from a dumped JSON file (note that the dump method is implemented in the parent class).
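
The pointers above could be sketched roughly as follows. This is a hypothetical outline, not the library's implementation: the `ROCAdd` container name is invented, plotting is left out, and `dump` is defined directly on the class here only to keep the example self-contained (the issue says it lives in the parent class).

```python
import json
import os
import tempfile


class ROC:
    """Hypothetical sketch of an object-oriented ROC plot."""

    def __init__(self, fpr, tpr, label=None):
        # Data as returned by sklearn.metrics.roc_curve (thresholds omitted).
        self.fpr = list(fpr)
        self.tpr = list(tpr)
        self.label = label

    def _get_data(self):
        # Everything needed to re-create the plot later.
        return {"fpr": self.fpr, "tpr": self.tpr, "label": self.label}

    def dump(self, path):
        with open(path, "w") as f:
            json.dump(self._get_data(), f)

    @classmethod
    def from_dump(cls, path):
        with open(path) as f:
            return cls(**json.load(f))

    def __add__(self, other):
        # roc1 + roc2 yields an object holding both curves for an overlay.
        return ROCAdd([self, other])

    def __sub__(self, other):
        raise NotImplementedError("Subtraction is not applicable to ROC curves")


class ROCAdd:
    """Hypothetical container for overlapping ROC curves."""

    def __init__(self, curves):
        self.curves = list(curves)


roc1 = ROC([0.0, 0.2, 1.0], [0.0, 0.7, 1.0], label="tree")
roc2 = ROC([0.0, 0.1, 1.0], [0.0, 0.9, 1.0], label="forest")
combined = roc1 + roc2

path = os.path.join(tempfile.mkdtemp(), "roc.json")
roc1.dump(path)
restored = ROC.from_dump(path)
print([curve.label for curve in combined.curves], restored.label)
```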
opened by edublancas 4

Describe your changes

The SKLearnEvaluationLogger decorator wraps the telemetry log_api functionality and generates logs for sklearn-evaluation as follows:

@SKLearnEvaluationLogger.log(feature='plot')
def confusion_matrix(
        y_true,
        y_pred,
        target_names=None,
        normalize=False,
        cmap=None,
        ax=None,
        **kwargs):
    pass

this will generate the following log:

        {
          "metadata": {
            "action": "confusion_matrix",
            "feature": "plot",
            "args": {
              "target_names": "None",
              "normalize": "False",
              "cmap": "None",
              "ax": "None"
            }
          }
        }

Note: since y_true and y_pred are positional arguments without default values, they are not logged.

we can also use pre-defined flags when calling a function

        return plot.confusion_matrix(self.y_true, self.y_pred, self.target_names, ax=_gen_ax())

which will generate the following log:

        "metadata": {
            "action": "confusion_matrix",
            "feature": "plot",
            "args": {
                "target_names": "['setosa', 'versicolor', 'virginica']",
                "normalize": "False",
                "cmap": "None",
                "ax": "AxesSubplot(0.125,0.11;0.775x0.77)"
            }
        },
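
A minimal sketch of how a decorator could produce logs of this shape, assuming it records only parameters that declare a default value and stringifies whatever was passed. The `log_call` name and the in-memory `sink` list are hypothetical stand-ins; the real SKLearnEvaluationLogger sends events through the telemetry package.

```python
import functools
import inspect


def log_call(feature, sink):
    """Hypothetical decorator: append one log entry to `sink` per call."""

    def decorator(func):
        signature = inspect.signature(func)

        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            bound = signature.bind(*args, **kwargs)
            bound.apply_defaults()
            # Keep only parameters that declare a default value, so required
            # positional arguments such as y_true and y_pred are skipped.
            logged = {
                name: str(value)
                for name, value in bound.arguments.items()
                if signature.parameters[name].default is not inspect.Parameter.empty
            }
            sink.append(
                {"metadata": {"action": func.__name__,
                              "feature": feature,
                              "args": logged}}
            )
            return func(*args, **kwargs)

        return wrapper

    return decorator


events = []


@log_call(feature="plot", sink=events)
def confusion_matrix(y_true, y_pred, target_names=None, normalize=False,
                     cmap=None, ax=None, **kwargs):
    pass


# target_names is passed positionally but still logged, since it has a default.
confusion_matrix([0, 1], [0, 1], ["setosa", "versicolor"])
print(events[0])
```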

Queries

Run queries and filter sklearn-evaluation events by the event name: sklearn-evaluation
Break these events down by feature ('plot', 'report', 'SQLiteTracker', 'NotebookCollection')
Break events down by action (e.g., 'confusion_matrix', 'roc', etc.) and/or flags ('is_report')

Errors

Failing runs will be named: sklearn-evaluation-error

Checklist before requesting a review

[X] I have performed a self-review of my code
[X] I have added thorough tests (when necessary).
[] I have added the right documentation (when needed). Product update? If yes, write one line about this update.

opened by yafimvo 4

GridSearch heatmap for 'None' parameter

When I try to generate a heatmap for GridSearchCV results and one of the parameter values is None, it raises an error: TypeError: '<' not supported between instances of 'NoneType' and 'int'

For example, the parameter can be max_depth_for_decision_trees = [3, 5, 10, None].

Is there any workaround for this?
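
The error message suggests that parameter values are being sorted somewhere, which fails in Python 3 when None is mixed with integers. That is an assumption about the cause, not something confirmed here. A generic way to sort such values, placing None last, is sketched below; `sort_with_none` is a hypothetical helper, not a library function.

```python
def sort_with_none(values):
    """Sort values in ascending order, placing None at the end."""
    # The key is a tuple: the first element pushes None last, the second
    # orders the remaining values (0 is a dummy that is never compared
    # against a real value, because the first elements already differ).
    return sorted(values, key=lambda v: (v is None, 0 if v is None else v))


max_depth = [10, None, 3, 5]
ordered = sort_with_none(max_depth)

# String labels are safe to use as heatmap tick labels.
labels = [str(v) for v in ordered]
print(ordered, labels)
```

Another workaround on the user side, if sorting cannot be changed, is to replace None with a string such as 'None' in a copy of the results before plotting.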

opened by shrsulav 4
doc intro is empty

our intro page is empty: https://sklearn-evaluation.ploomber.io/en/latest/intro.html

we should briefly describe the features in the library (possibly with some short examples) and add links to our quick starts

opened by edublancas 1
ConfusionMatrix fix.

Addresses #145

Restructured ConfusionMatrix class to include a plot method that plots data and axes to a matplotlib figure and returns a ConfusionMatrix class object. An object is returned so as to not break the addition and subtraction functions in the class. The figure is a matplotlib object and can be resized using matplotlib methods. The figure is accessed by the figure attribute of the class instance.

Example:

tree_cm = plot.ConfusionMatrix.from_raw_data(y_test, tree_pred, normalize=False)  # Creates a ConfusionMatrix class instance
tree_cm.figure.set_size_inches(5, 5)  # Resizes the figure to 5 by 5 inches
tree_cm.figure  # Outputs the figure contained in the class instance

opened by digithed 1
documenting alternatives to elbow curve

I came across this paper, which suggests that the elbow method isn't the best for choosing the number of clusters. We should give it a read, look for other sources and incorporate some of this advice in our elbow curve documentation. We could implement the alternatives.

opened by edublancas 0
Prediction error plot - issue in logic

The prediction error piece has this logic: model.fit(y_reshaped, y_pred). This looks incorrect. It's trying to fit 2 sets of y values whereas it should fit (X,y). Need to understand why this statement is here and rectify accordingly.
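
One possible reading, offered as an assumption rather than a confirmed explanation: regressing y_pred on the actual y values yields the best-fit line through the actual-versus-predicted scatter, which prediction error plots often draw next to the identity line. The sketch below computes that line with plain least squares to show what such a fit produces; `fit_line` and the sample data are made up for illustration.

```python
def fit_line(x, y):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.5, 3.5, 4.5]   # predictions biased upward by 0.5

# Regress predictions on actual values (the analogue of fit(y, y_pred)).
slope, intercept = fit_line(y_true, y_pred)
print(slope, intercept)
```

A perfect model would give a slope of 1 and an intercept of 0 (the identity line); the gap between the two lines visualizes systematic prediction error. If this is the intent, a comment in the code would make it clear.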

opened by neelasha23 0

Releases(0.5.6)

0.5.6 (Jun 26, 2021)

0.5.5 (Mar 28, 2021)

0.5.4 (Dec 28, 2020)

0.5.3 (Dec 15, 2020)

0.5.2 (Oct 2, 2020)

Adds SQLiteTracker for tracking ML experiments using a SQLite backend
Adds NotebookIntrospector [Experimental]
Migrates tests to nox
Adds DataSelector
Enables testing with Python 3.8

0.5 (May 3, 2019)

Adds new API for reports

0.4 (Dec 30, 2016)

Adds plot.validation_curve
Adds plot.learning_curve

0.3 (Jun 4, 2016)

Adds plot to evaluate results from sklearn grid search
Improves compatibility with Python 3

Owner

Eduardo Blancas

Developing tools for reproducible Data Science.

GitHub Repository https://sklearn-evaluation.readthedocs.io

scikit-learn is a python module for machine learning built on top of numpy / scipy

About scikit-learn is a python module for machine learning built on top of numpy / scipy. The purpose of the scikit-learn-tutorial subproject is to le

122 Dec 12, 2022

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Horovod Horovod is a distributed deep learning training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. The goal of Horovod is to make dis

12.9k Jan 07, 2023

High performance Python GLMs with all the features!

200 Dec 14, 2022

A repository to work on Machine Learning course. Select an algorithm to classify writer's gender, of Hebrew texts.

MachineLearning A repository to work on Machine Learning course. Select an algorithm to classify writer's gender, of Hebrew texts. Tested algorithms:

1 Feb 01, 2022

Banpei is a Python package of the anomaly detection.

Banpei Banpei is a Python package of the anomaly detection. Anomaly detection is a technique used to identify unusual patterns that do not conform to

282 Jan 03, 2023

A python library for easy manipulation and forecasting of time series.

Time Series Made Easy in Python darts is a python library for easy manipulation and forecasting of time series. It contains a variety of models, from

5.2k Jan 04, 2023

Practical Time-Series Analysis, published by Packt

Practical Time-Series Analysis This is the code repository for Practical Time-Series Analysis, published by Packt. It contains all the supporting proj

325 Dec 23, 2022

Nevergrad - A gradient-free optimization platform

Nevergrad - A gradient-free optimization platform nevergrad is a Python 3.6+ library. It can be installed with: pip install nevergrad More installati

3.4k Jan 08, 2023

Python Research Framework

106 Dec 13, 2022

Finding a method to objectively quantify skill versus chance in games, using reinforcement learning

Skill-vs-chance-games-analysis - Finding a method to objectively quantify skill versus chance in games, using reinforcement learning

4 Nov 19, 2022

This is an auto-ML tool specialized in detecting of outliers

Auto-ML tool specialized in detecting of outliers Description This tool will allows you, with a Dash visualization, to compare 10 models of machine le

1 Nov 03, 2021

Free MLOps course from DataTalks.Club

MLOps Zoomcamp Our MLOps Zoomcamp course Sign up here: https://airtable.com/shrCb8y6eTbPKwSTL (it's not automated, you will not receive an email immed

4.6k Dec 31, 2022

Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.

python-is-cool A gentle guide to the Python features that I didn't know existed or was too afraid to use. This will be updated as I learn more and bec

3.3k Jan 05, 2023

SynapseML - an open source library to simplify the creation of scalable machine learning pipelines

Synapse Machine Learning SynapseML (previously MMLSpark) is an open source library to simplify the creation of scalable machine learning pipelines. Sy

3.9k Dec 30, 2022

A Streamlit demo to interactively visualize Uber pickups in New York City

Streamlit Demo: Uber Pickups in New York City A Streamlit demo written in pure Python to interactively visualize Uber pickups in New York City. View t

230 Dec 28, 2022

This repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

uber-pickups-analysis Data Source: https://www.kaggle.com/fivethirtyeight/uber-pickups-in-new-york-city Information about data set The dataset contain

1 Nov 03, 2021

A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

pmdarima Pmdarima (originally pyramid-arima, for the anagram of 'py' + 'arima') is a statistical library designed to fill the void in Python's time se

1.3k Dec 22, 2022

Flask app to predict daily radiation from the time series of Solcast from Islamabad, Pakistan

Solar-radiation-ISB-MLOps - Flask app to predict daily radiation from the time series of Solcast from Islamabad, Pakistan.

1 Dec 31, 2021

Continuously evaluated, functional, incremental, time-series forecasting

timemachines Autonomous, univariate, k-step ahead time-series forecasting functions assigned Elo ratings You can: Use some of the functionality of a s

343 Jan 04, 2023

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data. We demonstrate its use

26 Nov 29, 2022
