liquid_scikit_learn

Scikit-learn-style models that account for data drift and concept drift.

This Python library addresses data drift and concept drift in production settings, with the aim of reducing how often models need to be retrained. It is inspired by the neurons in octopus tentacles, which interact with and adapt to the environment directly, without going through the central nervous system. I designed the weights for these models in a similar way, so that they train on both the input and experience. Instead of computing the weights by minimizing a loss function, the models compute the derivatives of the weights (Hasani, Chen). The library also provides model expiration details at the feature level, which can help identify the features the model has a hard time adjusting to.

This library adapts concepts from Neural ODEs to scikit-learn. The models in this library calculate the derivatives of the weights rather than the weights themselves, which is what standard scikit-learn models calculate.
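One way to picture "calculating the derivative of the weights" is a simple Euler integration of an ODE for the weights of a logistic model. The sketch below is not this library's implementation. The form of the ODE (relaxation toward a base weight plus a term driven by the log-loss gradient), the function names `weight_derivative` and `euler_weight_step`, and the parameters `tau`, `lr` and `dt` are all assumptions made for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def log_loss(w, X, y):
    """Mean logistic loss of weights w on inputs X and binary labels y."""
    p = np.clip(sigmoid(X @ w), 1e-12, 1 - 1e-12)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def weight_derivative(w, w_base, X, y, tau=10.0, lr=0.5):
    """Hypothetical dw/dt: decay toward w_base with time constant tau,
    plus a push along the negative log-loss gradient on the current batch."""
    grad = X.T @ (sigmoid(X @ w) - y) / len(y)
    return -(w - w_base) / tau - lr * grad

def euler_weight_step(w, w_base, X, y, dt=1.0, **kwargs):
    """One explicit Euler step: w(t + dt) = w(t) + dt * dw/dt."""
    return w + dt * weight_derivative(w, w_base, X, y, **kwargs)

# Synthetic data for the demo (not from the library).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
true_w = np.array([1.5, -2.0, 0.5])
y = (rng.uniform(size=200) < sigmoid(X @ true_w)).astype(float)

w0 = np.zeros(3)                       # base weights
w1 = euler_weight_step(w0, w0, X, y)   # weights after one step
print(log_loss(w0, X, y), log_loss(w1, X, y))
```

In this sketch the quantity being modeled is the rate of change of each weight, so the weights keep evolving as new batches arrive instead of being fixed by a single fit.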

There are two training phases. The first is a standard scikit-learn model that provides predictions and a weight for each feature. In standard ML models, training data is typically sent in batches, and inference can run in real time or in batch. In the second training phase, input data is sent in semi-batches and the model adapts over time as data drift and concept drift change. Along with the changing weights, the second phase provides a decay rate for each weight, the contributions from data drift and concept drift, and model failure parameters.

For example, suppose the first training phase uses three months of data so that the model learns the patterns between the provided inputs and outputs. In the second phase, we send weekly batches of inputs and outputs so that the model adapts to changes in the data and the output, which typically shift with customer behavior. I plan to extend this library to unsupervised learning as well. Currently, liquid logistic regression is available with limited parameter optimization.
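The schedule in this example (one fit on about three months of history, then weekly updates) can be sketched with NumPy alone. This illustrates the data flow only. The update rule here is plain gradient descent on the logistic loss, used as a stand-in for the library's liquid update. The synthetic data, the helper names `make_batch` and `gradient_steps`, and the simulated drift are assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def make_batch(n, true_w):
    """Synthetic inputs and binary outputs for a given 'true' concept."""
    X = rng.normal(size=(n, len(true_w)))
    y = (rng.uniform(size=n) < sigmoid(X @ true_w)).astype(float)
    return X, y

def gradient_steps(w, X, y, lr=0.5, n_steps=50):
    """Plain gradient descent on the logistic loss (stand-in update rule)."""
    for _ in range(n_steps):
        w = w - lr * X.T @ (sigmoid(X @ w) - y) / len(y)
    return w

# Phase 1: fit once on roughly three months of history (90 days x 100 rows).
concept = np.array([2.0, -1.0])
X_hist, y_hist = make_batch(90 * 100, concept)
w = gradient_steps(np.zeros(2), X_hist, y_hist, n_steps=300)
w_phase1 = w.copy()

# Phase 2: weekly batches of inputs and outputs; the relationship between
# feature 0 and the output weakens a little every week (simulated concept drift).
weekly_weights = []
for week in range(8):
    concept = concept + np.array([-0.25, 0.0])
    X_week, y_week = make_batch(7 * 100, concept)
    w = gradient_steps(w, X_week, y_week)   # start from last week's weights
    weekly_weights.append(w.copy())

# How far each feature's weight has moved since phase 1.
drift_per_feature = np.abs(weekly_weights[-1] - w_phase1)
print(drift_per_feature)
```

Tracking the weights week by week gives a rough per-feature picture of where the model has had to move the most since the first phase.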

To use this library for now, clone the repository with git clone and provide the path to the cloned library.
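One common way to provide the path is to add the directory that contains the clone to `sys.path` before importing. The location below is hypothetical, and the sketch assumes the cloned folder is importable as the package `liquid_scikit_learn`, as the import statements in this README suggest.

```python
import sys
from pathlib import Path

# Hypothetical location: the directory that CONTAINS the cloned
# liquid_scikit_learn folder. Replace it with your own path.
clone_parent = Path.home() / "projects"

# Make the directory visible to Python's import system.
if str(clone_parent) not in sys.path:
    sys.path.insert(0, str(clone_parent))
```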

To use standard logistic regression

from liquid_scikit_learn.liquid_logistic_regression import logistic_regression

To use liquid logistic regression

from liquid_scikit_learn.liquid_logistic_regression import liquid_logistic_regression

To get model expiration details at a feature level

from liquid_scikit_learn.liquid_logistic_regression import model_failure
