Project to deploy a machine learning model based on Titanic dataset from Kaggle

Last update: May 23, 2022

Related tags

Overview

kaggle_titanic_deploy

Project to deploy a machine learning model based on Titanic dataset from Kaggle

In this project we used the Titanic dataset from Kaggle to build a simple Machine Learning Model and wrap it into a python application and run a FastAPI service to make real time predictions.

Prediction Service

The prediction service works as following:

Run Locally

using docker:

docker build -t kaggle_titanic_deploy .
docker run -it -p 8000:8000 kaggle_titanic_deploy

without docker:

pipenv install
gunicorn predictor.api.app:app --worker-class uvicorn.workers.UvicornWorker --bind 0.0.0.0:8000

More info

Link to presentation on Python Brasil 2021: https://www.youtube.com/watch?v=2YAWmpVgn0g
Link to Medium post: TBD

Owner

Vivian Yamassaki

Apaixonada por trabalhar com Data Science, é co-embaixadora do WiDS (Women in Data Science) e co-organizadora do MIA (Mulheres em Inteligência Artificial).

GitHub Repository

Diabetes Prediction with Logistic Regression

Diabetes Prediction with Logistic Regression Exploratory Data Analysis Data Preprocessing Model & Prediction Model Evaluation Model Validation: Holdou

2 Oct 23, 2021

BudouX is the successor to Budou, the machine learning powered line break organizer tool.

BudouX Standalone. Small. Language-neutral. BudouX is the successor to Budou, the machine learning powered line break organizer tool. It is standalone

868 Jan 05, 2023

Upgini : data search library for your machine learning pipelines

Automated data search library for your machine learning pipelines → find & deliver relevant external data & features to boost ML accuracy :chart_with_upwards_trend:

175 Jan 08, 2023

Random Forest Classification for Neural Subtypes

Random Forest classifier for neural subtypes extracted from extracellular recordings from human brain organoids.

1 Jan 31, 2022

This is the code repository for Interpretable Machine Learning with Python, published by Packt.

Interpretable Machine Learning with Python, published by Packt

299 Jan 02, 2023

ML-powered Loan-Marketer Customer Filtering Engine

In Loan-Marketing business employees are required to call the user's to buy loans of several fields and in several magnitudes. If employees are calling everybody in the network it is also very length

13 Jul 02, 2022

Required for a machine learning pipeline data preprocessing and variable engineering script needs to be prepared

Feature-Engineering Required for a machine learning pipeline data preprocessing and variable engineering script needs to be prepared. When the dataset

5 Apr 21, 2022

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

English | 简体中文 AutoX是什么？ AutoX一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色: AutoX在多个kaggle数据集上，效果显著优于其他解决方案(见效果对比)。简单易用: AutoX的接口和sklearn类似，方便上手使用。

431 Dec 28, 2022

Microsoft contributing libraries, tools, recipes, sample codes and workshop contents for machine learning & deep learning.

366 Jan 03, 2023

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models. Solve a variety of tasks with pre-trained models or finetune them in

227 Dec 10, 2022

MIT-Machine Learning with Python–From Linear Models to Deep Learning

MIT-Machine Learning with Python–From Linear Models to Deep Learning | One of the 5 courses in MIT MicroMasters in Statistics & Data Science Welcome t

2 Aug 23, 2022

XManager: A framework for managing machine learning experiments 🧑‍🔬

XManager is a platform for packaging, running and keeping track of machine learning experiments. It currently enables one to launch experiments locally or on Google Cloud Platform (GCP). Interaction

620 Dec 27, 2022

Official code for HH-VAEM

HH-VAEM This repository contains the official Pytorch implementation of the Hierarchical Hamiltonian VAE for Mixed-type Data (HH-VAEM) model and the s

8 Nov 30, 2022

A Multipurpose Library for Synthetic Time Series Generation in Python

TimeSynth Multipurpose Library for Synthetic Time Series Please cite as: J. R. Maat, A. Malali, and P. Protopapas, “TimeSynth: A Multipurpose Library

278 Dec 26, 2022

A project based example of Data pipelines, ML workflow management, API endpoints and Monitoring.

MLOps template with examples for Data pipelines, ML workflow management, API development and Monitoring.

33 Dec 03, 2022

Python library which makes it possible to dynamically mask/anonymize data using JSON string or python dict rules in a PySpark environment.

pyspark-anonymizer Python library which makes it possible to dynamically mask/anonymize data using JSON string or python dict rules in a PySpark envir

6 Jun 30, 2022

A scikit-learn based module for multi-label et. al. classification

scikit-multilearn scikit-multilearn is a Python module capable of performing multi-label learning tasks. It is built on-top of various scientific Pyth

802 Jan 01, 2023

Code base of KU AIRS: SPARK Autonomous Vehicle Team

KU AIRS: SPARK Autonomous Vehicle Project Check this link for the blog post describing this project and the video of SPARK in simulation and on parkou

1 Nov 23, 2021

Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in the form of Jupyter Notebooks.

Databricks Certification Spark Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along

19 Dec 13, 2022

Python Machine Learning Jupyter Notebooks (ML website)

Python Machine Learning Jupyter Notebooks (ML website) Dr. Tirthajyoti Sarkar, Fremont, California (Please feel free to connect on LinkedIn here) Also

2.6k Jan 03, 2023

Project to deploy a machine learning model based on Titanic dataset from Kaggle

Related tags

Overview

kaggle_titanic_deploy

Prediction Service

Run Locally

More info

Owner

Vivian Yamassaki

Diabetes Prediction with Logistic Regression

BudouX is the successor to Budou, the machine learning powered line break organizer tool.

Upgini : data search library for your machine learning pipelines

Random Forest Classification for Neural Subtypes

This is the code repository for Interpretable Machine Learning with Python, published by Packt.

ML-powered Loan-Marketer Customer Filtering Engine

Required for a machine learning pipeline data preprocessing and variable engineering script needs to be prepared

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。 它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

Microsoft contributing libraries, tools, recipes, sample codes and workshop contents for machine learning & deep learning.

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

MIT-Machine Learning with Python–From Linear Models to Deep Learning

XManager: A framework for managing machine learning experiments 🧑‍🔬

Official code for HH-VAEM

A Multipurpose Library for Synthetic Time Series Generation in Python

A project based example of Data pipelines, ML workflow management, API endpoints and Monitoring.

Python library which makes it possible to dynamically mask/anonymize data using JSON string or python dict rules in a PySpark environment.

A scikit-learn based module for multi-label et. al. classification

Code base of KU AIRS: SPARK Autonomous Vehicle Team

Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in the form of Jupyter Notebooks.

Python Machine Learning Jupyter Notebooks (ML website)

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色、简单易用、通用、自动化、灵活。