cl;asification problem using classification models in supervised learning

Overview

wine-quality-predition---classification

cl;asification problem using classification models in supervised learning

Wine Quality Prediction Analysis - Classification

Dataset Information

The two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. These datasets can be viewed as classification or regression tasks. The classes are ordered and not balanced (e.g. there are munch more normal wines than excellent or poor ones). Outlier detection algorithms could be used to detect the few excellent or poor wines. Also, we are not sure if all input variables are relevant. So it could be interesting to test feature selection methods. Two datasets were combined and few values were randomly removed.

Attribute Information:

Input variables (based on physicochemical tests):
1 - fixed acidity
2 - volatile acidity
3 - citric acid
4 - residual sugar
5 - chlorides
6 - free sulfur dioxide
7 - total sulfur dioxide
8 - density
9 - pH
10 - sulphates
11 - alcohol
Output variable (based on sensory data):
12 - quality (score between 0 and 10)

Download link: https://www.kaggle.com/rajyellow46/wine-quality

Libraries

  • numpy
  • pandas
  • matplotlib
  • seaborn
  • scikit-learn

    Future Work

  • Handling missing values
  • Removing Outliers
  • Removing Attributes
  • Random oveerSampling

    Algorithms

  • Logistic Regression
  • Decision Tree
  • Random Forest
  • KNn
  • svm in SVC
  • Extra Tress
  • LightGBM

    Best Model Accuracy: 90.00 -> from Extra tree classifier

  • Owner
    Vineeth Reddy Gangula
    The guy who loves to explore the thrill of adventure!!! Follow the passion no matter what comes in the way... Rise up and push past your limits..!
    Vineeth Reddy Gangula
    The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

    Balloon Learning Environment Docs The Balloon Learning Environment (BLE) is a simulator for stratospheric balloons. It is designed as a benchmark envi

    Google 87 Dec 25, 2022
    This repository contains tutorials for the py4DSTEM Python package

    py4DSTEM Tutorials This repository contains tutorials for the py4DSTEM Python package. For more information about py4DSTEM, including installation ins

    11 Dec 23, 2022
    SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

    SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data Au

    14 Nov 28, 2022
    Memory-efficient optimum einsum using opt_einsum planning and PyTorch kernels.

    opt-einsum-torch There have been many implementations of Einstein's summation. numpy's numpy.einsum is the least efficient one as it only runs in sing

    Haoyan Huo 9 Nov 18, 2022
    An official implementation of "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation" (ICCV 2021) in PyTorch.

    Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation This is an official implementation of the paper "Exploiting a Joint

    CV Lab @ Yonsei University 35 Oct 26, 2022
    Supplementary code for TISMIR paper "Sliding-Window Pitch-Class Histograms as a Means of Modeling Musical Form"

    Sliding-Window Pitch-Class Histograms as a Means of Modeling Musical Form This is supplementary code for the TISMIR paper Sliding-Window Pitch-Class H

    1 Nov 27, 2021
    Bayesian dessert for Lasagne

    Gelato Bayesian dessert for Lasagne Recent results in Bayesian statistics for constructing robust neural networks have proved that it is one of the be

    Maxim Kochurov 84 May 11, 2020
    CL-Gym: Full-Featured PyTorch Library for Continual Learning

    CL-Gym: Full-Featured PyTorch Library for Continual Learning CL-Gym is a small yet very flexible library for continual learning research and developme

    Iman Mirzadeh 36 Dec 25, 2022
    Implementation of PersonaGPT Dialog Model

    PersonaGPT An open-domain conversational agent with many personalities PersonaGPT is an open-domain conversational agent cpable of decoding personaliz

    ILLIDAN Lab 42 Jan 01, 2023
    Python package for covariance matrices manipulation and Biosignal classification with application in Brain Computer interface

    pyRiemann pyRiemann is a python package for covariance matrices manipulation and classification through Riemannian geometry. The primary target is cla

    447 Jan 05, 2023
    Subpopulation detection in high-dimensional single-cell data

    PhenoGraph for Python3 PhenoGraph is a clustering method designed for high-dimensional single-cell data. It works by creating a graph ("network") repr

    Dana Pe'er Lab 42 Sep 05, 2022
    Target Propagation via Regularized Inversion

    Target Propagation via Regularized Inversion The present code implements an ideal formulation of target propagation using regularized inverses compute

    Vincent Roulet 0 Dec 02, 2021
    Minimal implementation and experiments of "No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging".

    No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging Minimal implementation and experiments of "No-Transaction Band N

    19 Jan 03, 2023
    PyTorch code for our ECCV 2020 paper "Single Image Super-Resolution via a Holistic Attention Network"

    HAN PyTorch code for our ECCV 2020 paper "Single Image Super-Resolution via a Holistic Attention Network" This repository is for HAN introduced in the

    五维空间 140 Nov 23, 2022
    Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

    Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs MATLAB implementation of the paper: P. Mercado, F. Tudisco, and M. Hein,

    Pedro Mercado 6 May 26, 2022
    An 16kHz implementation of HiFi-GAN for soft-vc.

    HiFi-GAN An 16kHz implementation of HiFi-GAN for soft-vc. Relevant links: Official HiFi-GAN repo HiFi-GAN paper Soft-VC repo Soft-VC paper Example Usa

    Benjamin van Niekerk 42 Dec 27, 2022
    Implementation of the federated dual coordinate descent (FedDCD) method.

    FedDCD.jl Implementation of the federated dual coordinate descent (FedDCD) method. Installation To install, just call Pkg.add("https://github.com/Zhen

    Zhenan Fan 6 Sep 21, 2022
    Rule-based Customer Segmentation

    Rule-based Customer Segmentation Business Problem A game company wants to create level-based new customer definitions (personas) by using some feature

    Cem Çaluk 2 Jan 03, 2022
    Denoising Diffusion Probabilistic Models

    Denoising Diffusion Probabilistic Models Jonathan Ho, Ajay Jain, Pieter Abbeel Paper: https://arxiv.org/abs/2006.11239 Website: https://hojonathanho.g

    Jonathan Ho 1.5k Jan 08, 2023
    A library of multi-agent reinforcement learning components and systems

    Mava: a research framework for distributed multi-agent reinforcement learning Table of Contents Overview Getting Started Supported Environments System

    InstaDeep Ltd 463 Dec 23, 2022