Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning

Last update: Nov 10, 2022

Related tags

Overview

Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning

This repository provides an implementation of the paper Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning accepted at AISTATS 2022 as oral presentation. We propose a noise-reduced data valuation method, Beta Shapley, which is powerful at capturing the importance of data points.

Quick start

We provide a notebook using the Covertype dataset. It shows how to compute the Beta Shapley value and its application on several downstream ML tasks.

--> Beta Shapley can identify noisy samples by focusing marginal contributions on small cardinalities.

--> Beta Shapley on the CIFAR100 test dataset. Mislabeled data points have negative Beta Shapley values, meaning they actually harm the model performance. Beta Shapley can detect mislabeled points.

Files

betashap/ShapEngine.py: main class for computing Beta-Shapley.

betashap/data.py: handles loading and preprocessing datasets.

Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning

Related tags

Overview

Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning

Quick start

Files

Owner

Yongchan Kwon

Control-Robot-Arm-using-PS4-Controller - A Robotic Arm based on Raspberry Pi and Arduino that controlled by PS4 Controller

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

PyTorch deep learning projects made easy.

Like Dirt-Samples, but cleaned up

Dynamic Token Normalization Improves Vision Transformers

JORLDY an open-source Reinforcement Learning (RL) framework provided by KakaoEnterprise

Rainbow: Combining Improvements in Deep Reinforcement Learning

A library for uncertainty quantification based on PyTorch

This is a five-step framework for the development of intrusion detection systems (IDS) using machine learning (ML) considering model realization, and performance evaluation.

Official Pytorch implementation of "Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video", CVPR 2021

DeepVoxels is an object-specific, persistent 3D feature embedding.

Pytorch for Segmentation

Robust Lane Detection via Expanded Self Attention (WACV 2022)

Python interface for SmartRF Sniffer 2 Firmware

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

This is the official code of L2G, Unrolling and Recurrent Unrolling in Learning to Learn Graph Topologies.

Can we learn gradients by Hamiltonian Neural Networks?

Multiview 3D object detection on MultiviewC dataset through moft3d.

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

Eff video representation - Efficient video representation through neural fields