A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Last update: Nov 05, 2022

Related tags

Overview

Feature Forge

This library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).

Most machine learning problems involve an step of feature definition and preprocessing. Feature Forge helps you with:

Defining and documenting features
Testing your features against specified cases and against randomly generated cases (stress-testing). This helps you making your application more robust against invalid/misformatted input data. This also helps you checking that low-relevance results when doing feature analysis is actually because the feature is bad, and not because there's a slight bug in your feature code.
Evaluating your features on a data set, producing a feature evaluation matrix. The evaluator has a robust mode that allows you some tolerance both for invalid data and buggy features.
Experimentation: running, registering, classifying and reproducing experiments for determining best settings for your problems.

Installation

Just pip install featureforge.

Documentation

Documentation is available at http://feature-forge.readthedocs.org/en/latest/

Contact information

Javier Mansilla <[email protected]> (jmansilla at github)
Daniel Moisset <[email protected]> (dmoisset at github)
Rafael Carrascosa <[email protected]> (rafacarrascosa at github)

Any contributions or suggestions are welcome, the official channel for this is submitting github pull requests or issues.

Changelog

0.1.7:

StatsManager api change (order of arguments swapped)
For experimentation, enabled a way of booking experiments forever.

0.1.6:

Bug fixes related to sparse matrices.
Small documentation improvements.
Reduced default logging verbosity.

0.1.5:

Using sparse numpy matrices by default.

0.1.4:

Discarded the need of using forked version of Schema library.

0.1.3:

Added support for running and generating stats for experiments

0.1.2:

Fixing installer dependencies

0.1.1:

Added support for python 3
Added support for bag-of-words features

0.1:

Initial release

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Related tags

Overview

Feature Forge

Installation

Documentation

Contact information

Changelog

Owner

Machinalis

PyoMyo - Python Opensource Myo library

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

🐦 Quickly annotate data from the comfort of your Jupyter notebook

Contrastive Loss Gradient Attack (CLGA)

[ACMMM 2021, Oral] Code release for "Elastic Tactile Simulation Towards Tactile-Visual Perception"

Inference code for "StylePeople: A Generative Model of Fullbody Human Avatars" paper. This code is for the part of the paper describing video-based avatars.

Extreme Dynamic Classifier Chains - XGBoost for Multi-label Classification

ICLR 2021, Fair Mixup: Fairness via Interpolation

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

NeRF Meta-Learning with PyTorch

Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.

PyTorch implementation of the paper: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

Codebase for Attentive Neural Hawkes Process (A-NHP) and Attentive Neural Datalog Through Time (A-NDTT)

The repo of Feedback Networks, CVPR17

Using pretrained GROVER to extract the atomic fingerprints from molecule

A deep-learning pipeline for segmentation of ambiguous microscopic images.

Code for our ALiBi method for transformer language models.