A fast hierarchical dimensionality reduction algorithm.

Last update: Dec 12, 2022

h-NNE: Hierarchical Nearest Neighbor Embedding

h-NNE is a general-purpose dimensionality reduction algorithm in the same family as t-SNE and UMAP. It stands out for its speed, its simplicity, and the hierarchy of clusterings it produces as part of the projection process. The algorithm is inspired by the FINCH clustering algorithm. For more information on its structure, see our paper on arXiv:

M. Saquib Sarfraz*, Marios Koulakis*, Constantin Seibold, Rainer Stiefelhagen. Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction. CVPR 2022.

More details are available in the project documentation.
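To give a feel for the first-neighbor idea behind FINCH that the overview mentions, here is a minimal pure-Python sketch. It is an illustration under stated assumptions, not the h-NNE or FINCH implementation: the function name `first_neighbor_partition` is hypothetical, the nearest-neighbor search is brute force, and only a single level of grouping is computed. Each point is linked to its nearest neighbor, and the connected components of the resulting graph form the clusters.

```python
from math import dist


def first_neighbor_partition(points):
    """Group points by linking each one to its nearest neighbor.

    Hypothetical helper for illustration only. Returns one integer label
    per point; clusters are the connected components of the
    first-neighbor graph.
    """
    n = len(points)
    if n < 2:
        # Zero or one point: nothing to link.
        return [0] * n

    parent = list(range(n))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        # Closest other point, found by brute force (O(n^2) overall).
        nearest = min(
            (j for j in range(n) if j != i),
            key=lambda j: dist(points[i], points[j]),
        )
        # Merge the components of the point and its first neighbor.
        parent[find(i)] = find(nearest)

    # Relabel component roots as 0, 1, 2, ... in order of first appearance.
    labels = {}
    return [labels.setdefault(find(i), len(labels)) for i in range(n)]


points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
          (5.0, 5.0), (5.1, 5.0), (5.0, 5.2)]
labels = first_neighbor_partition(points)
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Repeating the same step on the cluster centroids would give progressively coarser levels, which is how a hierarchy of clusterings can arise. For the actual algorithm, refer to the papers cited in this README.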

Installation

The project is available on PyPI. To install it, run:

pip install hnne

How to use h-NNE

The HNNE class follows the scikit-learn estimator conventions and provides the usual methods, such as fit_transform and transform.

Simple projection example

import numpy as np
from hnne import HNNE

data = np.random.random(size=(1000, 256))

hnne = HNNE(dim=2)
projection = hnne.fit_transform(data)

Projecting on new points

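# `data` is the array from the previous example; `new_data` stands in
# for unseen points with the same number of features.
new_data = np.random.random(size=(100, 256))

hnne = HNNE()
projection = hnne.fit_transform(data)

new_data_projection = hnne.transform(new_data)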

Demos

The following demo notebooks are available:

Citation

If you use this project in your work, please cite the h-NNE paper:

@inproceedings{hnne,
  title={Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction},
  author={M. Saquib Sarfraz and Marios Koulakis and Constantin Seibold and Rainer Stiefelhagen},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2022}
}

If you use the clustering properties of the algorithm, please also cite:

 @inproceedings{finch,
   author    = {M. Saquib Sarfraz and Vivek Sharma and Rainer Stiefelhagen},
   title     = {Efficient Parameter-free Clustering Using First Neighbor Relations},
   booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
   pages = {8934--8943},
   year  = {2019}
}

Owner

Marios Koulakis
