A library for end-to-end learning of embedding index and retrieval model

Last update: Dec 21, 2022

Related tags

Overview

Poeem

Poeem is a library for efficient approximate nearest neighbor (ANN) search, which has been widely adopted in industrial recommendation, advertising and search systems. Apart from other libraries, such as Faiss and ScaNN, which build embedding indexes with already learned embeddings, Poeem jointly learn the embedding index together with retrieval model in order to avoid the quantization distortion. Consequentially, Poeem is proved to outperform the previous methods significantly, as shown in our SIGIR paper. Poeem is written based on Tensorflow GPU version 1.15, and some of the core functionalities are written in C++, as custom TensorFlow ops. It is developed by JD.com Search.

For more details, check out our SIGIR 2021 paper here.

System Requirements

We only support Linux systems for now, e.g., CentOS and Ubuntu. Windows users might need to build the library from source.
Python 3.6 installation.
TensorFlow GPU version 1.15 (pip install tensorflow-gpu==1.15.0). Other TensorFlow versions are not tested.
CUDA toolkit 10.1, required by TensorFlow GPU 1.15.

Quick Start

Poeem aims at an almost drop-in utility for training and serving large scale embedding retrieval models. We try to make it easy to use as much as we can.

Install

Install poeem for most Linux system can be done easily with pip.

$ pip install poeem

Quick usage

As an extreme simple example, you can use Poeem simply by the following commands

>>> import tensorflow as tf, poeem
>>> hparams = poeem.embedding.PoeemHparam()
>>> poeem_indexing_layer = poeem.embedding.PoeemEmbed(64, hparams)
>>> emb = tf.random.normal([100, 64])  # original embedding before indexing layer
>>> emb_quantized, coarse_code, code, regularizer = poeem_indexing_layer.forward(emb)
>>> emb = emb - tf.stop_gradient(emb - emb_quantized)   # use this embedding for downstream computation
>>> with tf.Session() as sess:
>>>   sess.run(tf.global_variables_initializer())
>>>   sess.run(emb)

Tutorial

The above simple example, as a quick start, does not show how to build embedding index and how to serve it online. Experienced or advanced users who are interested in applying it in real-world or industrial system, can further read the tutorials.

Authors

The main authors of Poeem are:

Han Zhang wrote most Python models and conducted most of experiments.
Hongwei Shen wrote most of the C++ TensorFlow ops and managed the pip released package.
Yunjiang Jiang developed the rotation algorithm and wrote the related code.
Wen-Yun Yang initiated the Poeem project, wrote some of TensorFlow ops, integrated different parts and wrote the tutorials.

How to Cite

Reference to cite if you use Poeem in a research paper or in a real-world system

  @inproceeding{poeem_sigir21,
    title={Joint Learning of Deep Retrieval Model and Product Quantization based Embedding Index},
    author={Han Zhang, Hongwei Shen, Yiming Qiu, Yunjiang Jiang, Songlin Wang, Sulong Xu, Yun Xiao, Bo Long and Wen-Yun Yang},
    booktitle={The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval},
    pages={},
    year={2021}
}

License

MIT licensed

A library for end-to-end learning of embedding index and retrieval model

Related tags

Overview

Poeem

Content

System Requirements

Quick Start

Install

Quick usage

Tutorial

Authors

How to Cite

License

Owner

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

LewusBot - Twitch ChatBot built in python with twitchio library

ByT5: Towards a token-free future with pre-trained byte-to-byte models

NLP tool to extract emotional phrase from tweets 🤩

State-of-the-art NLP through transformer models in a modular design and consistent APIs.

Toward Model Interpretability in Medical NLP

Reading Wikipedia to Answer Open-Domain Questions

List of GSoC organisations with number of times they have been selected.

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

Sinkhorn Transformer - Practical implementation of Sparse Sinkhorn Attention

使用Mask LM预训练任务来预训练Bert模型。训练垂直领域语料的模型表征，提升下游任务的表现。

Officile code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning"

Poetry PEP 517 Build Backend & Core Utilities

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

✨Rubrix is a production-ready Python framework for exploring, annotating, and managing data in NLP projects.

Korean Simple Contrastive Learning of Sentence Embeddings using SKT KoBERT and kakaobrain KorNLU dataset

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x using fastT5.

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing

Tensorflow Implementation of A Generative Flow for Text-to-Speech via Monotonic Alignment Search