RaceBERT -- A transformer-based model to predict race and ethnicity from names

Last update: Nov 02, 2022

Installation

pip install racebert

Using a virtual environment is highly recommended. You may also need to install PyTorch separately, following the instructions here: https://pytorch.org/get-started/locally/

Paper

Todo

Usage

RaceBERT predicts race (using the U.S. Census race categories) and ethnicity from names.

from racebert import RaceBERT

model = RaceBERT()

# To predict race
model.predict_race("Barack Obama")

>>> {"label": "nh_black", "score": 0.5196923613548279}

The race categories are:

Race	Label
Non-Hispanic White	nh_white
Hispanic	hispanic
Non-Hispanic Black	nh_black
Asian & Pacific Islander	api
American Indian & Alaskan Native	aian
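
When reporting results, it can help to map the short labels back to readable category names and to set aside low-confidence predictions. The following is a minimal sketch, not part of the racebert package: `RACE_LABELS`, `describe_race`, and the `min_score` threshold are hypothetical names introduced here, and the input is assumed to be a dict shaped like the prediction shown above.

```python
# Hypothetical helper, not part of racebert: maps label codes from the
# table above to readable names and filters by a caller-chosen threshold.
RACE_LABELS = {
    "nh_white": "Non-Hispanic White",
    "hispanic": "Hispanic",
    "nh_black": "Non-Hispanic Black",
    "api": "Asian & Pacific Islander",
    "aian": "American Indian & Alaskan Native",
}


def describe_race(prediction, min_score=0.0):
    """Return the readable category, or None if the score is below min_score."""
    if prediction["score"] < min_score:
        return None
    return RACE_LABELS[prediction["label"]]


example = {"label": "nh_black", "score": 0.5196923613548279}
print(describe_race(example))                 # Non-Hispanic Black
print(describe_race(example, min_score=0.8))  # None
```

A score near 0.5, as in the example, suggests the model is far from certain, so a threshold of this kind may be worth considering before using the labels downstream.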

# To predict ethnicity
model.predict_ethnicity("Arjun Gupta")

>>> {"label": "Asian,IndianSubContinent", "score": 0.9612812399864197}

The ethnicity categories are:

Ethnicity
GreaterEuropean,British
GreaterEuropean,WestEuropean,French
GreaterEuropean,WestEuropean,Italian
GreaterEuropean,WestEuropean,Hispanic
GreaterEuropean,Jewish
GreaterEuropean,EastEuropean
Asian,IndianSubContinent
Asian,GreaterEastAsian,Japanese
GreaterAfrican,Muslim
Asian,GreaterEastAsian,EastAsian
GreaterEuropean,WestEuropean,Nordic
GreaterEuropean,WestEuropean,Germanic
GreaterAfrican,Africans
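
The ethnicity labels appear to be comma-separated paths that run from a broad group to a more specific one. Assuming that reading is correct, a label can be split into its levels as sketched below. `split_ethnicity` is a hypothetical helper, not a racebert function.

```python
# Hypothetical helper, not part of racebert. Assumes the commas in an
# ethnicity label separate hierarchy levels, broadest first.
def split_ethnicity(label):
    """Split a comma-separated ethnicity label into its levels."""
    return label.split(",")


levels = split_ethnicity("GreaterEuropean,WestEuropean,Nordic")
print(levels[0])   # GreaterEuropean
print(levels[-1])  # Nordic
```

Grouping predictions by the first level gives a coarser breakdown when the fine-grained categories are too sparse.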

GPU

If you have a GPU, you can speed up the computation by specifying the CUDA device when you instantiate the model.

from racebert import RaceBERT

model = RaceBERT(device=0)

# predict race in batch
model.predict_race(["Barack Obama", "George Bush"])

>>>
[
        {"label": "nh_black", "score": 0.5196923613548279},
        {"label": "nh_white", "score": 0.8365859389305115}
]

# predict ethnicity in batch
model.predict_ethnicity(["Barack Obama", "George Bush"])
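
For very long name lists, one option is to feed the model in fixed-size chunks so that memory use stays bounded. The sketch below assumes only that the prediction method accepts a list of names and returns one result per name, as in the batch example above. `chunked`, `predict_in_chunks`, and `chunk_size` are hypothetical names, and a stub stands in for the model so the snippet runs without downloading anything.

```python
# Hypothetical helpers, not part of racebert.
def chunked(items, size):
    """Yield consecutive slices of items with at most `size` elements each."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def predict_in_chunks(predict, names, chunk_size=256):
    """Call predict on each chunk of names and concatenate the results."""
    results = []
    for chunk in chunked(names, chunk_size):
        results.extend(predict(chunk))
    return results


# Stub standing in for model.predict_race, for demonstration only.
def fake_predict(names):
    return [{"label": "unknown", "score": 0.0} for _ in names]


names = ["Barack Obama", "George Bush", "Arjun Gupta"]
print(len(predict_in_chunks(fake_predict, names, chunk_size=2)))  # 3
```

With a real model, the call would be `predict_in_chunks(model.predict_race, names)`. Whether chunking helps depends on how racebert batches inputs internally, which is not described here.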

HuggingFace

Alternatively, you can work directly with the transformers models hosted on the Hugging Face Hub.

Race Model: https://huggingface.co/pparasurama/raceBERT
Ethnicity Model: https://huggingface.co/pparasurama/raceBERT-ethnicity

Please refer to the transformers documentation for how to load and run these models.
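
As a rough starting point, the hosted models can likely be loaded with the generic `pipeline` API from transformers. The sketch below rests on several assumptions: that both models are text-classification checkpoints, that raw name strings are acceptable input (the racebert package may normalize names before inference, and that step is not shown here), and that `MODEL_IDS` and `load_classifier` are hypothetical names introduced for illustration.

```python
# Hypothetical wrapper around transformers.pipeline; not part of racebert.
MODEL_IDS = {
    "race": "pparasurama/raceBERT",
    "ethnicity": "pparasurama/raceBERT-ethnicity",
}


def load_classifier(task="race", device=-1):
    """Build a text-classification pipeline for the chosen model.

    device=-1 runs on CPU; a non-negative integer selects a CUDA device.
    """
    # Imported lazily so that defining this function does not require
    # transformers to be installed.
    from transformers import pipeline

    return pipeline("text-classification", model=MODEL_IDS[task], device=device)


# Usage (downloads the model from the Hub on first call):
# classifier = load_classifier("race")
# print(classifier("Barack Obama"))
```

Scores obtained this way may differ from those returned by `RaceBERT.predict_race` if the package applies its own preprocessing.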

Owner

Prasanna Parasurama
