A Python package to fine-tune transformer-based models for named entity recognition (NER).

Last update: Jul 30, 2022

nerblackbox

A Python package to fine-tune transformer-based language models for named entity recognition (NER).

Resources

Source Code: https://github.com/af-ai-center/nerblackbox
Documentation: https://af-ai-center.github.io/nerblackbox
PyPI: https://pypi.org/project/nerblackbox

About

Transformer-based language models like BERT have had a game-changing impact on Natural Language Processing.

To use Hugging Face's publicly available pretrained models for Named Entity Recognition, one needs to retrain (or "fine-tune") them on labeled text.

nerblackbox makes this easy.

You give it

a Dataset (labeled text)
a Pretrained Model (transformers)

and you get

the best Fine-tuned Model
its Performance on the dataset
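
To make "labeled text" and "performance" concrete, here is a minimal sketch in plain Python. It assumes the common BIO tagging scheme and a strict entity-level F1 score. It is an illustration only and does not use the nerblackbox API; the names bio_to_spans and entity_f1 and the example sentence are hypothetical.

```python
from typing import List, Set, Tuple

# (entity type, start token index, end token index exclusive)
Span = Tuple[str, int, int]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one sentence's BIO tags."""
    spans: Set[Span] = set()
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing entity
        continues = tag.startswith("I-") and label == tag[2:]
        if start is not None and not continues:
            spans.add((label, start, i))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


def entity_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Strict entity-level F1: a prediction counts only if type and span match."""
    gold_spans, pred_spans = set(), set()
    for idx, (g, p) in enumerate(zip(gold, pred)):
        gold_spans |= {(idx, *s) for s in bio_to_spans(g)}
        pred_spans |= {(idx, *s) for s in bio_to_spans(p)}
    tp = len(gold_spans & pred_spans)
    precision = tp / len(pred_spans) if pred_spans else 0.0
    recall = tp / len(gold_spans) if gold_spans else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical labeled sentence: one tag per token.
tokens = ["Ada", "Lovelace", "visited", "London"]
gold_tags = ["B-PER", "I-PER", "O", "B-LOC"]
pred_tags = ["B-PER", "I-PER", "O", "O"]  # the model missed the location

print(bio_to_spans(gold_tags))
print(round(entity_f1([gold_tags], [pred_tags]), 3))
```

In this sketch the prediction finds one of the two gold entities, so precision is 1.0 and recall is 0.5. The metrics that nerblackbox itself reports are described in its documentation.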

Installation

pip install nerblackbox
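
After installing, one way to confirm that the package is visible to the current Python environment is to query its metadata from the standard library. This is a minimal sketch; the helper name installed_version is hypothetical, and it only checks that a distribution named nerblackbox is installed, not that it works.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("nerblackbox")
if version is None:
    print("nerblackbox is not installed in this environment")
else:
    print(f"nerblackbox {version} is installed")
```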

Usage

See the documentation: https://af-ai-center.github.io/nerblackbox

Citation

@misc{nerblackbox,
  author = {Stollenwerk, Felix},
  title  = {nerblackbox: a python package to fine-tune transformer-based language models for named entity recognition},
  year   = {2021},
  url    = {https://github.com/af-ai-center/nerblackbox},
}

Owner

Felix Stollenwerk
