TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Last update: Feb 07, 2022

Related tags

Overview

TFPNER

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Named entity recognition (NER), which aims at identifying real-world entity mentions from texts, is a fundamental task in natural language processing with a wide range of applications. Previous approaches mainly focus on the original pure sentence but the Part of speech (POS) contains rich semantic information and contribute to the success of the Natural Language Processing task. To further improve the performance of the NER task, we proposed the five methods that employed POS tags fused with the original tokens based on the BERT model to achieve the NER task, including concatenating token and POS as one or two sentences, adding POS embedding as one of the embedding elements, model ensemble, and conduct the multi-attention between the token representations and POS representations. In this work, we addressed the CoNLL-2003 and Groningen Meaning Bank (GMB) datasets which can provide both NER tags and POS tags. From our experiments on two datasets, part of the proposed methods can show performance improvement in comparison with the baseline methods.

This is the project I worked with Haoqing Tang, the extraordinary computer scientist in CV & NLP area, during the interesting and memorable Master study period.

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Related tags

Overview

TFPNER

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

This is the project I worked with Haoqing Tang, the extraordinary computer scientist in CV & NLP area, during the interesting and memorable Master study period.

Owner

Automated question generation and question answering from Turkish texts using text-to-text transformers

Contains descriptions and code of the mini-projects developed in various programming languages

American Sign Language (ASL) to Text Converter

End-to-end image captioning with EfficientNet-b3 + LSTM with Attention

中文生成式预训练模型

nlpcommon is a python Open Source Toolkit for text classification.

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

wxPython app for converting encodings, modifying and fixing SRT files

Mapping a variable-length sentence to a fixed-length vector using BERT model

Augmenty is an augmentation library based on spaCy for augmenting texts.

This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

FedNLP: A Benchmarking Framework for Federated Learning in Natural Language Processing

Basic Utilities for PyTorch Natural Language Processing (NLP)

Convolutional Neural Networks for Sentence Classification

A python package to fine-tune transformer-based models for named entity recognition (NER).

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

Auto-researching tool generating word documents.

Modified GPT using average pooling to reduce the softmax attention memory constraints.

숭실대학교 컴퓨터학부 전공종합설계프로젝트

GooAQ 🥑 : Google Answers to Google Questions!