Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

Last update: Mar 17, 2022

Overview

Deep-Learning-for-Text-Document-Classification

Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

Why Text-Document Classification?

Today’s emergence of large digital documents makes the text classification task more crucial, especially for companies to maximize their workflow or even profits. Recently, the progress of NLP research on text classification has arrived at the state-of-the-art (SOTA). It has achieved terrific results, showing Deep Learning methods as the cutting-edge technology to perform such tasks. Hence, the need to assess the performance of the SOTA deep learning models for text classification is essential not only for academic purposes but also for AI practitioners or professionals that need guidance and benchmark on similar projects.

Owner

Happy N. Monday

GitHub Repository

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation Tasks | Datasets | LongLM | Baselines | Paper Introduction LOT is a ben

46 Dec 28, 2022

Blue Brain text mining toolbox for semantic search and structured information extraction

Blue Brain Search Source Code DOI Data & Models DOI Documentation Latest Release Python Versions License Build Status Static Typing Code Style Securit

29 Dec 01, 2022

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

StyleSpeech - PyTorch Implementation PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation. Status (2021.06.09

142 Jan 06, 2023

Various Algorithms for Short Text Mining

Short Text Mining in Python Introduction This package shorttext is a Python package that facilitates supervised and unsupervised learning for short te

466 Dec 06, 2022

Code for ACL 2020 paper "Rigid Formats Controlled Text Generation"

SongNet SongNet: SongCi + Song (Lyrics) + Sonnet + etc. @inproceedings{li-etal-2020-rigid, title = "Rigid Formats Controlled Text Generation",

212 Dec 17, 2022

A Facebook Messenger Chatbot using NLP

A Facebook Messenger Chatbot using NLP This project is about creating a messenger chatbot using basic NLP techniques and models like Logistic Regressi

6 Nov 20, 2022

SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors [Paper] [Project Website] Pytorch implementation for SAVI2I. We

44 Dec 30, 2022

Main repository for the chatbot Bobotinho.

Bobotinho Bot Main repository for the chatbot Bobotinho. ℹ️ Introduction Twitch chatbot with entertainment commands. ‎ 💻 Technologies Concurrent code

14 Nov 29, 2022

Outreachy TFX custom component project

Schema Curation Custom Component Outreachy TFX custom component project This repo contains the code for Schema Curation Custom Component made as a par

5 Jul 16, 2021

Pipeline for training LSA models using Scikit-Learn.

Latent Semantic Analysis Pipeline for training LSA models using Scikit-Learn. Usage Instead of writing custom code for latent semantic analysis, you j

23 Sep 05, 2022

A full spaCy pipeline and models for scientific/biomedical documents.

This repository contains custom pipes and models related to using spaCy for scientific documents. In particular, there is a custom tokenizer that adds

1.3k Jan 03, 2023

This is the offline-training-pipeline for our project.

offline-training-pipeline This is the offline-training-pipeline for our project. We adopt the offline training and online prediction Machine Learning

0 Apr 22, 2022

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

English|简体中文 ERNIE是百度开创性提出的基于知识增强的持续学习语义理解框架，该框架将大数据预训练与多源丰富知识相结合，通过持续学习技术，不断吸收海量文本数据中词汇、结构、语义等方面的知识，实现模型效果不断进化。ERNIE在累积 40 余个典型 NLP 任务取得 SOTA 效果，并在 G

5.4k Jan 03, 2023

Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

Related tags

Overview

Deep-Learning-for-Text-Document-Classification

Why Text-Document Classification?

Owner

Happy N. Monday

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

Blue Brain text mining toolbox for semantic search and structured information extraction

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Various Algorithms for Short Text Mining

Code for ACL 2020 paper "Rigid Formats Controlled Text Generation"

A Facebook Messenger Chatbot using NLP

SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

Main repository for the chatbot Bobotinho.

Outreachy TFX custom component project

Pipeline for training LSA models using Scikit-Learn.

A full spaCy pipeline and models for scientific/biomedical documents.

This is the offline-training-pipeline for our project.

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Implementation of Fast Transformer in Pytorch

Journalism AI – Quotes extraction for modular journalism

Code for PED: DETR For (Crowd) Pedestrian Detection

用Resnet101+GPT搭建一个玩王者荣耀的AI

Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE

This is a really simple text-to-speech app made with python and tkinter.

Knowledge Graph,Question Answering System，基于知识图谱和向量检索的医疗诊断问答系统