【关于 NLP】那些你不知道的事

作者：杨夕、芙蕖、李玲、陈海顺、twilight、LeoLRH、JimmyDU、艾春辉、张永泰、金金金

介绍

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

目录架构

一、【关于基础算法篇】那些你不知道的事

二、【关于机器学习算法篇】那些你不知道的事

三、【关于深度学习算法篇】那些你不知道的事

四、【关于 NLP 学习算法】那些你不知道的事

4.1 【关于信息抽取】那些你不知道的事

4.1.1 【关于命名实体识别】那些你不知道的事

4.1.2 【关于关系抽取】那些你不知道的事

【关于关系抽取】那些你不知道的事

4.1.3 【关于事件抽取】那些你不知道的事

【关于事件抽取】那些你不知道的事

4.2 【关于 NLP 预训练算法】那些你不知道的事

4.3 【关于文本分类】那些你不知道的事

4.4 【关于文本匹配】那些你不知道的事

4.5 【关于问答系统】那些你不知道的事

4.5.1 【关于 FAQ 检索式问答系统】那些你不知道的事

【关于 FAQ 检索式问答系统】那些你不知道的事

4.5.2 【关于问答系统工具篇】那些你不知道的事

【关于 Faiss 】那些你不知道的事

4.6 【关于对话系统】那些你不知道的事

4.7 【关于知识图谱】那些你不知道的事

五、【关于 NLP 技巧】那些你不知道的事

5.1 【关于少样本问题】那些你不知道的事

5.2 【关于脏数据】那些你不知道的事

【关于 “脏数据”处理】那些你不知道的事
- 一、动机
  - 1.1 何为“脏数据”？
  - 1.2 “脏数据” 会带来什么后果？
- 二、“脏数据” 处理篇
  - 2.1 “脏数据” 怎么处理呢？
  - 2.2 置信学习方法篇

5.3 【关于炼丹炉】那些你不知道的事

【关于 batch_size设置】那些你不知道的事
- 一、训练模型时，batch_size的设置，学习率的设置?

六、【关于 Python 】那些你不知道的事

【关于 Python 】那些你不知道的事

七、【关于 Tensorflow 】那些你不知道的事

【关于 Tensorflow 损失函数】那些你不知道的事

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

Related tags

Overview

【关于 NLP】那些你不知道的事

介绍

目录架构

一、【关于 基础算法篇】那些你不知道的事

二、【关于 机器学习算法篇】那些你不知道的事

三、【关于 深度学习算法篇】那些你不知道的事

四、【关于 NLP 学习算法】那些你不知道的事

4.1 【关于 信息抽取】那些你不知道的事

4.1.1 【关于 命名实体识别】那些你不知道的事

4.1.2 【关于 关系抽取】那些你不知道的事

4.1.3 【关于 事件抽取】那些你不知道的事

4.2 【关于 NLP 预训练算法】那些你不知道的事

4.3 【关于 文本分类】那些你不知道的事

4.4 【关于 文本匹配】那些你不知道的事

4.5 【关于 问答系统】那些你不知道的事

4.5.1 【关于 FAQ 检索式问答系统】 那些你不知道的事

4.5.2 【关于 问答系统工具篇】 那些你不知道的事

4.6 【关于 对话系统】那些你不知道的事

4.7 【关于 知识图谱】那些你不知道的事

4.7.1 【关于 知识图谱】 那些你不知道的事

4.7.2 【关于 KBQA】那些你不知道的事

4.7.3 【关于 Neo4j】那些你不知道的事

4.8 【关于 文本摘要】 那些你不知道的事

4.9 【关于 知识表示学习】那些你不知道的事

五、【关于 NLP 技巧】那些你不知道的事

5.1 【关于 少样本问题】那些你不知道的事

5.2 【关于 脏数据】那些你不知道的事

5.3 【关于 炼丹炉】那些你不知道的事

六、【关于 Python 】那些你不知道的事

七、【关于 Tensorflow 】那些你不知道的事

Owner

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Associated Repository for "Translation between Molecules and Natural Language"

EMNLP 2021 paper "Pre-train or Annotate? Domain Adaptation with a Constrained Budget".

a test times augmentation toolkit based on paddle2.0.

تولید اسم های رندوم فینگیلیش

Generate text line images for training deep learning OCR model (e.g. CRNN)

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Uses Google's gTTS module to easily create robo text readin' on command.

PortaSpeech - PyTorch Implementation

ChatBotProyect - This is an unfinished project about a simple chatbot.

Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP 2020)

结巴中文分词

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

Translate - a PyTorch Language Library

Signature remover is a NLP based solution which removes email signatures from the rest of the text.

PyWorld3 is a Python implementation of the World3 model

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

List of GSoC organisations with number of times they have been selected.

Use fastai-v2 with HuggingFace's pretrained transformers

Library for fast text representation and classification.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

一、【关于基础算法篇】那些你不知道的事

二、【关于机器学习算法篇】那些你不知道的事

三、【关于深度学习算法篇】那些你不知道的事

4.1 【关于信息抽取】那些你不知道的事

4.1.1 【关于命名实体识别】那些你不知道的事

4.1.2 【关于关系抽取】那些你不知道的事

4.1.3 【关于事件抽取】那些你不知道的事

4.3 【关于文本分类】那些你不知道的事

4.4 【关于文本匹配】那些你不知道的事

4.5 【关于问答系统】那些你不知道的事

4.5.1 【关于 FAQ 检索式问答系统】那些你不知道的事

4.5.2 【关于问答系统工具篇】那些你不知道的事

4.6 【关于对话系统】那些你不知道的事

4.7 【关于知识图谱】那些你不知道的事

4.7.1 【关于知识图谱】那些你不知道的事

4.8 【关于文本摘要】那些你不知道的事

4.9 【关于知识表示学习】那些你不知道的事

5.1 【关于少样本问题】那些你不知道的事

5.2 【关于脏数据】那些你不知道的事

5.3 【关于炼丹炉】那些你不知道的事