Converting CPT to BERT format for use

Last update: Oct 14, 2021

cpt-encoder

Converting CPT to BERT format for use

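Description

I just came across yet another new model, CPT. According to the paper, it outperforms MacBERT on many Chinese tasks, so I could not wait to try it out.

After studying the source code, I found that for NLU modeling the model mainly uses its encoder, which is essentially BERT. I therefore converted that part of the weights into BERT-format weights, which makes NLU tasks more convenient.

Of course, to get the full performance out of CPT, you should still use the official code in a generative fashion, for example with prompts.

Performance has not been tested yet; after the first epoch it looks about the same as RoBERTa.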

Loading

The converted weights can be loaded with Hugging Face transformers, in the same way as BERT.
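A minimal loading sketch, assuming the downloaded archive has been unpacked into a local directory that holds the usual BERT files (`config.json`, `vocab.txt`, and the weight file). The helper name `load_cpt_encoder` and the directory `./cpt-encoder-base` are illustrative and do not come from the repository.

```python
from transformers import BertModel, BertTokenizer


def load_cpt_encoder(weights_dir):
    """Load converted CPT encoder weights with the stock BERT classes.

    `weights_dir` is assumed to be a local directory in the standard
    transformers layout (config.json, vocab.txt, weight file).
    """
    tokenizer = BertTokenizer.from_pretrained(weights_dir)
    model = BertModel.from_pretrained(weights_dir)
    model.eval()  # disable dropout for feature extraction
    return tokenizer, model


# Hypothetical usage (the path is an assumption):
# tokenizer, model = load_cpt_encoder("./cpt-encoder-base")
# inputs = tokenizer("今天天气很好", return_tensors="pt")
# hidden = model(**inputs).last_hidden_state
```

For fine-tuning on a classification task, the same directory could in principle be passed to `BertForSequenceClassification.from_pretrained` instead of `BertModel`.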

Conversion code

See convert_cpt_to_bert.py
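The script itself is not reproduced here. As a rough illustration of the general idea (keep only the encoder parameters and rename them so a BERT model can read them), here is a sketch that works on any name-to-tensor mapping. The prefixes `encoder.` and `bert.` are assumptions for illustration; the real parameter names must be checked against the actual CPT checkpoint, and this is not a copy of convert_cpt_to_bert.py.

```python
def extract_encoder_weights(cpt_state, src_prefix="encoder.", dst_prefix="bert."):
    """Keep only encoder parameters and rename them for a BERT model.

    cpt_state:  mapping of parameter name -> tensor (e.g. a loaded state dict)
    src_prefix: assumed prefix of encoder parameters in the CPT checkpoint
    dst_prefix: assumed prefix expected by the target BERT model
    """
    bert_state = {}
    for name, tensor in cpt_state.items():
        if name.startswith(src_prefix):
            # Swap the source prefix for the destination prefix.
            bert_state[dst_prefix + name[len(src_prefix):]] = tensor
    return bert_state


# Toy example with placeholder values instead of real tensors.
toy = {
    "encoder.embeddings.word_embeddings.weight": 1,
    "decoder.layers.0.fc1.weight": 2,
}
print(extract_encoder_weights(toy))
```

With real checkpoints, the mapping would typically come from `torch.load(...)` and the result would be written back with `torch.save(...)` next to a matching BERT `config.json`.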

Converted weights

cpt-encoder-base: https://pan.baidu.com/s/1PqUAWNczX9vVcFtRHcE5cg (extraction code: 2fo2)

cpt-encoder-large: https://pan.baidu.com/s/1KwumkF1NRL6wX7aifnq4xA (extraction code: ke7o)

Official links

Paper: CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

GitHub: CPT


Owner

黄辉
