BARTScore: Evaluating Generated Text as Text Generation

Last update: Dec 17, 2022

Overview

This is the repository for the paper "BARTScore: Evaluating Generated Text as Text Generation".

Updates

2021.06.28 Release online evaluation Demo
2021.06.25 Release online Explainable Leaderboard for Meta-evaluation
2021.06.22 Code will be released soon

Background

A recent trend leverages neural models for automated evaluation in several different ways, as shown in Fig. 1.

(a) Evaluation as matching task. Unsupervised matching metrics aim to measure the semantic equivalence between the reference and the hypothesis by using token-level matching functions in a distributed representation space (e.g. BERT) or a discrete string space (e.g. ROUGE).

(b) Evaluation as regression task. Regression-based metrics (e.g. BLEURT) introduce a parameterized regression layer, which is learned in a supervised fashion to predict human judgments accurately.

(c) Evaluation as ranking task. Ranking-based metrics (e.g. COMET) aim to learn a scoring function that assigns a higher score to better hypotheses than to worse ones.

(d) Evaluation as generation task. In this work, we formulate evaluating generated text as a text generation task from pre-trained language models.
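
To make formulation (d) concrete: the paper defines the score of a hypothesis as a weighted sum of the log-probabilities a seq2seq model assigns to each target token given the source. The sketch below shows only that arithmetic. It assumes the per-token log-probabilities have already been obtained from a pre-trained model such as BART; the function name, the `average` option, and the toy probabilities are hypothetical and are not this repository's API.

```python
import math
from typing import Optional, Sequence


def generation_score(
    token_logprobs: Sequence[float],
    weights: Optional[Sequence[float]] = None,
    average: bool = False,
) -> float:
    """Weighted log-likelihood of a target sequence.

    token_logprobs[t] stands for log p(y_t | y_<t, x) as produced by some
    seq2seq model (assumed to be computed elsewhere).
    weights defaults to uniform weights of 1.0.
    average=True divides by the number of tokens (a hypothetical
    length-normalization option, not taken from the source).
    """
    if not token_logprobs:
        raise ValueError("token_logprobs must not be empty")
    if weights is None:
        weights = [1.0] * len(token_logprobs)
    if len(weights) != len(token_logprobs):
        raise ValueError("weights and token_logprobs must have equal length")

    total = sum(w * lp for w, lp in zip(weights, token_logprobs))
    return total / len(token_logprobs) if average else total


# Toy, made-up per-token probabilities for two candidate hypotheses.
fluent = [math.log(p) for p in (0.9, 0.8, 0.7)]
garbled = [math.log(p) for p in (0.2, 0.1, 0.3)]

# The hypothesis the model finds more likely receives the higher score.
print(generation_score(fluent) > generation_score(garbled))
```

Scores computed this way are log-probabilities, so they are never positive; values closer to zero mean the model considers the text more likely.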

Owner

NeuLab
