TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

Last update: Dec 26, 2022

Related tags

Deep Learning TEDSummary

Overview

TEDSummary

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id. This script crawls the TEDTalk website to get the above information. However, this script does not supply audio data. You can use the utterance id to align TED-LIUM3 (https://www.openslr.org/51/) or extract audio from the MP4 file.

References

[1] Takatomo Kano, Atsunori Ogawa, Marc Delcroix, and Shinji Watanabe "Attention-based Multi-hypothesis Fusion for Speech Summarization," Proc. ASRU, pp. –, 2021

Citation
@inproceedings{attention-fusion,
author = {Takatomo Kano and Atsunori Ogawa and Marc Delcroix and Shinji Watanabe},
title = {Attention-based Multi-hypothesis Fusion for Speech Summarization},
booktitle = {{ASRU 2021 - 2021 IEEE Automatic Speech Recoginition and Understanding Workshop (ASRU)}},
pages={-},
year = {2021}
}

Install tools

Python 3. requests unidecode json tqdm unicodedata

How to run

cd TEDSummary/ python TEDListCrawler.py

Outputs

telklist.json: URLs list for tedtalks.
ted_summary.json: Summarization dataset. That includes summary IDs, TEDTalk URL, mp4 URL, document, abstract, title, speaker name, and uttrance id for Tedlium alignment.

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

Related tags

Overview

TEDSummary

References

Install tools

How to run

Outputs

Owner

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

A list of all papers and resoureces on Semantic Segmentation

docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

This project deals with the detection of skin lesions within the ISICs dataset using YOLOv3 Object Detection with Darknet.

Hardware-accelerated DNN model inference ROS2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU

deep-table implements various state-of-the-art deep learning and self-supervised learning algorithms for tabular data using PyTorch.

🐾 Semantic segmentation of paws from cute pet images (PyTorch)

GUPNet - Geometry Uncertainty Projection Network for Monocular 3D Object Detection

Code for Towards Streaming Perception (ECCV 2020) :car:

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

CNN Based Meta-Learning for Noisy Image Classification and Template Matching

Optimizaciones incrementales al problema N-Body con el fin de evaluar y comparar las prestaciones de los traductores de Python en el ámbito de HPC.

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021

ESP32 python application to read data from a Tilt™ Hydrometer for homebrewing

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018

Spontaneous Facial Micro Expression Recognition using 3D Spatio-Temporal Convolutional Neural Networks

TAPEX: Table Pre-training via Learning a Neural SQL Executor

BalaGAN: Image Translation Between Imbalanced Domains via Cross-Modal Transfer