Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Last update: Jan 07, 2023

Overview

Ego4D

EGO4D is the world's largest egocentric (first person) video ML dataset and benchmark suite, with 3,600 hrs (and counting) of densely narrated video and a wide range of annotations across five new benchmark tasks. It covers hundreds of scenarios (household, outdoor, workplace, leisure, etc.) of daily life activity captured in-the-wild by 926 unique camera wearers from 74 worldwide locations and 9 different countries. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. The approach to data collection was designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant.

Public Documentation/Start Here: Ego4D Docs

For the CLI readme (to download/access): CLI README

For a demo notebook: Annotation Notebook

For the visualization engine: Viz README

For feature extraction: Feature README

License

Ego4D is released under the MIT License.

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Related tags

Overview

Ego4D

License

Owner

Meta Research

Suite of 500 procedurally-generated NLP tasks to study language model adaptability

Synthetic LiDAR sequential point cloud dataset with point-wise annotations

[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

The coda and data for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)

Pure python implementation reverse-mode automatic differentiation

Training DiffWave using variational method from Variational Diffusion Models.

GndNet: Fast ground plane estimation and point cloud segmentation for autonomous vehicles using deep neural networks.

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

Unofficial Tensorflow 2 implementation of the paper Implicit Neural Representations with Periodic Activation Functions

AirLoop: Lifelong Loop Closure Detection

PyTorch implementation of DeepDream algorithm

CondNet: Conditional Classifier for Scene Segmentation

Efficiently Disentangle Causal Representations

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence

Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness

A collection of papers about Transformer in the field of medical image analysis.

Drone detection using YOLOv5

Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"

Official implementation of "GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators" (NeurIPS 2020)