Welcome to the comma.ai Calibration Challenge!

Your goal is to predict the direction of travel (in camera frame) from provided dashcam video.

This repo provides 10 videos. Every video is 1min long and 20 fps.
5 videos are labeled with a 2D array describing the direction of travel at every frame of the video with a pitch and yaw angle in radians.
5 videos are unlabeled. It is your task to generate the labels for them.
The example labels are generated using a Neural Network, and the labels were confirmed with a SLAM algorithm.
You can estimate the focal length to be 910 pixels.

Context

The devices that run openpilot are not mounted perfectly. The camera is not exactly aligned to the vehicle. There is some pitch and yaw angle between the camera of the device and the vehicle, which can vary between installations. Estimating these angles is essential for accurate control of the vehicle. The best way to start estimating these values is to predict the direction of motion in camera frame. More info can be found in this readme.

Deliverable

Your deliverable is the 5 labels called 5.txt to 9.txt. These labels should be a 2D array that contains the pitch and yaw angles of the direction of travel (in camera frame) of every frame of the respective videos. Zip them up and e-mail it to [email protected].

Evaluation

We will evaluate your mean squared error against our ground truth labels. Errors for frames where the car speed is less than 4m/s will be ignored. Those are also labeled as NaN in the example labels.

This repo includes an eval script that will give an error score (lower is better). You can use it to test your solutions against the labeled examples. We will use this script to evaluate your solution.

Hints

Keep the goal and evaluation script in mind, creative solutions are allowed.
Look at plots of your solutions before submitting.

$500 Prize CLAIMED

The first submission that scores an error under 25% on the unlabeled set, will receive a $500 prize.

The comma.ai Calibration Challenge!

Related tags

Overview

Welcome to the comma.ai Calibration Challenge!

Context

Deliverable

Evaluation

Hints

$500 Prize CLAIMED

Owner

comma.ai

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Towards Long-Form Video Understanding

Code for ICE-BeeM paper - NeurIPS 2020

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

an Evolutionary Algorithm assisted GAN

Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022

HAR-stacked-residual-bidir-LSTMs - Deep stacked residual bidirectional LSTMs for HAR

Using image super resolution models with vapoursynth and speeding them up with TensorRT

Learning Visual Words for Weakly-Supervised Semantic Segmentation

Official implementation of ETH-XGaze dataset baseline

Multispectral Object Detection with Yolov5

Anonymous implementation of KSL

Code and project page for ICCV 2021 paper "DisUnknown: Distilling Unknown Factors for Disentanglement Learning"

Randomizes the warps in a stock pokeemerald repo.

This is the pytorch re-implementation of the IterNorm

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models

Linear Variational State Space Filters

PyTorch-lightning implementation of the ESFW module proposed in our paper Edge-Selective Feature Weaving for Point Cloud Matching

Combining Diverse Feature Priors