a dnn ai project to classify which food people are eating on audio recordings

Last update: Oct 24, 2021

Related tags

Overview

Deep Learning - EAT Challenge

About

This project is part of an AI challenge of the DeepLearning course 2021 at the University of Augsburg. The objective to be learned is a classification task telling which food people are eating on audio recordings.

Students

This project was created by:

Benjamin Möckl
Julian Göser
Marco Tröster

EAT Dataset Setup

For your convenience, the download of all external project assets (dataset and evaluation metrics) has been automated by a shell script. After executing the script you should be ready to run / develop the project code.

# download and unpack the dataset and metric files
./init_dataset_and_metrics.sh <dataset zip password>

How to Run

First, cache the input dataset as TFRecord files for a training session (e.g. naive training). This should massively improve your training performance (especially with low CPU / GPU resources).

# cache the preprocessed audio dataset as TFRecord file
python src/main.py preprocess_dataset naive

Now, you can launch a training session (e.g. naive training).

# process a training session
python src/main.py run_training naive

After that you can sample all inputs of the unknown test dataset using a trained model and export the prediction results for EAT challenge submission.

# evaluate the results for submission
python src/main.py eval_results naive

Valid training configurations are:

naive
noisy
autoenc
amplitude

Remark: Use a GPU empowered machine for amplitude training (although it won't be too rewarding anyways). Tested on Ubuntu 20.04. For running on Windows, the keras ModelCheckpoint Callback has to be switched to our SaveBestAccuracyCallback.

Training Results

Training	Approach Description	Test Acc.	Real Acc.
Naive	Train on audio melspectrograms using Conv2D	0.41	0.36
Noisy	Train on audio melspectrograms using custom noisy Conv2D	0.44	0.39
Amplitude	Train on audio amplitude using Conv1D	0.23	?.??
AutoEnc	Train on audio melspectrograms using an Auto Encoder	0.25	?.??

a dnn ai project to classify which food people are eating on audio recordings

Related tags

Overview

Deep Learning - EAT Challenge

About

Students

EAT Dataset Setup

How to Run

Training Results

Owner

Marco Tröster

code for our paper "Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer"

Discovering Dynamic Salient Regions with Spatio-Temporal Graph Neural Networks

Go from graph data to a secure and interactive visual graph app in 15 minutes. Batteries-included self-hosting of graph data apps with Streamlit, Graphistry, RAPIDS, and more!

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

Code for IntraQ, PyTorch implementation of our paper under review

Megaverse is a new 3D simulation platform for reinforcement learning and embodied AI research

phylotorch-bito is a package providing an interface to BITO for phylotorch

Official implementation of the paper Do pedestrians pay attention? Eye contact detection for autonomous driving

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

TensorFlow 101: Introduction to Deep Learning for Python Within TensorFlow

Molecular Sets (MOSES): A benchmarking platform for molecular generation models

“Data Augmentation for Cross-Domain Named Entity Recognition” (EMNLP 2021)

Code for "Multi-Compound Transformer for Accurate Biomedical Image Segmentation"

Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal transformer that encodes language inputs and the full episode history of visual observations and actions.

Pytorch reimplement of the paper "A Novel Cascade Binary Tagging Framework for Relational Triple Extraction" ACL2020. The original code is written in keras.

BARTScore: Evaluating Generated Text as Text Generation

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

Management Dashboard for Torchserve

Tensorflow implementation for Self-supervised Graph Learning for Recommendation

Learning Confidence for Out-of-Distribution Detection in Neural Networks