This is a Deep Leaning API for classifying emotions from human face and human audios.

Last update: Oct 02, 2022

Overview

Emotion AI

This is a Deep Leaning API for classifying emotions from human face and human audios.

Starting the server

To start the server first you need to install all the packages used by running the following command:

pip install -r requirements.txt
# make sure your current directory is "server"

After that you can start the server by running the following commands:

change the directory from server to api:

cd api

run the app.py

python app.py

The server will start at a default PORT of 3001 which you can configure in the api/app.py on the Config class:

class AppConfig:
    PORT = 3001
    DEBUG = False

If everything went well you will be able to make api request to the server.

EmotionAI

Consist of two parallel models that are trained with different model architectures to save different task. The one is for audio classification and the other is for facial emotion classfication. Each model is served on a different endpoint but on the same server.

Audio Classification

Sending an audio file to the server at http://127.0.0.1:3001/api/classify/audio using the POST method we will be able to get the data that looks as follows as the json response from the server:

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Classifying audios

Using cURL

To classify the audio using cURL make sure that you open the command prompt where the audio files are located for example in my case the audios are located in the audios folder so i open the command prompt in the audios folder or else i will provide the absolute path when making a cURL request for example

curl -X POST -F [email protected] http://127.0.0.1:3001/api/classify/audio

If everything went well we will get the following response from the server:

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Using Postman client

To make this request with postman we do it as follows:

Change the request method to POST at http://127.0.0.1:3001/api/classify/audio
Click on form-data
Select type to be file on the KEY attribute
For the KEY type audio and select the audio you want to predict under value Click send
If everything went well you will get the following response depending on the audio you have selected:

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Using JavaScript fetch api.
First you need to get the input from html
Create a formData object
make a POST requests

res.json()) .then((data) => console.log(data));">

const input = document.getElementById("input").files[0];
let formData = new FormData();
formData.append("audio", input);
fetch("http://127.0.0.1:3001/api/classify/audio", {
  method: "POST",
  body: formData,
})
  .then((res) => res.json())
  .then((data) => console.log(data));

If everything went well you will be able to get expected response.

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Notebooks

If you want to see how the models were trained you can open the respective notebooks:

Audio Classification

This is a Deep Leaning API for classifying emotions from human face and human audios.

Related tags

Overview

Emotion AI

Starting the server

EmotionAI

Audio Classification

Classifying audios

Notebooks

Owner

crispengari

Gluon CV Toolkit

Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

Open source implementation of AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing

For IBM Quantum Challenge 2021 (May 20 - 26)

Machine Learning Platform for Kubernetes

Attention for PyTorch with Linear Memory Footprint

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

Learning to trade under the reinforcement learning framework

Code to compute permutation and drop-column importances in Python scikit-learn models

Cryptocurrency Prediction with Artificial Intelligence (Deep Learning via LSTM Neural Networks)

A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

This program writes christmas wish programmatically. It is using turtle as a pen pointer draw christmas trees and stars.

DeepLab2: A TensorFlow Library for Deep Labeling

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

MLJetReconstruction - using machine learning to reconstruct jets for CMS

Repository for the AugmentedPCA Python package.

Use unsupervised and supervised learning to predict stocks