Global-Local Attention for Emotion Recognition

Last update: Apr 21, 2022

Related tags

Overview

Global-Local Attention for Emotion Recognition

Requirements

Python 3
Install tensorflow (or tensorflow-gpu) >= 2.0.0
Install some other packages

pip install cython
pip install opencv-python==4.3.0.36 matplotlib numpy==1.18.5 dlib

Dataset

We provide the NCAER-S dataset with original images and extracted faces (a .txt file with 4 bounding box coordinate) in the NCAERS dataset.

The dataset can be downloaded at Google Drive

Note that the dataset and label should have structure like the followings:

NCAER-S 
│
└───images
│   │
│   └───class_1
│   │   │   img1.jpg
│   │   │   img2.jpg
│   │   │   ...
│   └───class_2
│       │   img1.jpg
│       │   img2.jpg
│       │   ...
│   
└───crop
│   │
│   └───class_1
│   │   │   img1.txt
│   │   │   img2.txt
│   │   │   ...
│   └───class_2
│       │   img1.txt
│       │   img2.txt
│       │   ...

Running

Our code supports these types of execution with argument -m or --mode:

#extract faces from <train, val or test> dataset (specified in config.py)
python run.py -m extract dataset_type=train

#train the model with config specified in the config.py
python run.py -m train 

#evaluate the trained model on the dataset <dataset_type>
python run.py -m eval --dataset_type=test --trained_weights=path/to/weights

Evaluation

Our trained model is available at weights/glamor-net/Model.

Firstly, please download the dataset and extract it into "data/" directory.
Then specified the path to the test data (images and crop):

config = config.copy({
    'test_images': 'path_to_test_images',
    'test_crop':   'path_to_test_cropped_faces' #(.txt files),
})

Run this command to evaluate the model. We are using the classification accuracy as our evaluation metric.

# Evaluate our model in the test set
python run.py -m eval --dataset_type=test --trained_weights=weights/glamor-net/Model

Training

Firstly please extract the faces from train set (val set is optional)

Specify the path to the dataset in config.py (train_images, val_images, test_images)
Specify the desired face-extracted output path in config.py (train_crop, val_crop, test_crop)

config = config.copy({

    'train_images': 'path_to_training_images',
    'train_crop':   'path_to_training_cropped_faces' #(.txt files),

    'val_images': 'path_to_validation_images',
    'val_crop':   'path_to_validation_cropped_faces' #(.txt files)

})

Perform face extraction on both dataset_type by running the commands:

python run.py -m extract --dataset_type=<train, val or test>

Start training:

# Train a new model from sratch
python run.py -m train 

# Continue training a model that you had trained earlier
python run.py -m train --resume=path/to/trained_weights

# Resume the last checkpoint model
python run.py -m train --resume=last

Prediction

We support prediction on single image or on images in a directory by running this command:

# Predict on single image
python predict.py --trained_weights=weights/glamor-net/Model --input=test_images/1.jpg --output=path/to/out/directory

# Predict on images in directory
python predict.py --trained_weights=weights/glamor-net/Model --input=test_images/ --output=out/

Use the help option to see a description of all available command line arguments

Global-Local Attention for Emotion Recognition

Related tags

Overview

Global-Local Attention for Emotion Recognition

Requirements

Dataset

Running

Evaluation

Training

Prediction

Use the help option to see a description of all available command line arguments

Owner

Minh Nhat Le

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

Training neural models with structured signals.

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"

Interpretation of T cell states using reference single-cell atlases

Pairwise model for commonlit competition

FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

Provide baselines and evaluation metrics of the task: traffic flow prediction

A toolset of Python programs for signal modeling and indentification via sparse semilinear autoregressors.

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

The implementation of "Bootstrapping Semantic Segmentation with Regional Contrast".

Adversarial Graph Augmentation to Improve Graph Contrastive Learning

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Blind Image Super-resolution with Elaborate Degradation Modeling on Noise and Kernel

This is an official implementation for "PlaneRecNet".

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Project of 'TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement '

OCR Post Correction for Endangered Language Texts

This code is an unofficial implementation of HiFiSinger.

An official repository for Paper "Uformer: A General U-Shaped Transformer for Image Restoration".