This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis

Last update: Sep 30, 2022

Related tags

Deep Learning TFR-Net

Overview

This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis, accepted at ACMMM 2021.

Note: We strongly recommend that you browse the overall structure of our code at first. If you have any question, feel free to contact us.

Support Models

In this framework, we support the following methods:

Type	Model Name	From
Baselines	TFN	Tensor-Fusion-Network
Baselines	MulT(without CTC)	Multimodal-Transformer
Baselines	MISA	MISA
Missing-Task	TFR-Net	TFR-Net

Usage

Clone this repo and install requirements.

git clone https://github.com/Columbine21/TFR-Net.git
cd TFR-Net

Data Preprocessing

Download datasets from the following links.

MOSI

download from CMU-MultimodalSDK

SIMS

download from Baidu Yun Disk [code: ozo2] or Google Drive
Notes: Please download new features CH_SIMS_unaligned_39.pkl from Baidu Yun Disk [code: g63s] or Google Drive, which is compatible with our new code structure. The md5 code is a5b2ed3844200c7fb3b8ddc750b77feb.

Download Bert-Base, Chinese from Google-Bert.
Convert Tensorflow into pytorch using transformers-cli
Install python dependencies
Organize features and save them as pickle files with the following structure.

Notes: CH_SIMS_unaligned_39.pkl is compatible with the following structure

Dataset Feature Structure

0) "regression_labels": [] }, "valid": {***}, # same as the "train" "test": {***}, # same as the "train" } ">

{
    "train": {
        "raw_text": [],
        "audio": [],
        "vision": [],
        "id": [], # [video_id$_$clip_id, ..., ...]
        "text": [],
        "text_bert": [],
        "audio_lengths": [],
        "vision_lengths": [],
        "annotations": [],
        "classification_labels": [], # Negative(< 0), Neutral(0), Positive(> 0)
        "regression_labels": []
    },
    "valid": {***}, # same as the "train" 
    "test": {***}, # same as the "train"
}

Modify config/config_regression.py to update dataset pathes.

Run

sh test.sh

Paper

Please cite our paper if you find our work useful for your research:

@inproceedings{yu2020ch,
  title={CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotation of Modality},
  author={Yu, Wenmeng and Xu, Hua and Meng, Fanyang and Zhu, Yilin and Ma, Yixiao and Wu, Jiele and Zou, Jiyun and Yang, Kaicheng},
  booktitle={Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics},
  pages={3718--3727},
  year={2020}
}

@inproceedings{yuan2021transformer,
  title={Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis},
  author={Yuan, Ziqi and Li, Wei and Xu, Hua and Yu, Wenmeng},
  booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
  pages={4400--4407},
  year={2021}
}

This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis

Related tags

Overview

Support Models

Usage

Data Preprocessing

Dataset Feature Structure

Run

Paper

Owner

Ziqi Yuan

Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"

Adds timm pretrained backbone to pytorch's FasterRcnn model

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning

QuadTree Attention for Vision Transformers (ICLR2022)

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

Weakly Supervised Scene Text Detection using Deep Reinforcement Learning

Narya API allows you track soccer player from camera inputs, and evaluate them with an Expected Discounted Goal (EDG) Agent

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

PECOS - Prediction for Enormous and Correlated Spaces

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Scene-Text-Detection-and-Recognition (Pytorch)

PyTorch Implementation for Fracture Detection in Wrist Bone X-ray Images

Code release for NeurIPS 2020 paper "Co-Tuning for Transfer Learning"

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

SmartSim Infrastructure Library.

MetaTTE: a Meta-Learning Based Travel Time Estimation Model for Multi-city Scenarios

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes (AAAI2022)

CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

AI Face Mesh: This is a simple face mesh detection program based on Artificial intelligence.

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”