PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Last update: Dec 12, 2022

Related tags

Overview

Maria: A Visual Experience Powered Conversational Agent

This repository is the Pytorch implementation of our paper "Maria: A Visual Experience Powered Conversational Agent" in ACL 2021.

In this paper, we present Maria, a neural conversation agent powered by the visual world experiences which are retrieved from a large-scale image index. Maria consists of three flexible components, i.e., text-to-image retriever, visual concept detector and visual-knowledge-grounded response generator.

Coming soon!

Summary

Maria: A Visual Experience Powered Conversational Agent

Dependencies

python 3.7
pytorch 1.4.0
Ubuntu 18.04

Usage

Citation

If you find this paper helps your research, please kindly consider citing our paper in your publications.

@inproceedings{liang2021maria,
   title={Maria: A Visual Experience Powered Conversational Agent},
   author={Liang, Zujie and Hu, Huang and Xu, Can and Chongyang, Tao and Geng, Xiubo and Chen, Danqi and Liang, Fan and Jiang, Daxin},
   booktitle={Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)},
   year={2021}
}

Acknowledgment

Special thanks to the authors of OSCAR, vokenization, and py-bottom-up-attention.

PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Related tags

Overview

Maria: A Visual Experience Powered Conversational Agent

Summary

Dependencies

Usage

Text-to-Image Retrieval Model

Bottom-up Detector Model

Dialog Generation Model

Citation

Acknowledgment

Owner

Jokie

In this tutorial, you will perform inference across 10 well-known pre-trained object detectors and fine-tune on a custom dataset. Design and train your own object detector.

Hierarchical Attentive Recurrent Tracking

一个目标检测的通用框架(不需要cuda编译)，支持Yolo全系列(v2~v5)、EfficientDet、RetinaNet、Cascade-RCNN等SOTA网络。

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

Simple tools for logging and visualizing, loading and training

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Convenient tool for speeding up the intern/officer review process.

Using deep learning model to detect breast cancer.

N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting

My personal Home Assistant configuration.

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

DeOldify - A Deep Learning based project for colorizing and restoring old images (and video!)

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

A PyTorch implementation of a Factorization Machine module in cython.

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Place holder for HOPE: a human-centric and task-oriented MT evaluation framework using professional post-editing

Algorithmic trading with deep learning experiments

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021

Generative Flow Networks

The world's simplest facial recognition api for Python and the command line