Automatic meme generation model using Tensorflow Keras.

Last update: Jan 13, 2022

Overview

Memefly

You can find the project at MemeflyAI.

Contributors

Nick Buukhalter	Harsh Desai	Han Lee

Project Overview

Trello Board

Product Canvas

Automatic meme generation model using Tensorflow Keras. The model is Dockerized and served as a REST API through a FastAPI/uvicorn ASGI endpoint. A separate deployment combines a FastAPI/uvicorn ASGI endpoint with models served by Tensorflow Serving on Sagemaker.

Tech Stack

Python Packages

Numpy
Pandas
Tensorflow
FastAPI
Selenium

DevOps

Tensorflow Serving
Docker
MySQL
MongoDB
AWS ECR
AWS Elastic Beanstalk
AWS S3
AWS Sagemaker

Architecture

Predictions

We used an encoder-decoder architecture for the meme generation task. A pre-trained Inception V3 network (architecture and weights) serves as the encoder and extracts embeddings from the input image. In parallel, we encode the text into text embeddings and concatenate them with the image embeddings. For the decoder, we used a GRU that maps the combined image and text embeddings to a prediction of the next word in the text string.

At training time, we repeat the same image embeddings as input and send in text sequences in order, e.g., 0. this, 1. this is, 2. this is a, 3. this is a sequence. The model will try to predict the next word in the sequence given the input image embedding and text embeddings. We denote the beginning and the end of a text sequence with startseq and endseq.
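The prefix expansion described above can be sketched in plain Python. This is only an illustration under assumptions: the function name `make_training_pairs` and the whitespace tokenization are hypothetical, and the project's notebooks may build the sequences differently (for example, with integer token ids and padding).

```python
from typing import List, Tuple

START, END = "startseq", "endseq"


def make_training_pairs(caption: str) -> List[Tuple[List[str], str]]:
    """Expand one caption into (prefix, next_word) training pairs.

    Hypothetical helper: whitespace tokenization is an assumption.
    """
    tokens = [START] + caption.split() + [END]
    # Pair every prefix tokens[:i] with the word that follows it.
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]


pairs = make_training_pairs("this is a sequence")
for prefix, target in pairs:
    # Each pair would be fed to the model together with the same image embedding.
    print(" ".join(prefix), "->", target)
```

Every pair produced from one caption shares the same image embedding, which is why the image input is repeated once per prefix.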

At inference time, we send the image embeddings and the seed token startseq to the model, then repeatedly send in the image embeddings together with the prediction output of the previous timestep, until we either see endseq or reach the maximum sentence length. To improve the quality of the output, we used beam search to keep the N highest-scoring candidate sentences at each step. Note that beam search is a heuristic: it is neither an optimal nor a complete algorithm.
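A minimal sketch of this decoding loop follows, assuming a toy stand-in for the trained model. `TOY_MODEL`, `toy_next_word_probs`, and `beam_search` are hypothetical names; in the real pipeline the next-word probabilities would come from the GRU decoder given the image embeddings and the tokens so far. Scores are summed log-probabilities with no length normalization, which is also an assumption.

```python
import math
from typing import Callable, Dict, List, Tuple

START, END = "startseq", "endseq"

# Toy next-word table keyed by the last token (stand-in for the trained model).
TOY_MODEL: Dict[str, Dict[str, float]] = {
    "startseq": {"one": 0.6, "such": 0.4},
    "one": {"does": 0.5, "meme": 0.5},
    "such": {"meme": 0.9, "wow": 0.1},
    "does": {"endseq": 1.0},
    "meme": {"endseq": 1.0},
    "wow": {"endseq": 1.0},
}


def toy_next_word_probs(tokens: List[str]) -> Dict[str, float]:
    return TOY_MODEL[tokens[-1]]


def beam_search(
    next_word_probs: Callable[[List[str]], Dict[str, float]],
    beam_width: int = 3,
    max_len: int = 10,
) -> List[Tuple[float, List[str]]]:
    """Return up to beam_width (log_prob, tokens) hypotheses, best first."""
    beams = [(0.0, [START])]
    finished: List[Tuple[float, List[str]]] = []
    for _ in range(max_len):
        candidates = []
        for score, tokens in beams:
            for word, p in next_word_probs(tokens).items():
                if p > 0.0:
                    candidates.append((score + math.log(p), tokens + [word]))
        if not candidates:
            break
        # Keep only the beam_width best extensions at this step.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for cand in candidates[:beam_width]:
            (finished if cand[1][-1] == END else beams).append(cand)
        if not beams:
            break
    finished.extend(beams)  # hypotheses cut off at max_len
    finished.sort(key=lambda c: c[0], reverse=True)
    return finished[:beam_width]


results = beam_search(toy_next_word_probs, beam_width=2)
greedy = beam_search(toy_next_word_probs, beam_width=1)
for score, tokens in results:
    print(round(math.exp(score), 2), " ".join(tokens))
```

With a beam width of 1 the search degenerates to greedy decoding, which commits to the locally best first word and ends with a lower-probability sentence in this toy table. A wider beam recovers the better sentence here, but in general it can still miss the global optimum.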

To increase variety, we tried (1) adding Gaussian noise to the input image and (2) choosing among the top N sentences scored by beam search.
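The noise step can be illustrated as follows. This is a sketch under assumptions, not the project's code: the helper `add_gaussian_noise` is hypothetical, pixel values are assumed to be scaled to [0, 1], and the default standard deviation is an arbitrary choice. In practice the same idea would normally be applied to a whole image array at once rather than to a Python list.

```python
import random
from typing import List, Optional


def add_gaussian_noise(
    pixels: List[float], sigma: float = 0.05, seed: Optional[int] = None
) -> List[float]:
    """Add zero-mean Gaussian noise to pixel values assumed to lie in [0, 1]."""
    rng = random.Random(seed)
    # Perturb each pixel independently, then clip back into the valid range.
    return [min(1.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in pixels]


pixels = [0.0, 0.25, 0.5, 0.75, 1.0]
noisy = add_gaussian_noise(pixels, sigma=0.05, seed=0)
print(noisy)
```

Because the perturbed image yields slightly different embeddings, repeated calls with different seeds can lead the decoder to different captions for the same meme template.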

The architecture is summarized here:

In-sample Meme

Out-of-sample Meme

Batch Example Outputs

Explanatory Variables

Image
Text

Data Sources

Please see Data Engineering for details.

Python Notebooks

Training Notebook

Inferencing Notebook

How to connect to the web API

Please see Machine Learning Engineering - Deployment for details.

How to connect to the data API

Please see Data Engineering for details.

Contributing

When contributing to this repository, please first discuss the change you wish to make via issue, email, or any other method with the owners of this repository before making a change.

Please note we have a code of conduct. Please follow it in all your interactions with the project.

Issue/Bug Request

If you are having an issue with the existing project code, please submit a bug report under the following guidelines:

Check first to see if your issue has already been reported.
Check to see if the issue has recently been fixed by attempting to reproduce the issue using the latest master branch in the repository.
Create a live example of the problem.
Submit a detailed bug report including your environment & browser, steps to reproduce the issue, actual and expected outcomes, where you believe the issue is originating from, and any potential solutions you have considered.

Feature Requests

We would love to hear from you about new features which would improve this app and further the aims of our project. Please provide as much detail and information as possible to show us why you think your new feature should be implemented.

Pull Requests

If you have developed a patch, bug fix, or new feature that would improve this app, please submit a pull request. It is best to communicate your ideas with the developers first before investing a great deal of time into a pull request to ensure that it will mesh smoothly with the project.

Remember that this project is licensed under the MIT license, and by submitting a pull request, you agree that your work will be, too.

Pull Request Guidelines

Ensure any install or build dependencies are removed before the end of the layer when doing a build.
Update the README.md with details of changes to the interface, including new plist variables, exposed ports, useful file locations and container parameters.
Ensure that your code conforms to our existing code conventions and test coverage.
Include the relevant issue number, if applicable.
You may merge the Pull Request in once you have the sign-off of two other developers, or if you do not have permission to do that, you may request the second reviewer to merge it for you.

Attribution

These contribution guidelines have been adapted from this good-Contributing.md-template.

Documentation

See Data Engineering for details on the data engineering of our project.

See Machine Learning Engineering - Training for details on the training part of our project.

See Machine Learning Engineering - Deployment for details on the deployment of our project.

Owner

BloomTech Labs
