Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Last update: Aug 09, 2022

Related tags

Overview

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Code for the paper Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning (TMM 2021).

Introduction

Automatic typography is important because it helps designers avoid highly repetitive tasks and amateur users achieve high-quality textual layout designs. However, there are often many parameters and complicated aesthetic rules that need to be adjusted in automatic typography work. In this paper, we propose an efficient deep aesthetics learning approach to generate harmonious textual layout over natural images, which can be decomposed into two stages, saliency-aware text region proposal and aesthetics-based textual layout selection. Our method incorporates both semantic features and visual perception principles. First, we propose a semantic visual saliency detection network combined with a text region proposal algorithm to generate candidate text anchors with various positions and sizes. Second, a discriminative deep aesthetics scoring model is developed to assess the aesthetic quality of the candidate textual layouts. The results demonstrate that our method can generate harmonious textual layouts in various actual scenarios with better performance.

Dependencies and Installation

Python 3
PyTorch >= 1.0

Notes of compilation

For Python3 users, before you start to build the source code and install the packages, please specify the architecture of your GPU card and CUDA_HOME path in both ./roi_align/make.sh and ./rod_align/make.sh
Build and install by running:
```
bash make_all.sh
```

Usage

Download the source code and the pretrained models: gdi-basnet and SMT.
Make sure your device is CUDA enabled. Build and install source code of roi_align_api and rod_align_api.
Run SmartText_demo.py to test the pretrained model on your images.
```
python SmartText_demo.py -opt test_opt.yml
```

Acknowledgement

This work is the extension of our conference version (ICME 2020). Some codes of this repository benefit from BASNet and GAIC. Thanks for their excellent work!

Citation

If you find this work useful, please cite our paper:

@article{li2021harmonious,
    title     = {Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning},
    author    = {Li, Chenhui and Zhang, Peiying and Wang, Changbo},
    journal   = {IEEE Transactions on Multimedia},
    year      = {2021},
    publisher = {IEEE}
}

Contact

If you have any question, contact us through email at [email protected].

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Related tags

Overview

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Introduction

Dependencies and Installation

Notes of compilation

Usage

Acknowledgement

Citation

Contact

Owner

Code for GNMR in ICDE 2021

Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

[AI6101] Introduction to AI & AI Ethics is a core course of MSAI, SCSE, NTU, Singapore

MPLP: Metapath-Based Label Propagation for Heterogenous Graphs

A light-weight image labelling tool for Python designed for creating segmentation data sets.

The Python ensemble sampling toolkit for affine-invariant MCMC

Retrieve and analysis data from SDSS (Sloan Digital Sky Survey)

🚩🚩🚩

PyTorch code for the NAACL 2021 paper "Improving Generation and Evaluation of Visual Stories via Semantic Consistency"

Predict and time series avocado hass

Supervised 3D Pre-training on Large-scale 2D Natural Image Datasets for 3D Medical Image Analysis

Deduplicating Training Data Makes Language Models Better

A fast model to compute optical flow between two input images.

Jittor Medical Segmentation Lib -- The assignment of Pattern Recognition course (2021 Spring) in Tsinghua University

Clean and readable code for Decision Transformer: Reinforcement Learning via Sequence Modeling

A pytorch implementation of faster RCNN detection framework (Use detectron2, it's a masterpiece)

Mosaic of Object-centric Images as Scene-centric Images (MosaicOS) for long-tailed object detection and instance segmentation.

Training, generation, and analysis code for Learning Particle Physics by Example: Location-Aware Generative Adversarial Networks for Physics

Trash Sorter Extraordinaire is a software which efficiently detects the different types of waste in a pile of random trash through feeding it pictures or videos.

Implementation of Axial attention - attending to multi-dimensional data efficiently