VectorAscent: Generate vector graphics from a textual description

Example

"a painting of an evergreen tree"

python text_to_painting.py --prompt "a painting of an evergreen tree" --num_iter 2500 --use_blob --subdir vit_rn50_useblob

We rely on CLIP for its aligned text and image encoders, and on diffvg, a differentiable vector graphics rasterizer. Differentiable rendering lets us generate raster images from vector paths, but on its own it has no notion of textual descriptions. We use CLIP to score the similarity between raster graphics and textual captions. Using gradient ascent, we can then optimize for a vector graphic whose rasterization has high similarity with a user-provided caption, backpropagating through CLIP and diffvg to the vector graphics parameters. This project is partially inspired by Deep Daze, a caption-guided raster graphics generator.
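The optimization loop described above can be sketched in a few lines. This is a toy illustration only, not the code in text_to_painting.py: `toy_render_and_encode` is a hypothetical identity function standing in for "rasterize with diffvg, then encode with CLIP", the score is assumed to be cosine similarity, and the gradient is written out by hand where the real pipeline would rely on PyTorch autograd.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cosine_similarity_grad(u, v):
    """Analytic gradient of cosine_similarity(u, v) with respect to u."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return [
        b / (norm_u * norm_v) - dot * a / (norm_u ** 3 * norm_v)
        for a, b in zip(u, v)
    ]


def toy_render_and_encode(params):
    # ASSUMPTION: hypothetical stand-in for diffvg rasterization followed by
    # the CLIP image encoder. It is the identity map, so the gradient with
    # respect to the embedding is also the gradient with respect to params.
    return list(params)


def gradient_ascent(text_embedding, init_params, num_iter=200, lr=0.5):
    """Maximize similarity between the encoded 'image' and the text embedding."""
    params = list(init_params)
    history = []
    for _ in range(num_iter):
        image_embedding = toy_render_and_encode(params)
        history.append(cosine_similarity(image_embedding, text_embedding))
        grad = cosine_similarity_grad(image_embedding, text_embedding)
        # Ascent step: move the parameters along the gradient of the score.
        params = [p + lr * g for p, g in zip(params, grad)]
    return params, history


if __name__ == "__main__":
    params, history = gradient_ascent([1.0, 2.0, 2.0], [1.0, 0.0, 0.0])
    print("first score:", history[0], "last score:", history[-1])
```

In the actual project the parameters would be the vector path attributes (for example control points and colors), and the gradient would flow back through both CLIP and diffvg rather than through an identity map.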

Quick start

Requirements:

torch
torchvision
matplotlib
numpy
scikit-image
clip
diffvg

Install our dependencies and CLIP.

conda install --yes -c pytorch pytorch=1.7.1 torchvision cudatoolkit=11.0
pip install ftfy regex tqdm numpy matplotlib scikit-image
pip install git+https://github.com/openai/CLIP.git

Then follow the installation instructions in the diffvg repository to install diffvg.
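After installation, a quick check that the listed requirements are importable can save a failed run later. The sketch below uses only the standard library. The import names are assumptions about how each package is exposed in Python (`skimage` for scikit-image, `pydiffvg` for diffvg), and `check_requirements` is a hypothetical helper, not part of this repository.

```python
import importlib.util

# ASSUMPTION: import names for the packages in the requirements list.
REQUIRED_MODULES = [
    "torch",
    "torchvision",
    "matplotlib",
    "numpy",
    "skimage",   # scikit-image
    "clip",      # OpenAI CLIP
    "pydiffvg",  # diffvg Python bindings
]


def check_requirements(module_names):
    """Return a dict mapping each top-level module name to whether it can be found."""
    return {
        name: importlib.util.find_spec(name) is not None
        for name in module_names
    }


if __name__ == "__main__":
    for name, found in check_requirements(REQUIRED_MODULES).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

This only confirms that each module can be located; it does not import the modules or verify versions or CUDA support.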

Owner

Ajay Jain
