Auto-Lama combines object detection and image inpainting to automate object removals

Last update: Dec 09, 2022

Related tags

Overview

Auto-Lama

Auto-Lama combines object detection and image inpainting to automate object removals. It is build on top of DE:TR from Facebook Research and Lama from Samsung Research. The entire process is extremely simple:

Objects are detected using the detector.
Masks are generated based on the bounding boxes drawn by the detector.
The original image is sent to the inpainter along with the masks.

Demo

Masking

There are currently a few ways of generating masks:

Masking objects with specified indices.
Masking one main object at a time.
Masking all other objects other than the main object.

Future Goals

Use a more precise segmentation method other than bounding boxes
Implementing a detector that has more

Environment Setup

Prerequisites

docker
make
conda

Building Environment

make build-conda-env
conda activate auto-lama
make build-env

Cleaning Directory

make clean

Detect and Inpaint

Setup

The default config for the detector is

PARAMETERS = {
    "model_name": "facebook/detr-resnet-50",
    "threshold": 0.9,
    "max_items": 10,
    "save_destination": "./test_images",
    "output_destination": "./output_images",
    "max_width": 2000,
    "max_height": 2000,
    "resize": True,
    "resize_scale": 0.75,
    "excluded_objects": [91],
    "image_format": "PNG",
    "mask_target_items": [],
}

Please reference here for the target items that you want to mask, as the default DE:TR uses the COCO Dataset,

Run

make detect_and_inpaint IMAGE_PATH=path/to/image or make detect_and_inpaint IMAGE_PATH={image_url}

Auto-Lama combines object detection and image inpainting to automate object removals

Related tags

Overview

Auto-Lama

Demo

Masking

Future Goals

Environment Setup

Prerequisites

Building Environment

Cleaning Directory

Detect and Inpaint

Setup

Run

Owner

4K videos with annotated masks in our ICCV2021 paper 'Internal Video Inpainting by Implicit Long-range Propagation'.

[CVPR 2016] Unsupervised Feature Learning by Image Inpainting using GANs

The mini-AlphaStar (mini-AS, or mAS) - mini-scale version (non-official) of the AlphaStar (AS)

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set Classification

Code for: Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space. Nicholas Monath, Manzil Zaheer, Daniel Silva, Andrew McCallum, Amr Ahmed. KDD 2019.

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.

A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. (CVPR 2021)

This is the official repository of XVFI (eXtreme Video Frame Interpolation)

Social Fabric: Tubelet Compositions for Video Relation Detection

CLIP2Video: Mastering Video-Text Retrieval via Image CLIP

A convolutional recurrent neural network for classifying A/B phases in EEG signals recorded for sleep analysis.

TART - A PyTorch implementation for Transition Matrix Representation of Trees with Transposed Convolutions

Materials for upcoming beginner-friendly PyTorch course (work in progress).

E-Ink Magic Calendar that automatically syncs to Google Calendar and runs off a battery powered Raspberry Pi Zero

DCGAN LSGAN WGAN-GP DRAGAN PyTorch

Stochastic Normalizing Flows