python library for invisible image watermark (blind image watermark)

Last update: Jan 07, 2023

Overview

invisible-watermark

invisible-watermark is a python library and command line tool for creating invisible watermark over image.(aka. blink image watermark, digital image watermark). The algorithm doesn't reply on the original image.

Note that this library is still experimental and it doesn't support GPU acceleration, carefully deploy it on the production environment. The default method dwtDCT(one variant of frequency methods) is ready for on-the-fly embedding, the other methods are too slow on a CPU only environment.

supported algorithms

Discrete wavelet transform + Discrete cosine transform frequency embedding algorithm variants.
RivaGAN, a deep-learning model trained from Hollywood2 movie clips dataset.

speed

default embedding method dwtDct is fast and suitable for on-the-fly embedding
dwtDctSvd is 3x slower and rivaGan is 10x slower, for large image they are not suitable for on-the-fly embedding

accuracy

The algorithm cannot gurantee to decode the original watermarks 100% accurately even though we don't apply any attack.
Known defects: Test shows all algorithms do not perform well for web page screenshots or posters with homogenous background color

Supported Algorithms

frequency methods

dwtDct: DWT + DCT transform, embed watermark bit into max non-trivial coefficient of block dct coefficents

dwtDctSvd: DWT + DCT transform, SVD decomposition of each block, embed watermark bit into singular value decomposition

rivaGan: encoder/decoder model with Attention mechanism + embed watermark bits into vector.

background:

Discrete wavelet transform

Discrete cosine transform.

RivaGAN, a deep-learning model trained from Hollywood2 movie clips dataset.

How to install

pip install invisible-watermark

Library API

Embed watermark

example embed 4 characters (32 bits) watermark

import cv2
from imwatermark import WatermarkEncoder

bgr = cv2.imread('test.png')
wm = 'test'

encoder = WatermarkEncoder()
encoder.set_watermark('bytes', wm.encode('utf-8'))
bgr_encoded = encoder.encode(bgr, 'dwtDct')

cv2.imwrite('test_wm.png', bgr_encoded)

Decode watermark

example decode 4 characters (32 bits) watermark

import cv2
from imwatermark import WatermarkDecoder

bgr = cv2.imread('test_wm.png')

decoder = WatermarkDecoder('bytes', 32)
watermark = decoder.decode(bgr, 'dwtDct')
print(watermark.decode('utf-8'))

CLI Usage

embed watermark:  ./invisible-watermark -v -a encode -t bytes -m dwtDct -w 'hello' -o ./test_vectors/wm.png ./test_vectors/original.jpg

decode watermark: ./invisible-watermark -v -a decode -t bytes -m dwtDct -l 40 ./test_vectors/wm.png

positional arguments:
  input                 The path of input

optional arguments:
  -h, --help            show this help message and exit
  -a ACTION, --action ACTION
                        encode|decode (default: None)
  -t TYPE, --type TYPE  bytes|b16|bits|uuid|ipv4 (default: bits)
  -m METHOD, --method METHOD
                        dwtDct|dwtDctSvd|rivaGan (default: maxDct)
  -w WATERMARK, --watermark WATERMARK
                        embedded string (default: )
  -l LENGTH, --length LENGTH
                        watermark bits length, required for bytes|b16|bits
                        watermark (default: 0)
  -o OUTPUT, --output OUTPUT
                        The path of output (default: None)
  -v, --verbose         print info (default: False)

Test Result

For better doc reading, we compress all images in this page, but the test is taken on 1920x1080 original image.

Methods are not robust to resize or aspect ratio changed crop but robust to noise, color filter, brightness and jpg compress.

rivaGan outperforms the default method on crop attack.

only default method is ready for on-the-fly embedding.

Input

Input Image: 1960x1080 Image

Watermark:

For freq method, we use 64bits, string expression "qingquan"

For RivaGan method, we use 32bits, string expression "qing"

Parameters: only take U frame to keep image quality, scale=36

Attack Performance

Watermarked Image

Attacks	Freq Method	RivaGan
JPG Compress	Pass	Pass
Noise	Pass	Pass
Brightness	Pass	Pass
Overlay	Pass	Pass
Mask	Pass	Pass
crop 7x5	Fail	Pass
Resize 50%	Fail	Fail
Rotate 30 degress	Fail	Fail

Running Speed (CPU Only)

Image	Method	Encoding	Decoding
1920x1080	dwtDct	300-350ms	150ms-200ms
1920x1080	dwtDctSvd	1500ms-2s	~1s
1920x1080	rivaGan	~5s	4-5s
600x600	dwtDct	70ms	60ms
600x600	dwtDctSvd	185ms	320ms
600x600	rivaGan	1s	600ms

RivaGAN Experimental

Further, We will deliver the 64bit rivaGan model and test the performance on GPU environment.

Detail: https://github.com/DAI-Lab/RivaGAN

Zhang, Kevin Alex and Xu, Lei and Cuesta-Infante, Alfredo and Veeramachaneni, Kalyan. Robust Invisible Video Watermarking with Attention. MIT EECS, September 2019.[PDF]

You might also like...

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution

DASR Pytorch implementation of "Unsupervised Degradation Representation Learning for Blind Super-Resolution", CVPR 2021 [arXiv] Overview Requirements

318 Dec 24, 2022

[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior

pytorch-deep-video-prior (DVP) Official PyTorch implementation for NeurIPS 2020 paper: Blind Video Temporal Consistency via Deep Video Prior TensorFlo

90 Oct 19, 2022

DAN: Unfolding the Alternating Optimization for Blind Super Resolution

DAN-Basd-on-Openmmlab DAN: Unfolding the Alternating Optimization for Blind Super Resolution We reproduce DAN via mmediting based on open-sourced code

72 Dec 13, 2022

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Real-ESRGAN Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data Ported from https://github.com/xinntao/Real-ESRGAN Depend

44 Dec 27, 2022

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

282 Jan 2, 2023

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021) Jiaxi Jiang, Kai Zhang, Radu Timofte Computer Vision Lab, ETH Zurich, Switzerland 🔥

80 Sep 30, 2021

Implementation of temporal pooling methods studied in [ICIP'20] A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment

5 Sep 16, 2022

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis [Paper] [Online Demo] The following results are obtained by our SCUNet with purely syn

312 Jan 7, 2023

Fast image augmentation library and easy to use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about library: https://www.mdpi.com/2078-2489/11/2/125

Albumentations Albumentations is a Python library for image augmentation. Image augmentation is used in deep learning and computer vision tasks to inc

11.4k Jan 9, 2023

Comments

Potentially performance issues

When using an image larger than 1MB the performance degradates quickly. What we observe is that with an image of 1920 × 1080 the performance is great, but using an image of 9504 × 6336 inside a container with 20GB of RAM after ~40 minutes the flask repository we put on top of the library crashes because the container is OOM. Is there a way to improve performance in this sense?

opened by luca-simonetti 0

CLI decode doesn't work if output image is JPG

I'm trying to use as CLI and python script to generate a wmrked JPG but watermark decode doesn't show anything:

C:\Users\me\AppData\Local\Programs\Python\Python310\Scripts>py invisible-watermark "F:\JPEG\_DSC5341.jpg" -v -a encode -t bytes -m dwtDct -w '1234' -o "F:\JPEG\_DSC5341-w.jpg"
watermark length: 48
encode time ms: 2819.3318843841553

C:\Users\me\AppData\Local\Programs\Python\Python310\Scripts>py invisible-watermark "F:\JPEG\_DSC5341-w.jpg" -v -a decode -t bytes -m dwtDct -l 48
decode time ms: 1944.9546337127686

It's like there is no watermark impressed in it, unless I use a PNG as output. I posted the images I'm using for test purpouses.

raw img test wm img test_wm

opened by TheNemus 0

Example code not working

encoded the image, then decoding returned nothing, not "test" like expected.

edit: tried with a different png image: Traceback (most recent call last): File "/home/me/whisper/decode.py", line 8, in print(watermark.decode('utf-8')) UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte

opened by ClashSAN 0
How does this work?

I think a quick blurb about how the watermarks implemented by this package work would be helpful. Is it the pixel rounding that I can read about here? https://invisiblewatermark.net/how-invisible-watermarks-work.html

opened by kevinlinxc 0

python library for invisible image watermark (blind image watermark)

Related tags

Overview

invisible-watermark

Supported Algorithms

How to install

Library API

Embed watermark

Decode watermark

CLI Usage

Test Result

Input

Attack Performance

Running Speed (CPU Only)

RivaGAN Experimental

You might also like...

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution

[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior

DAN: Unfolding the Alternating Optimization for Blind Super Resolution

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Implementation of temporal pooling methods studied in [ICIP'20] A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Fast image augmentation library and easy to use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about library: https://www.mdpi.com/2078-2489/11/2/125

Comments

Potentially performance issues

CLI decode doesn't work if output image is JPG

Example code not working

How does this work?

Releases(0.1.5)

0.1.5(Feb 1, 2021)

0.1.3(Jan 31, 2021)

Owner

Shield Mountain

[CIKM 2019] Code and dataset for "Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction"

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

Malware Bypass Research using Reinforcement Learning

Convex optimization for fun and profit.

The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.

Implementation of Gans

Multi-objective gym environments for reinforcement learning.

Code release for Local Light Field Fusion at SIGGRAPH 2019

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

Character Controllers using Motion VAEs

ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

Generic ecosystem for feature extraction from aerial and satellite imagery

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

SegNet including indices pooling for Semantic Segmentation with tensorflow and keras

Hardware-accelerated DNN model inference ROS2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU

A library for implementing Decentralized Graph Neural Network algorithms.

The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

General Multi-label Image Classification with Transformers