[CVPR 2021 Oral] ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

Last update: Dec 22, 2022

Related tags

Deep Learning ForgeryNet

Overview

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

[arxiv|pdf|video|webpage]

Yinan He, Bei Gan, Siyu Chen, Yichun Zhou, Guojun Yin, Luchuan Song, Lu Sheng, Jing Shao, Ziwei Liu

In CVPR 2021

Abstract: The rapid progress of photorealistic synthesis techniques has reached at a critical point where the boundary between real and manipulated images starts to blur. Thus, benchmarking and advancing digital forgery analysis have become a pressing issue. However, existing face forgery datasets either have limited diversity or only support coarse-grained analysis. To counter this emerging threat, we construct the ForgeryNet dataset, an extremely large face forgery dataset with unified annotations in image- and video-level data across four tasks: 1) Image Forgery Classification, including two-way (real / fake), three-way (real / fake with identity-replaced forgery approaches / fake with identity-remained forgery approaches), and n-way (real and 15 respective forgery approaches) classification. 2) Spatial Forgery Localization, which segments the manipulated area of fake images compared to their corresponding source real images. 3) Video Forgery Classification, which re-defines the video-level forgery classification with manipulated frames in random positions. This task is important because attackers in real world are free to manipulate any target frame. and 4) Temporal Forgery Localization, to localize the temporal segments which are manipulated. ForgeryNet is by far the largest publicly available deep face forgery dataset in terms of data-scale (2.9 million images, 221,247 videos), manipulations (7 image-level approaches, 8 video-level approaches), perturbations (36 independent and more mixed perturbations) and annotations (6.3 million classification labels, 2.9 million manipulated area annotations and 221,247 temporal forgery segment labels). We perform extensive benchmarking and studies of existing face forensics methods and obtain several valuable observations.

Dataset is coming soon.

License and Citation

The use of this software is RESTRICTED to non-commercial research and educational purposes.

@article{he2021forgerynet,
  title={ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis},
  author={He, Yinan and Gan, Bei and Chen, Siyu and Zhou, Yichun and Yin, Guojun and Song, Luchuan and Sheng, Lu and Shao, Jing and Liu, Ziwei},
  journal={arXiv preprint arXiv:2103.05630},
  year={2021}
}

[CVPR 2021 Oral] ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

Related tags

Overview

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

License and Citation

Owner

Yinan He

Async API for controlling Hue Lights

CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote Sensing Images

Explainable Medical ImageSegmentation via GenerativeAdversarial Networks andLayer-wise Relevance Propagation

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task

PyTorch implementations of the paper: "DR.VIC: Decomposition and Reasoning for Video Individual Counting, CVPR, 2022"

Based on Stockfish neural network(similar to LcZero)

converts nominal survey data into a numerical value based on a dictionary lookup.

Code and hyperparameters for the paper "Generative Adversarial Networks"

Pytorch version of SfmLearner from Tinghui Zhou et al.

Cleaned test data list of DukeMTMC-reID, ICCV2021

Code for the paper "M2m: Imbalanced Classification via Major-to-minor Translation" (CVPR 2020)

Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation

hySLAM is a hybrid SLAM/SfM system designed for mapping

Determined: Deep Learning Training Platform

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)

Convolutional Neural Network for Text Classification in Tensorflow