DeepFaceGen: A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

Abstract: With the rapid development of AI-generated content (AIGC) technology, the production of realistic fake facial images and videos that deceive human visual perception has become possible. Consequently, various face forgery detection techniques have been proposed to identify such fake facial content. However, evaluating the effectiveness and generalizability of these detection techniques remains a significant challenge. To address this, we have constructed a large-scale evaluation benchmark called DeepFaceGen, aimed at quantitatively assessing the effectiveness of face forgery detection and facilitating the iterative development of forgery detection technology. DeepFaceGen consists of 776, 990 real face image/video samples and 773, 812 face forgery image/video samples, generated using 34 mainstream face generation techniques. During the construction process, we carefully consider important factors such as content diversity, fairness across ethnicities, and availability of comprehensive labels, in order to ensure the versatility and convenience of DeepFaceGen. Subsequently, DeepFaceGen is employed in this study to evaluate and analyze the performance of 20 mainstream face forgery detection techniques from various perspectives. Through extensive experimental analysis, we derive significant findings and propose potential directions for future research.

Dataset Construction

Image Real Subsets 
├── Celeb-DF
│   ├── 0_celebA-HQ.jpg
│   ├──...
│   ├── 9999_celebA-HQ.jpg
│   ├──00000
│   │  ├── 0_YouTube-real.png
│   ├──...
├── ForgeryNet
│   ├── real000001.jpg
│   ├──...
│   ├── real463101.jpg
├── real_image_labels_relative.csv (It contains the file paths of the image-level real samples in the dataset except for the added celebA-HQ.)
├── real_celebA-HQ_labels_relative.csv (celebA-HQ is the newly added data. It contains the file paths of celebA-HQ samples.)

Image Forged Subsets 
├── SD1
│   ├── output_0.png
│   ├──...
│   ├── output_40319.png
├── tra_FSGAN
│   ├── Output00000.jpg
│   ├──...
│   ├── Output10499.jpg
├──...
├── pix2pix
│   ├── fake00001.png
│   ├──...
│   ├── fake10000.png
├── i2i_image_labels_relative.csv (It contains the file paths of all the image2image forged samples in the dataset.)
├── t2i_image_labels_relative.csv (It contains the file paths of all the text2image forged samples in the dataset.)
├── tra_image_labels_relative.csv (It contains the file paths of all the localized editing-based forged samples in the dataset.)

Video Real Subsets 
├── CMLR
│   ├── *.mp4
├── ForgeryNet
│   ├── *.mp4
├── Celeb-DF
│   ├── *.mp4
├── CN-CVS
│   ├── *.mp4
├── real_video_labels_relative.csv (It contains the file paths of all the video-level real samples in the dataset.)

Video Forged Subsets 
├── Animatediff-rs
│   ├── output_0.mp4
│   ├──...
│   ├── output_40319.mp4
├── tra_FaceShifter
│   ├── Output_2_000000.mp4
│   ├──...
│   ├── Output078721.mp4
├──...
├── t2v_video_labels_relative.csv (It contains the file paths of all the text2video forged samples in the dataset.)
├── tra_video_labels_relative.csv (It contains the file paths of all the localized editing-based forged samples in the dataset.)

Download Data

DeepFaceGen consists of 776, 990 real face image/video samples and 773, 812 face forgery image/video samples. The dataset contains the following four parts.

Image Real Subsets (It is composed of Celeb-DF and ForgeryNet.)
Image Forged Subsets (It is composed of 27 categories generating by mainstream face generation techniques.)
Video Real Subsets (It is composed of CMLR, ForgeryNet, Celeb-DF and CN-CVS.)
Video Forged Subsets (It is composed of 16 categories generating by mainstream face generation techniques.)

The complete dataset can be downloaded at this link.

Benchmark

For code reproducibility, we conducted a comprehensive benchmark performance evaluation using mainstream face forgery detection techniques. All the code is sourced from the following github projects.

DASH-Lab/FakeAVCeleb, they provide executable code for MesoNet, EfficientNet-B0, Xception, and F3-Net.
erprogs/CViT, they provide executable code for CViT.
liangchen527/SLADD, they provide executable code for SLADD.
QingyuLiu/Exposing-the-Deception, they provide executable code for Exposing.
yyk-wew/F3Net, they provide executable code for F3-Net at the image-level.
VISION-SJTU/RECCE, they provide executable code for RECCE.
ICTMCG/DNA-Det, they provide executable code for DNDnet.
SCLBD/DeepfakeBench, they provide executable code for LSDA and TALL.
ZhendongWang6/AltFreezing, they provide executable code for AltFreezing.
ZhendongWang6/DIRE, they provide executable code for DIRE.
beibuwandeluori/DRCT, they provide executable code for DRCT.
chuangchuangtan/NPR-DeepfakeDetection, they provide executable code for NPR.
WisconsinAIVision/UniversalFakeDetect, they provide executable code for UnivFD.

Thank them for their contributions to this project and the field !

License

The use of this project is RESTRICTED to Creative Commons Attribution-NonCommercial 4.0 International Public License (CC BY-NC 4.0). More details about the license could be found in LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
models		models
Composition.png		Composition.png
README.md		README.md
dataset.png		dataset.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeepFaceGen: A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

Dataset Construction

Download Data

Benchmark

License

About

Releases

Packages

Languages

HengruiLou/DeepFaceGen

Folders and files

Latest commit

History

Repository files navigation

DeepFaceGen: A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

Dataset Construction

Download Data

Benchmark

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages