Characteristics and prevalence of fake social media profiles with AI-generated faces (2401.02627v2)
Abstract: Recent advancements in generative AI have raised concerns about their potential to create convincing fake social media accounts, but empirical evidence is lacking. In this paper, we present a systematic analysis of Twitter (X) accounts using human faces generated by Generative Adversarial Networks (GANs) for their profile pictures. We present a dataset of 1,420 such accounts and show that they are used to spread scams and spam and to amplify coordinated messages, among other inauthentic activities. Leveraging a feature of GAN-generated faces -- consistent eye placement -- and supplementing it with human annotation, we devise an effective method for identifying GAN-generated profiles in the wild. Applying this method to a random sample of active Twitter users, we estimate a lower bound for the prevalence of profiles using GAN-generated faces between 0.021% and 0.044% -- around 10K daily active accounts. These findings underscore the emerging threats posed by multimodal generative AI. We release the source code of our detection method and the data we collect to facilitate further investigation. Additionally, we provide practical heuristics to assist social media users in recognizing such accounts.
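The eye-placement cue mentioned in the abstract can be illustrated with a minimal sketch. It assumes eye centers have already been extracted by some face landmark detector and normalized to fractions of image width and height. The reference coordinates, the tolerance, and the names `REFERENCE_EYES`, `eye_offset`, and `is_candidate` are hypothetical placeholders for illustration, not values or functions from the paper or its released code. In line with the abstract, a match would only mark a profile as a candidate for human annotation, not as a confirmed GAN-generated face.

```python
from math import hypot

# Hypothetical reference eye centers, as fractions of image width and height.
# Illustrative placeholders only; these are NOT values from the paper.
REFERENCE_EYES = {"left": (0.38, 0.47), "right": (0.62, 0.47)}


def eye_offset(eyes, reference=REFERENCE_EYES):
    """Return the larger of the two distances between detected and reference eye centers."""
    return max(
        hypot(eyes[side][0] - reference[side][0],
              eyes[side][1] - reference[side][1])
        for side in reference
    )


def is_candidate(eyes, tolerance=0.02, reference=REFERENCE_EYES):
    """Flag an image for human review when both eyes lie within `tolerance`
    of the reference positions. The tolerance is an assumed value."""
    return eye_offset(eyes, reference) <= tolerance


# Two made-up examples: one close to the reference layout, one clearly off.
aligned = {"left": (0.381, 0.472), "right": (0.619, 0.468)}
shifted = {"left": (0.30, 0.40), "right": (0.55, 0.42)}

print(is_candidate(aligned))
print(is_candidate(shifted))
```

Taking the maximum over both eyes means a single misplaced eye is enough to reject an image, which keeps the filter strict before any manual check.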