Artificial social media campaign creation for benchmarking and challenging detection approaches

Janina S Pohl, Dennis Assenmacher, Moritz V Seiler, Heike Trautmann, Christian Grimme

January, 2022

Abstract

Social media platforms are essential for information sharing and, thus, prone to coordinated dis- and misinformation campaigns. Nevertheless, research in this area is hampered by strict data sharing regulations imposed by the platforms, resulting in a lack of benchmark data. Previous work focused on circumventing these rules by either pseudonymizing the data or sharing fragments. In this work, we will address the benchmarking crisis by presenting a methodology that can be used to create artificial campaigns out of original campaign building blocks. We conduct a proof-of-concept study using the freely available generative language model GPT-Neo in this context and demonstrate that the campaign patterns can flexibly be adapted to an underlying social media stream and evade state-of-the-art campaign detection approaches based on stream clustering. Thus, we not only provide a framework for artificial benchmark generation but also demonstrate the possible adversarial nature of such benchmarks for challeng- ing and advancing current campaign detection methods.

Type

Conference paper

Publication

Proceedings of the 16th International Conference on Web and Social Media. NEATCLasS, Association for the Advancement of Artificial Intelligence (AAI), Hybrid: Atlanta, Georgia, US and Online.

Dennis Assenmacher

PostDoc

My research interests include distributed robotics, mobile computing and programmable matter.