This dataset is released in support of the CCS '20 paper "Lies in the Air: Characterizing Fake-base-station Spam Ecosystem in China".
ACM Reference Format: Yiming Zhang, Baojun Liu, Chaoyi Lu, Zhou Li, Haixin Duan, Shuang Hao, Mingxuan Liu, Ying Liu, Dong Wang and Qiang Li. 2020. Lies in the Air: Characterizing Fake-base-station Spam Ecosystem in China. In Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security (CCS ’20), November 9–13, 2020, Virtual Event, USA. ACM, New York, NY, USA, 14 pages. https://doi.org/10.1145/3372297.3417257
This dataset contains 14K spam messages sent from real-world fake base stations (FBSes) in China, manually labeled by researchers into 14 categories. For privacy reasons, we release a pre-processed version of the dataset in which all contact details in the messages have been anonymized.
We hope this dataset will help other researchers gain a deeper understanding of the FBS spam ecosystem.
When using this dataset, please link to its source and cite the CCS '20 paper above. If you need additional related data or have any questions, please contact zhangyiming@tsinghua.edu.cn. We will respond as soon as possible.