J-Guard

Journalism Guided Adversarially Robust Detection of AI-generated News

Abatract

The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape. Among various types of AI-generated text, AI-generated news presents a significant threat as it can be a prominent source of misinformation online. While several recent efforts have focused on detecting AI-generated text in general, these methods require enhanced reliability, given concerns about their vulnerability to simple adversarial attacks. Furthermore, due to the eccentricities of news writing, applying these detection methods for AI-generated news can produce false positives, potentially damaging the reputation of news organizations. To address these challenges, we leverage the expertise of an interdisciplinary team to develop a framework, J-Guard, capable of steering existing supervised AI text detectors for detecting AI-generated news while boosting adversarial robustness. By incorporating stylistic cues inspired by the unique journalistic attributes, J-Guard effectively distinguishes between real-world journalism and AI-generated news articles. Our experiments on news articles generated by a vast array of AI models, including ChatGPT (GPT3.5), demonstrate the effectiveness of J-Guard in enhancing detection capabilities while maintaining an average performance decrease of as low as 7% when faced with adversarial attacks. Read more: Link

Instructions

Code for journalism feature extraction can be found in the notebook
Main framework code is available in this notebook
Refer the Section 8 of the paper about data accessing

Conference

This Paper is Accepted to The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL 2023)

Citation

@article{kumarage2023j,
 title={J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News},
 author={Kumarage, Tharindu and Bhattacharjee, Amrita and Padejski, Djordje and Roschke, Kristy and Gillmor, Dan and Ruston, Scott and Liu, Huan and Garland, Joshua},
 journal={arXiv preprint arXiv:2309.03164},
 year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
J-Guard Framework.ipynb		J-Guard Framework.ipynb
Journalism_Feature_Extractor.ipynb		Journalism_Feature_Extractor.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

J-Guard Framework.ipynb

J-Guard Framework.ipynb

Journalism_Feature_Extractor.ipynb

Journalism_Feature_Extractor.ipynb

LICENSE

LICENSE

README.md

README.md

Repository files navigation

J-Guard

Abatract

Instructions

Conference

Citation

About

Releases

Packages

Languages

License

TSKumarage/J-Guard

Folders and files

Latest commit

History

Repository files navigation

J-Guard

Abatract

Instructions

Conference

Citation

About

Resources

License

Stars

Watchers

Forks

Languages