This repo contains the notebook and generated texts created for DAGPap22: Detecting Automatically Generated Scientific Papers, a shared task at SDP 2022 (COLING 2022). Our work compares several transformer-based models and explores additional datasets and techniques for handling imbalanced classes. As our final submission, we used an ensemble of SciBERT, RoBERTa, and DeBERTa fine-tuned with random oversampling. The model achieved an F1-score of 99.24%, and the official evaluation placed our system third.
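For readers unfamiliar with the technique, random oversampling duplicates minority-class examples at random until every class matches the majority class in size. A minimal sketch (the function name and details here are ours, not code from this repo):

```python
import random

def random_oversample(texts, labels, seed=0):
    """Duplicate minority-class examples at random until every class
    is as frequent as the largest one (random oversampling)."""
    rng = random.Random(seed)
    by_class = {}
    for text, label in zip(texts, labels):
        by_class.setdefault(label, []).append(text)
    target = max(len(items) for items in by_class.values())
    out_texts, out_labels = [], []
    for label, items in by_class.items():
        extra = rng.choices(items, k=target - len(items))
        for text in items + extra:
            out_texts.append(text)
            out_labels.append(label)
    return out_texts, out_labels
```

Oversampling is applied to the training split only, so the duplicated examples never leak into evaluation.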
The corresponding paper is: Anna Glazkova, Maksim Glazkov. 2022. Detecting Generated Scientific Papers using an Ensemble of Transformer Models. In Proceedings of the Third Workshop on Scholarly Document Processing. Association for Computational Linguistics.
The repository includes:
- our code for fine-tuning BERT-based models;
- additional data we generated using back translation and GPT-2.
These pickle files can be loaded with:

```python
import pickle

with open('backtranslation_kp20k.pickle', 'rb') as f:
    generated_texts = pickle.load(f)
```
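As a rough illustration of how the back translation data was produced, the sketch below paraphrases a text by translating it into a pivot language and back. The translator callables are placeholders (our assumption, not this repo's code); in practice they would wrap real translation models, e.g. MarianMT pipelines from Hugging Face Transformers.

```python
def back_translate(text, translate_fwd, translate_back):
    """Paraphrase `text` via back translation: translate it into a
    pivot language, then translate the result back into the source
    language. Wording tends to drift, yielding a label-preserving
    paraphrase usable as augmented training data.

    `translate_fwd` / `translate_back` are placeholder callables;
    real usage would plug in actual translation models.
    """
    pivot = translate_fwd(text)
    return translate_back(pivot)
```

With real models, `translate_fwd` might translate English to German and `translate_back` German to English.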