A bot, searching for plagiarism on Stack Overflow.
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.settings
data
src/main
.classpath
.gitignore
.project
.travis.yml
CONTRIBUTING.md
LICENSE Create LICENSE Jan 14, 2018
PULL_REQUEST_TEMPLATE.md
README.md
dependency-reduced-pom.xml
feedback.md
pom.xml

README.md

Build Status

What is Guttenberg?

Guttenberg is a bot that searches for plagiarism or duplicated answers on Stack Overflow. It's currently running in SOBotics under the user Guttenberg.

Implementation

Every 60 seconds, Guttenberg fetches the most recent answers (the "targets") on Stack Overflow. For each of these answers, possibly related posts (for example answers to related questions) are collected. All those posts will be checked for different characteristics (such as the Jaro-Winkler distance of the posts). If at least one of the characteristics meets the requirements, a message like this will be posted in chat:

sample chat message

What to do with the reports

In this file, we collected information on how we react to Guttenberg's reports such as comments we leave or if and how we flag.

Accuracy

We are already collecting data with CopyPastor to provide statisctics, but since there are not that many posts to report, it will take a while until we have enough data.