
AI detector research tool

Overview

This tool allows you to test the accuracy of various AI detectors. It is a command-line tool designed to make it easy to run the same data through a large number of detectors at the same time.

Description

The tool takes a set of text files, runs them through a number of AI detectors, and writes the results to a CSV file. It also generates a confusion matrix to summarize the accuracy of the detectors. A confusion matrix is a table used to describe the performance of a classification model: it compares the model's predictions with the actual outcomes, showing the true positives, false positives, true negatives, and false negatives for each detector. This makes it easy to compare detectors and see which are the most accurate.
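For reference, the sketch below (not part of the tool itself) shows how those four counts are tallied when 'ai' is treated as the positive class, matching the 'ai'/'human' labels used in the input data:

    # Minimal sketch of tallying a confusion matrix; 'ai' is the positive class,
    # matching the 'ai'/'human' labels used by this tool's input format.
    def confusion_matrix(actual, predicted, positive="ai"):
        tp = fp = tn = fn = 0
        for truth, guess in zip(actual, predicted):
            if truth == positive and guess == positive:
                tp += 1  # AI text correctly flagged as AI
            elif truth != positive and guess == positive:
                fp += 1  # human text wrongly flagged as AI
            elif truth != positive and guess != positive:
                tn += 1  # human text correctly identified as human
            else:
                fn += 1  # AI text missed by the detector
        return {"TP": tp, "FP": fp, "TN": tn, "FN": fn}

    print(confusion_matrix(["ai", "ai", "human", "human"],
                           ["ai", "human", "human", "ai"]))
    # {'TP': 1, 'FP': 1, 'TN': 1, 'FN': 1}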

Requirements

  • Python 3.9 or higher
  • API keys for the detectors you want to test

Installation

  1. Clone this repository or download the zip file
  2. Install the requirements using pip install -r requirements.txt

Usage

  1. Make a note of your API keys for the detectors you want to test
  2. Run the tool using python main.py
  3. Follow the instructions in the tool, adding your API keys when prompted
  4. The tool will run the detectors and output the results to a CSV file

Sample workflow:

python main.py
Type Y/N to select Originality.ai API: y
Enter your Originality.ai API key: YOUR_API_KEY
Enter the directory path for AI text files: data/ai/
Enter the directory path for human text files: data/human/
Enter the input CSV file path: data/input.csv
Enter the output CSV file name: output.csv

Tool will process the data. This may take a while.
Would you like to generate a confusion matrix? (y/n): y
Press enter to exit...

Usage notes

  • The tool will only run the detectors you have API keys for
  • If you are not prompted to generate a confusion matrix when the tool finishes, or the generation fails, run python matrix.py to generate it

Input Data Format

The tool expects data as .txt files in a folder whose path is passed to the tool when it is run. Alternatively, if you are processing a CSV file, the tool expects the columns to be in the following order:

text,dataset,label
sample text,gpt-3,ai

The dataset column can simply be 'ai' or 'human'; this column is used to name the rows in the output.
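If your text is currently sitting in folders of .txt files and you would rather feed the tool a CSV, a short script along these lines can assemble one in the expected column order. This is a sketch, not part of the tool; the folder paths, dataset values, and output file name are placeholders:

    # Assemble an input CSV in the text,dataset,label column order described above.
    # Folder paths, dataset values, and the output file name are placeholders.
    import csv
    from pathlib import Path

    rows = []
    for folder, label in [("data/ai", "ai"), ("data/human", "human")]:
        for path in sorted(Path(folder).glob("*.txt")):
            rows.append({
                "text": path.read_text(encoding="utf-8").strip(),
                "dataset": label,   # or a source name such as "gpt-3"
                "label": label,
            })

    with open("data/input.csv", "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["text", "dataset", "label"])
        writer.writeheader()
        writer.writerows(rows)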

Adding detectors

To add a detector you need to do the following:

  1. Find the detector's API documentation
  2. Find the endpoint for the detector
  3. Find the parameters required for the endpoint
  4. Add the detector to the detectors.py file in the following format:
    "post_parameters": {
        # The endpoint URL for the API.
        "endpoint": "YOUR_API_ENDPOINT_URL",

        # The body of the POST request. This usually contains the text to be analyzed.
        # The actual contents will depend on what the API expects.
        # Add or remove parameters as needed depending on the API requirements.
        "body": {"PARAMETER_NAME": "PARAMETER_VALUE"},

        # The headers for the POST request. This usually includes the API key and content type.
        # Add or remove headers as needed depending on the API requirements.
        "headers": {"HEADER_NAME": "HEADER_VALUE"},

        # Information about where the API key is included in the request.
        "API_KEY_POINTER": {
            # The location where the API key ends up (usually 'headers' or 'body').
            "location": "headers_or_body",

            # The actual API key. This is usually read from an environment variable or input by the user.
            "value": "YOUR_API_KEY",

            # The name of the key or field where the API key is included, e.g. 'x-api-key' or 'api_key'.
            "key_name": "API_KEY_HEADER_OR_PARAMETER_NAME",
        },

        # The key in the body of the POST request where the text to be analyzed is included, e.g. 'text' or 'content'.
        "text_key": "KEY_NAME_FOR_TEXT",
    },

    "response": {
        # The expected response from the API. The actual structure will depend on what the API returns.
        # This should include mappings for how to interpret the API's response.
        # Add or remove mappings as needed.
        # e.g. if the API returns a JSON object with a key called 'result' and the value of 'result' is a list of objects
        # with a key called 'score' then the mapping would be:
        # "result": {
        #     "score": "score"
        # }
        "200": {
            "result": {
                "MAPPING_FOR_DESIRED_OUTPUT": "RESPONSE_KEY_PATH",
            }
        }
    },
}
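As a concrete illustration, a hypothetical detector whose API accepts a JSON body with a "content" field, authenticates with an x-api-key header, and returns a 'score' inside a 'result' object might be configured roughly as follows. The detector name, endpoint, field names, and response mapping here are illustrative assumptions, not any real detector's API:

"example_detector": {
    "post_parameters": {
        "endpoint": "https://api.example.com/v1/detect",
        "body": {"content": ""},
        "headers": {"Content-Type": "application/json"},
        "API_KEY_POINTER": {
            "location": "headers",
            "value": "YOUR_API_KEY",
            "key_name": "x-api-key",
        },
        "text_key": "content",
    },
    "response": {
        "200": {
            "result": {
                "score": "score",
            }
        }
    },
},

The exact keys under "response" depend on the JSON the API actually returns; the comments in the template above describe how to adapt the mapping.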

Links to API docs for detectors

  • Originality.ai DOCS - to specify a particular version, check the docs and add it in the appropriate place in the api_endpoints.py file
  • Sapling.ai DOCS
  • GPTZero DOCS - to specify a particular version, check the docs and add it in the appropriate place in the api_endpoints.py file
  • Writer.com DOCS
  • Copyleaks DOCS - please follow Copyleaks' instructions for setting up the API key, as it is a bit more involved than for the other detectors

Contributing

We welcome contributions to this project! Here are some ways you can help:

Reporting Bugs

If you find a bug, please report it by opening a GitHub issue. Be sure to include:

  • Steps to reproduce the bug
  • Expected behavior
  • Actual behavior

This information will help us diagnose and fix the bug faster.

Suggesting Enhancements

We're always looking for ways to improve the tool! If you have an idea for an enhancement, open a GitHub issue and describe:

  • The current behavior
  • Your proposed change and why it would be useful
  • An example use case

Pull Requests

If you want to directly contribute code:

  1. Fork the repo
  2. Clone your fork
  3. Make changes on a branch
  4. Write clear, concise commit messages
  5. Open a pull request against main

Ensure your PR adheres to the following:

  • Code is clean and well-formatted
  • Documentation is updated if needed
  • Commit messages are clear and detailed

Sharing Your Experience

We want to hear about your experience using the tool - good and bad. Let us know what worked and what didn't. Share stories of how this tool has helped your research. The more we hear from you, the better we can make the tool for everyone!

Thanks for contributing!

License

MIT License

Copyright (c) [2023] [Originality.AI]

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
