Modularize detection checks #90

ristomcgehee · 2023-12-23T21:24:28Z

This PR refactors the JS SDK to perform the detection checks in a modular manner. This will make it easier to add new checks in the future as well as to customize which checks run by default.

I want to highlight that this is a BREAKING change for the SDK and will be a breaking change to the API once the same changes are made to it. I believe this is an acceptable change because at least in open source GitHub, no one is using the parameters that I am removing. For this code search, the only result of note is a demo notebook in LangChain, and it's only the output that includes the fields I'm removing.

#91 updates files in server along with a few others.

Once #88 is merged, I'd be willing to update the Python SDK to use the modular check logic.

A few notes about the changes in this PR:

Our code currently uses the term "check" which I've changed to "tactic". In programming in general, the term "check" is widely used, so it wouldn't be the best term to surface to external users of this project. I think "tactic" fits well since in the future we will be allowing users to define a collection of "tactics" to make up a "strategy".
Our code currently allows the user at detection time to disable certain checks (tactics) or to provide a different threshold score to determine if prompt injection is detected. With my PR, users retain the ability to do the same via the tacticOverrides parameter.

Part of #13

seanpmorgan

LGTM thanks Risto!

ristomcgehee mentioned this pull request Dec 24, 2023

Update server to match sdk interface for modular checks #91

Draft

4 tasks

ristomcgehee force-pushed the modularize-detection-checks branch from dadc392 to a7458d7 Compare December 24, 2023 05:19

Modularize detection checks

a13af9c

ristomcgehee force-pushed the modularize-detection-checks branch from a7458d7 to a13af9c Compare December 25, 2023 01:47

seanpmorgan added the okay-to-test label Dec 27, 2023

Use npx for tests

0644066

seanpmorgan added okay-to-test and removed okay-to-test labels Jan 10, 2024

ristomcgehee added okay-to-test and removed okay-to-test labels Jan 13, 2024

ristomcgehee force-pushed the modularize-detection-checks branch from fc5a216 to eb21ad9 Compare January 13, 2024 05:23

ristomcgehee added okay-to-test and removed okay-to-test labels Jan 13, 2024

Fix test error

07a8d42

ristomcgehee force-pushed the modularize-detection-checks branch from eb21ad9 to 07a8d42 Compare January 13, 2024 05:26

ristomcgehee added okay-to-test and removed okay-to-test labels Jan 13, 2024

ristomcgehee mentioned this pull request Jan 13, 2024

Tests for Python SDK #92

Merged

seanpmorgan approved these changes Jan 14, 2024

View reviewed changes

seanpmorgan merged commit b54c827 into protectai:main Jan 14, 2024
2 checks passed

This was referenced Jan 19, 2024

Fix JS API client #96

Open

Modularize detection checks Python SDK #103

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modularize detection checks #90

Modularize detection checks #90

ristomcgehee commented Dec 23, 2023 •

edited

Loading

seanpmorgan left a comment

Modularize detection checks #90

Modularize detection checks #90

Conversation

ristomcgehee commented Dec 23, 2023 • edited Loading

seanpmorgan left a comment

Choose a reason for hiding this comment

ristomcgehee commented Dec 23, 2023 •

edited

Loading