Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Defence - Filtering #44

Closed
heatherlogan-scottlogic opened this issue Jul 20, 2023 · 4 comments · Fixed by #191
Closed

Defence - Filtering #44

heatherlogan-scottlogic opened this issue Jul 20, 2023 · 4 comments · Fixed by #191
Labels
backend Requires work on the backend enhancement New feature or request frontend Requires work on the frontend
Milestone

Comments

@heatherlogan-scottlogic
Copy link
Contributor

Use a blocklist of words and phrases to check user input for potential malicious content

https://learnprompting.org/docs/prompt_hacking/defensive_measures/filtering

Each defence should include the following:

A frontend component for the defensive measure on the left side bar.
Checkbox to toggle the defensive measure.
Some way to get a description of the defensive measure.
Pulsing the defensive measure component when it captures malicious content.

@heatherlogan-scottlogic heatherlogan-scottlogic added enhancement New feature or request frontend Requires work on the frontend backend Requires work on the backend defence labels Jul 20, 2023
@gsproston-scottlogic gsproston-scottlogic modified the milestone: Phase 2 Aug 10, 2023
@gsproston-scottlogic
Copy link
Contributor

Removed this from the phase 2 milestone as it's not strictly needed.

@heatherlogan-scottlogic heatherlogan-scottlogic added this to the Phase 2 milestone Aug 24, 2023
@heatherlogan-scottlogic
Copy link
Contributor Author

thought this would be an interesting one for phase 2 - e.g. specifically not allowing "budget" etc

@heatherlogan-scottlogic heatherlogan-scottlogic linked a pull request Aug 24, 2023 that will close this issue
@heatherlogan-scottlogic
Copy link
Contributor Author

@gsproston-scottlogic i was thinking of splitting this into two - one for filtering user input and one for bot output - what do you think?

@gsproston-scottlogic
Copy link
Contributor

@gsproston-scottlogic i was thinking of splitting this into two - one for filtering user input and one for bot output - what do you think?

Yeah that makes sense!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend Requires work on the backend enhancement New feature or request frontend Requires work on the frontend
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants