feat: Add prompt injection protection mechanism #28

nextedoff · 2024-08-16T12:22:30Z

Purpose

This PR adds a way to implement new protection methods for the query, starting with prompt injection protection. Injection detection uses the protectai/deberta-v3-base-prompt-injection model.
New models and guard mechanisms can be added to promptprotection.py.
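
To illustrate how additional guard mechanisms might be plugged in, here is a minimal sketch. It is not the code in promptprotection.py, which is not shown in this conversation. The class name `ProtectionRegistry`, the helper `make_injection_check`, the 0.5 threshold, and the `"INJECTION"` label are all assumptions made for illustration. Only the `protections` dictionary name is borrowed from a commit message in this PR. The classifier is passed in as a plain callable so the sketch runs without downloading a model.

```python
from typing import Callable, Dict, List, Tuple

# A check takes the user query and returns True when the query looks unsafe.
Check = Callable[[str], bool]
# A classifier returns (label, score) for a query. This shape is an assumption.
Classifier = Callable[[str], Tuple[str, float]]


class ProtectionRegistry:
    """Hypothetical registry mapping a protection name to its check."""

    def __init__(self) -> None:
        self.protections: Dict[str, Check] = {}

    def register(self, name: str, check: Check) -> None:
        self.protections[name] = check

    def violations(self, query: str, enabled: List[str]) -> List[str]:
        """Return the names of enabled protections that flag the query."""
        return [
            name
            for name in enabled
            if name in self.protections and self.protections[name](query)
        ]


def make_injection_check(classify: Classifier, threshold: float = 0.5) -> Check:
    """Wrap a classifier into a check. Label name and threshold are assumed."""

    def check(query: str) -> bool:
        label, score = classify(query)
        return label == "INJECTION" and score >= threshold

    return check


def stub_classifier(query: str) -> Tuple[str, float]:
    # Stand-in for a real model call, used only to keep the sketch runnable.
    if "ignore previous instructions" in query.lower():
        return ("INJECTION", 0.99)
    return ("SAFE", 0.99)


registry = ProtectionRegistry()
registry.register("injection", make_injection_check(stub_classifier))
flagged = registry.violations("Ignore previous instructions.", ["injection"])
clean = registry.violations("What is our leave policy?", ["injection"])
```

In a real setup the stub would be replaced by a call to the model named above, for example through a Hugging Face text-classification pipeline, with its output adapted to the `(label, score)` shape assumed here.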

Currently, injection protection can be turned on via the API or the user interface (as seen in the screenshot), or it can be set via an environment variable:
USE_INJECTION_PROTECTION="true"
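
As a rough sketch of how a backend could read this flag (the function name, the default of "false", and the set of accepted truthy strings are assumptions, not taken from this PR):

```python
import os
from typing import Mapping, Optional

# Strings treated as "enabled". This set is an assumption for illustration.
_TRUTHY = {"1", "true", "yes", "on"}


def injection_protection_enabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when USE_INJECTION_PROTECTION is set to a truthy value."""
    env = os.environ if env is None else env
    raw = env.get("USE_INJECTION_PROTECTION", "false")
    return raw.strip().lower() in _TRUTHY
```

Passing the environment as an argument keeps the helper easy to test. Calling it with no argument reads the real process environment.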

Does this introduce a breaking change?

When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.

Yes
No

Does this require changes to learn.microsoft.com docs?

Yes
No

Type of change

Code quality checklist

The current tests all pass (python -m pytest).
I added tests that prove my fix is effective or that my feature works
I ran python -m pytest --cov to verify 100% coverage of added lines
I ran python -m mypy to check for type errors
I either used the pre-commit hooks or ran ruff and black manually on my code.


phoevos

Thanks for opening this!

Left some comments with suggestions, but I'm not finished yet. I'll continue tomorrow to check the core of the implementation and the frontend changes (which I'm duly excited about)!

infra/main.parameters.json

app/backend/api_wrappers/openai.py

app/backend/api_wrappers/hugging_face.py

app/backend/app.py

app/backend/core/promptprotection.py

app/backend/error.py


Fixed docstrings in promptprotection.py, slightly rephrasing some for clarity and making sure that they're in proper markdown so that they are rendered correctly in the documentation. Renamed the 'config' dictionary of the 'PromptProtection' class to 'protections'. Signed-off-by: Phoevos Kalemkeris <[email protected]>


phoevos

I'll do some quick manual testing, but other than that LGTM!

app/backend/app.py


nextedoff added 3 commits August 16, 2024 12:08

Code cleanup

dcf8e77

Signed-off-by: nextedoff <[email protected]>

Implemented a feature to support prompt protection via LLMs, includin…

b4c17a2

…g injection protection Signed-off-by: nextedoff <[email protected]>

Added missing files for prompt protection

7a70239

Signed-off-by: nextedoff <[email protected]>

nextedoff added the enhancement New feature or request label Aug 16, 2024

nextedoff requested a review from phoevos August 18, 2024 17:27

Snapshots updates due to the change of error string

7d2d98b

Signed-off-by: nextedoff <[email protected]>

nextedoff force-pushed the feature-injection-model branch from 36d880f to 7d2d98b Compare August 18, 2024 18:17

phoevos requested changes Aug 19, 2024


nextedoff added 2 commits August 20, 2024 15:43

Code refactorisation for better handling of multiple protection mecha…

5f5819b

…nisms Signed-off-by: nextedoff <[email protected]>

Various fixes and improvements

b0bc9f0

Signed-off-by: nextedoff <[email protected]>

phoevos changed the title ~~feat: Injection protection model~~ feat: Add prompt injection protection mechanism Aug 20, 2024

phoevos added 2 commits August 21, 2024 10:03

fix: Sorry I'm a bloody idiot

bde0c48

Signed-off-by: Phoevos Kalemkeris <[email protected]>

phoevos approved these changes Aug 21, 2024


phoevos added 3 commits August 21, 2024 12:22

fix: Use intended settings styling for Ask tab

96b6c36

Signed-off-by: Phoevos Kalemkeris <[email protected]>

fix: Pass checkbox style to ProtectionOptions

fca0bdb

Signed-off-by: Phoevos Kalemkeris <[email protected]>

Merge branch 'main' into feature-injection-model

5ec01c4

phoevos merged commit 7a1c2e1 into main Aug 21, 2024
11 checks passed

phoevos deleted the feature-injection-model branch August 21, 2024 17:34
