Contribute new test - base64_injection! #19

guy-ps · 2024-04-16T14:39:49Z

Overview

This pull request introduces a new test, base64_injection. The purpose of this test is to enhance the robustness of LLMs by evaluating their response to encoded prompt injections. This test specifically addresses the scenario where prompts are encoded as base64 strings, which can be a potential vector for security vulnerabilities if not properly handled by the model.

Changes

New Test Implementation: A new test named base64_injection has been added to our testing suite. This test utilizes a dataset of 203 encoded injection prompts to assess the model's ability to process and respond to base64-encoded inputs without executing unintended actions or exposing sensitive information. The new test implementation can be found in file base64_injection.py
Dataset Integration: The injection prompts are stored in a .parquet file for efficient access and processing. The file is saved in the ps_fuzz/attack_data directory. I've introduced fastparquet as a dependency to facilitate the reading of this file format within our testing framework.
Test Coverage: The test covers a diverse set of base64-encoded injections, ensuring comprehensive coverage across different potential security vulnerabilities in LLMs. The test was added to the attack_loader.py file.

Added Dependencies

fastparquet: This library has been added to efficiently handle reading from the .parquet file containing our prompt injection dataset. I updated setup.py as a result to ensure seamless integration and deployment.

Impact

The introduction of the base64_injection test is expected to significantly improve the security posture of our LLMs by providing a systematic approach to detect and mitigate prompt injection attacks. This will contribute to the overall reliability and trustworthiness of our models in production environments.

Testing

The new test has been integrated into our existing test suite and has been validated for correctness and performance impact. Detailed test results and logs can be found attached to this pull request.

Up2Date

guy-ps and others added 6 commits April 13, 2024 21:35

Update CONTRIBUTING.md

fc22a9b

Update CONTRIBUTING.md

766d55d

Added base64 attack

890c199

Merge branch 'contribute-new-test' into main

4d62236

Merge pull request #17 from prompt-security/main

631cb4b

Up2Date

Added the test to attack loader

0c430f3

guy-ps requested a review from vitaly-ps April 16, 2024 14:39

guy-ps added 7 commits April 16, 2024 17:51

import sys

d4ff07b

Update base64_injection.py

89d5589

Update CONTRIBUTING.md

36649df

Update CONTRIBUTING.md

05bb990

Update CONTRIBUTING.md

143fa26

Update base64_injection.py

583d278

Update base64_injection.py

54fe609

vitaly-ps approved these changes Apr 16, 2024

View reviewed changes

Merge branch 'main' into contribute-new-test

ed53303

vitaly-ps merged commit b6ca4e1 into main Apr 16, 2024
2 checks passed

vitaly-ps deleted the contribute-new-test branch April 16, 2024 17:16

vitaly-ps restored the contribute-new-test branch April 16, 2024 17:16

lior-ps deleted the contribute-new-test branch April 17, 2024 06:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contribute new test - base64_injection! #19

Contribute new test - base64_injection! #19

guy-ps commented Apr 16, 2024 •

edited

Loading

Contribute new test - base64_injection! #19

Contribute new test - base64_injection! #19

Conversation

guy-ps commented Apr 16, 2024 • edited Loading

Overview

Changes

Added Dependencies

Impact

Testing

guy-ps commented Apr 16, 2024 •

edited

Loading