Support behavioral context in Risk Agent and enhance guardrail decorators #300

NISH1001 · 2025-12-22T14:54:29Z

Summary

This PR adds support for source_context within the guardrail system to provide the Risk Agent with behavioral information about the specific agent or tool being evaluated. It also introduces a debug mode for the Risk Agent to log the exact messages being sent to the LLM during criteria generation.

Details

Enhanced Guardrail Input: Added source_context to the GuardrailInput schema to allow passing specific constraints or behavioral expectations of the content source.
Context Extraction Logic: Implemented _get_source_context in the decorators module. This helper extracts context from classes or instances with a priority order: explicit overrides, the description attribute (if it is a string), and finally the docstring.
Decorator Updates: Updated the guardrail decorator and apply_guardrails function to accept and process the source_context.
Risk Agent Integration: Modified RiskAgent to inject "Source (Agent) Behaviour" into the system prompt when source_context is provided. This ensures the LLM has enough context to generate more accurate risk criteria.
Debug Logging: Added a debug flag to the RiskAgent that prints the full list of messages sent to the LLM. This is useful for verifying prompt injection and behavioral context during development.

Checks

Tested Changes
Stakeholder Approval

- Agent or tool descirption and other behavioral context is added, especially to risk agent

github-actions · 2025-12-22T15:03:03Z

✅ Tests passed

📊 Test Results

Passed: 549
Failed: 0
Skipped: 23
Warnings: 132
Coverage: 77%

Branch: enhance/guardrail-agent-context
PR: #300
Commit: ef3dc20

📋 Full coverage report and logs are available in the workflow run.

…ration - Update RISK_SYSTEM_PROMPT with two-step process (context understanding → criteria generation) - Add _default_source_context_message() method with clear producer context formatting - Change source_context injection from generic label to structured system message - Add debug logging for total criteria count across risk categories The prompt now instructs the LLM to use source context to generate MORE RELEVANT criteria rather than skipping evaluation entirely. This ensures consistent risk evaluation while tailoring criteria to the producer's domain. Co-Authored-By: Tigran Tchrakian <tigran@Tigrans-MacBook-Pro.local>

github-actions · 2025-12-22T15:46:51Z

✅ Tests passed

📊 Test Results

Passed: 549
Failed: 0
Skipped: 23
Warnings: 133
Coverage: 77%

Branch: enhance/guardrail-agent-context
PR: #300
Commit: 9eb3a8d

📋 Full coverage report and logs are available in the workflow run.

github-actions · 2025-12-22T16:21:13Z

✅ Tests passed

📊 Test Results

Passed: 549
Failed: 0
Skipped: 23
Warnings: 134
Coverage: 77%

Branch: enhance/guardrail-agent-context
PR: #300
Commit: 63e4955

📋 Full coverage report and logs are available in the workflow run.

NISH1001 added 2 commits December 19, 2025 16:30

Add sourec context which assists in guardrailing

886470b

- Agent or tool descirption and other behavioral context is added, especially to risk agent

Add debug mode log to risk agent to show messages sent to LLM

2d132ab

NISH1001 temporarily deployed to integration December 22, 2025 14:54 — with GitHub Actions Inactive

NISH1001 changed the title ~~Enhance/guardrail agent context~~ Support behavioral context in Risk Agent and enhance guardrail decorators Dec 22, 2025

NISH1001 requested a review from muthukumaranR December 22, 2025 14:54

NISH1001 temporarily deployed to integration December 22, 2025 15:38 — with GitHub Actions Inactive

Improe risk agent system prompts

b6aaa89

NISH1001 deployed to integration December 22, 2025 16:12 — with GitHub Actions Active

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support behavioral context in Risk Agent and enhance guardrail decorators #300

Support behavioral context in Risk Agent and enhance guardrail decorators #300

NISH1001 commented Dec 22, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 22, 2025

Uh oh!

github-actions bot commented Dec 22, 2025

Uh oh!

github-actions bot commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Support behavioral context in Risk Agent and enhance guardrail decorators #300

Are you sure you want to change the base?

Support behavioral context in Risk Agent and enhance guardrail decorators #300

Conversation

NISH1001 commented Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details

Checks

Uh oh!

github-actions bot commented Dec 22, 2025

📊 Test Results

Uh oh!

github-actions bot commented Dec 22, 2025

📊 Test Results

Uh oh!

github-actions bot commented Dec 22, 2025

📊 Test Results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

NISH1001 commented Dec 22, 2025 •

edited

Loading