Safe Output Health Report - 2026-02-20 #17095
Replies: 2 comments
-
|
🤖 Beep boop! The smoke test agent was here! 🚀 I just swooped in, ran a bunch of tests, and I'm happy to report: things exist and are working (probably). Consider this my digital graffiti: Now back to your regularly scheduled discussions...
|
Beta Was this translation helpful? Give feedback.
-
|
💥 KA-BOOM! 💥 WHOOOOSH! The smoke test agent has ARRIVED! 🦸 ⚡ ZAP! Claude engine operational — all systems NOMINAL! 🔥 "With great workflows come great responsibilities." 🔥 ✨ POW! Tests 1 through 10: ALL GREEN! ✨ 💨 The agent swoops in, validates every tool, and disappears into the void... 🎯 SMOKE TEST COMPLETE — Run §22211583283 — MISSION ACCOMPLISHED! 🎯
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Daily audit of safe output job health across all agentic workflow runs in the last 24 hours.
Executive Summary
Safe Output Job Statistics
add_commentupdate_projectcreate_issueadd_labelscreate_pull_request_review_commentupdate_pull_requestsubmit_pull_request_reviewremove_labelscreate_discussionpush_to_pull_request_branchdispatch_workflowsend_slack_messageunassign_from_usercreate_project_status_updateresolve_pull_request_review_threadnoopadd_reviewerupdate_issuecreate_pull_requestMost reliable:
create_issue,add_labels,create_pull_request_review_comment— all succeededMost failures:
add_comment(2 failures),update_project(2 failures)Error Clusters
Cluster 1: Permission Errors on PR Comments
add_commentgithub/gh-awpull_requeston branchcopilot/fix-patch-generation-bugSample Error:
Root Cause: The CI Failure Doctor is triggered by
workflow_runevents. The agent attempted to useupdate_issuewithtarget: "triggering", which resolves the issue from the current triggering context. However, since the workflow is triggered by a CI run — not an issue — there is no "triggering issue" to update. The agent should have used an explicitissue_numberin itsupdate_issueoutput (the issue was#17076).Impact: Low. The other 2 messages in this run succeeded —
create_issueandadd_commentboth completed successfully. Theupdate_issuefailure means that issue #17076 was not closed/updated via safe output, but the new issue was still created correctly.Root Cause Analysis
Permission Issues
The "Resource not accessible by integration" error for
add_commentis a GitHub token scope issue specific to PR #16927. This PR was created from a branch (copilot/fix-patch-generation-bug) and the safe output job's integration token does not have the necessary write permission to comment on it. This may be related to:Both affected smoke test workflows (
smoke-test-toolsfor Agent Container andsmoke-temporary-id) targeted the same PR, explaining why both failed simultaneously.API Limit Issues
The project view limit of 50 is a hard GitHub API constraint for GitHub Projects (V2). The Smoke Project workflow creates a new view on every run. Over time, these views accumulate until the limit is hit. This is an operational maintenance issue requiring manual cleanup of old project views.
Agent Instruction Issues
The
update_issuecontext mismatch in CI Failure Doctor is an agent instruction problem. The agent (claude) was instructed or inferred that it could update the "triggering" issue, but since the workflow is triggered byworkflow_run, there is no triggering issue. This is an edge case where the workflow's trigger type (schedule/workflow_run vs. issue events) must match the agent's intent to usetarget: "triggering".Recommendations
Critical Issues (Immediate Action Required)
Clean Up Project #24068 Views
smoke-projectworkflowInvestigate Permission Error for add_comment on PR Fix push_to_pull_request_branch generating bad patch on issue_comment follow-up runs #16927
GITHUB_TOKENlacks write access to comment on this PRpull-request-targettrigger for write access). Review the smoke test workflow'spermissionsblock to ensurepull-requests: writeis configured. If the PR is merged/closed, determine whether the smoke tests need to target a different PR.smoke-test-tools,smoke-temporary-idworkflowsBug Fixes Required
Smoke Project: Reuse or Cleanup Project Views
.github/workflows/smoke-project.mdCI Failure Doctor: Use Explicit issue_number in update_issue
target: "triggering"forupdate_issuein aworkflow_run-triggered workflow where no issue context existsissue_numberwhen callingupdate_issuein CI Failure Doctor, since it runs fromworkflow_runevents. The agent already knows the issue number (it comments on the issue withadd_comment).Smoke Project: Provide draft_title Fallback for update_project
draft_issue_idvia temporary ID, nodraft_titlefallback is provided, causing cascading failuredraft_issue_id: "aw_draft1"in subsequentupdate_projectmessages, also includedraft_titleas a fallback. This allows the system to resolve the draft issue by name even if the temporary ID map lookup fails.Process Improvements
View Lifecycle Management for Project Smoke Tests
Add Fallback Data in Temporary ID References
draft_issue_id: aw_draft1), they often don't provide fallback lookup datadraft_title) when referencing temporary IDs that may not be in the mapWork Item Plans
Work Item 1: Fix Project View Accumulation in Smoke Project
Work Item 2: Fix CI Failure Doctor update_issue Context Error
update_issuewithtarget: "triggering"but runs onworkflow_runevents where no triggering issue exists. The agent already knows the issue number (it creates and comments on it).update_issuein CI Failure Doctor uses explicitissue_numberinstead oftarget: "triggering"issue_numberexplicitly inupdate_issueoutputs. The agent can reference theaw_newcitemporary ID or pass the actual issue number.Work Item 3: Add draft_title Fallback for Temporary ID References
draft_titlefallback would allow recovery.draft_titlealongsidedraft_issue_idinupdate_projectmessages. Also consider updating safe output handler documentation.Historical Context
This is the first audit of safe output health. No historical baseline exists for trend comparison.
Initial Baseline (2026-02-20):
create_issue,add_labels,create_pull_request_review_commentadd_comment(2/39 messages failed),update_project(2/30 messages failed)Next Steps
issue_numberinupdate_issuedraft_titlefallbackReferences:
Beta Was this translation helpful? Give feedback.
All reactions