Skip to content

fix(prompts): reduce false positive injection warnings#32

Merged
xjdr-noumena merged 1 commit into
mainfrom
fix/prompt-injection-warning-noise
Jun 22, 2026
Merged

fix(prompts): reduce false positive injection warnings#32
xjdr-noumena merged 1 commit into
mainfrom
fix/prompt-injection-warning-noise

Conversation

@xjdr-noumena

Copy link
Copy Markdown
Contributor

Summary\n- tighten prompt-injection warning guidance to require concrete evidence before warning the user\n- stop appending the malware mitigation reminder to every benign FileReadTool result\n- keep cyber-risk policy coverage in the system prompt while reducing noisy false positives\n\nFixes #25\n\n## Validation\nbash\n/home/xjdr/.bun/bin/bun test \\n src/constants/prompts.security.test.ts \\n src/constants/productIdentityAudit.test.ts \\n src/tools/FileReadTool/FileReadTool.test.ts \\n src/tools/toolPromptRouting.test.ts\n\n\nResult: 85 pass, 0 fail.

Tightens the tool-result prompt-injection guidance so warnings require concrete evidence, and stops appending the malware mitigation reminder to every benign FileReadTool result.\n\nThe cyber-risk policy text remains in the system prompt; this removes the unconditional per-read warning that caused noisy false positives.\n\nFixes #25\n\nValidation:\n/home/xjdr/.bun/bin/bun test src/constants/prompts.security.test.ts src/constants/productIdentityAudit.test.ts src/tools/FileReadTool/FileReadTool.test.ts src/tools/toolPromptRouting.test.ts
@xjdr-noumena xjdr-noumena merged commit d2757be into main Jun 22, 2026
@xjdr-noumena xjdr-noumena deleted the fix/prompt-injection-warning-noise branch June 22, 2026 14:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant