Agentic Lazy Bias: Verification Pass Failures
Hardening self-correction loops against 'False Positives', where the verifier approves an output that is actually flawed.
Steps
- Enforce a 'Double-Blind' verification pass that uses a different model from the one that produced the output.
- Implement 'Negative Testing': tell the verifier the output is wrong, so it has to look for the error instead of approving by default.
- Use a 'Checklist-as-a-Tool' that the agent must complete on every turn.
- Audit verifier logs for 'Instant Approvals' (approvals with fewer than 2 tokens of reasoning).
- Rotate the verifier's system prompt to prevent pattern-matching bias.
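
Two of the steps above, the 'Instant Approvals' audit and the 'Checklist-as-a-Tool' gate, could be sketched in Python roughly as follows. This is a minimal sketch under assumptions not stated in the steps: the VerifierRecord log schema, the item names in REQUIRED_CHECKS, and whitespace splitting as a stand-in for a real tokenizer are all hypothetical and would need to match your own logs and model.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence

# Threshold from the audit step: fewer than 2 tokens of reasoning.
MIN_REASONING_TOKENS = 2

# Hypothetical checklist items; substitute the checks your verifier must perform.
REQUIRED_CHECKS = ("requirements_met", "tests_run", "edge_cases_reviewed")


@dataclass
class VerifierRecord:
    """One verifier decision as it might appear in a log (assumed schema)."""
    turn: int
    approved: bool
    reasoning: str
    checklist: Dict[str, bool] = field(default_factory=dict)


def reasoning_token_count(record: VerifierRecord) -> int:
    # Whitespace splitting is a crude approximation of model tokens.
    return len(record.reasoning.split())


def is_instant_approval(record: VerifierRecord,
                        min_tokens: int = MIN_REASONING_TOKENS) -> bool:
    # An approval backed by fewer than min_tokens of reasoning.
    return record.approved and reasoning_token_count(record) < min_tokens


def missing_checks(record: VerifierRecord,
                   required: Sequence[str] = REQUIRED_CHECKS) -> List[str]:
    # Items that are absent from the checklist or explicitly marked False.
    return [name for name in required if not record.checklist.get(name, False)]


def audit(records: Sequence[VerifierRecord]) -> Dict[str, List[int]]:
    """Return the turn numbers that fail either check."""
    return {
        "instant_approvals": [r.turn for r in records if is_instant_approval(r)],
        "incomplete_checklists": [r.turn for r in records if missing_checks(r)],
    }


# Usage with made-up log entries.
complete = {name: True for name in REQUIRED_CHECKS}
sample = [
    VerifierRecord(1, True, "LGTM", dict(complete)),
    VerifierRecord(2, True, "Compared the diff against the spec and reran the tests.",
                   {"requirements_met": True, "tests_run": True}),
    VerifierRecord(3, False, "", {}),
]
report = audit(sample)
print(report)
```

The checklist is audited on every turn, not only on approvals, because the step requires the agent to complete it each time. Flagged turns are candidates for manual review, not proof that the verifier was wrong.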