markdown-exfil-tester
Black-box test whether an LLM chatbot is vulnerable to markdown/HTML exfil (the Copilot CVE-2025-32711 / Feb 2026 ChatGPT / ForcedLeak class). Spins up a local sink, sends 36 exfil payloads, renders each LLM response in headless Chromium, correlates via network to confirm real exfil vs CSP-blocked.
Garak, Augustus, and Promptfoo test text-layer injection but never answer 'did the frontend actually fetch the attacker URL?' Lakera + Protect AI have this behind paywall. 36 payloads, 27 tests, E2E verified.
