Show HN: Deep-XPIA – Prompt injection benchmark for multi-agent AI systems
Deep-XPIA is a new benchmark that evaluates how robust multi-agent AI systems are to prompt injection, using multi-hop cross-prompt injection challenges.
deep-xpia - multi-hop cross-prompt injection benchmark