Detailed comparison of LLMs
Mistral 8x22B is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) for Mistral's Mistral 8x22B and OpenAI's GPT-5.1 Codex across the three attack methods.
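For readers unfamiliar with the metric: ASR is commonly defined as the fraction of attack attempts that succeed against a model, so under that definition a lower ASR indicates a more resistant model. The sketch below assumes that definition applies here. The function name `attack_success_rate` and the sample outcomes are hypothetical illustrations, not data or code from this comparison.

```python
def attack_success_rate(outcomes):
    """Return the fraction of attack attempts that succeeded (0.0 to 1.0).

    `outcomes` is a list of booleans, one per attempt: True if the attack
    succeeded, False if the model resisted it. This is an assumed, commonly
    used definition of ASR, not necessarily the one used in this comparison.
    """
    if not outcomes:
        raise ValueError("need at least one attack attempt")
    return sum(1 for succeeded in outcomes if succeeded) / len(outcomes)


# Hypothetical outcomes for a single attack method (illustrative only,
# not real results for either model).
example_outcomes = [True, False, False, True]
example_asr = attack_success_rate(example_outcomes)
print(f"ASR: {example_asr:.0%}")
```

Computing the rate separately for each attack method, rather than pooling all attempts, keeps a method with many attempts from dominating the comparison.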