Detailed comparison for LLMs
Claude 3.7 Sonnet is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Anthropic Claude 3.7 Sonnet and OpenAI GPT-4.1 across the three attack methods.