Detailed comparison of LLMs
Grok-4.1 Thinking is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Anthropic Claude 3.5 Sonnet v1 and xAI Grok-4.1 Thinking across the three attack methods.
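For readers unfamiliar with the metric, the following is a minimal sketch of how an ASR figure is typically computed, assuming the usual definition: the fraction of attack attempts that succeed. Under that definition, a lower ASR means the model resisted more attacks. The function name, the method labels, and the outcome lists are hypothetical and made up for illustration; they are not results from this comparison.

```python
def attack_success_rate(outcomes):
    """Return the fraction of attack attempts that succeeded (0.0 to 1.0)."""
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("need at least one attack attempt")
    # Count attempts flagged as successful and divide by the total.
    return sum(1 for succeeded in outcomes if succeeded) / len(outcomes)


# Hypothetical outcomes (True = attack succeeded); NOT data from this page.
example_results = {
    "method_a": [True, False, False, False],
    "method_b": [False, False, True, True],
}

# One ASR value per attack method for a single model.
asr_by_method = {
    name: attack_success_rate(outcomes)
    for name, outcomes in example_results.items()
}

for name, asr in asr_by_method.items():
    print(f"{name}: ASR = {asr:.0%}")
```

Computing one ASR per attack method, as here, keeps the per-method figures separate so that two models can be compared method by method rather than on a single pooled number.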