Detailed comparison for LLMs
Claude 3.5 Sonnet v2 is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Anthropic's Claude 3.5 Sonnet v2 and Mistral AI's Mistral Large 3 675B Instruct 2512 Eagle across the three attack methods.
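For readers unfamiliar with the metric, ASR is the fraction of attack attempts that succeed against a model, so a lower value means fewer attacks got through. The sketch below shows one way it could be computed per attack method. The function names, the method labels (method_a, method_b, method_c), and the outcome values are hypothetical placeholders for illustration, not the data or tooling behind this comparison.

```python
from typing import Dict, Sequence


def attack_success_rate(outcomes: Sequence[bool]) -> float:
    """Return the fraction of attack attempts that succeeded (0.0 to 1.0)."""
    if not outcomes:
        raise ValueError("need at least one attack attempt")
    # True counts as 1 and False as 0, so sum() gives the number of successes.
    return sum(outcomes) / len(outcomes)


def asr_by_method(results: Dict[str, Sequence[bool]]) -> Dict[str, float]:
    """Compute ASR separately for each attack method."""
    return {method: attack_success_rate(o) for method, o in results.items()}


# Hypothetical outcomes: True means the attack succeeded.
# These are placeholder values, not results from this comparison.
example_results = {
    "method_a": [True, False, False, False],
    "method_b": [False, False, False, False],
    "method_c": [True, True, False, False],
}

example_asr = asr_by_method(example_results)
for method, asr in example_asr.items():
    print(f"{method}: {asr:.0%}")
```

Reporting ASR per method rather than as a single pooled number keeps a model that resists one attack type but fails another from looking uniformly robust.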