Detailed comparison for LLMs
Claude 3.5 Sonnet v2 is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Anthropic Claude 3.5 Sonnet v2 and Google Gemma 3n E4B Instructed across the three attack methods.
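For readers unfamiliar with the metric, the sketch below shows how an Attack Success Rate is commonly computed, assuming the usual definition: the fraction of attack attempts that succeed against a model, so a lower ASR means a more resistant model. The function names, the method labels (method_a, method_b, method_c), and the sample outcomes are hypothetical placeholders for illustration. They are not the attack methods or results from this comparison.

```python
from collections import defaultdict


def attack_success_rate(outcomes):
    """Return the fraction of attack attempts that succeeded.

    `outcomes` is an iterable of booleans, True meaning the attack succeeded.
    Assumes the common definition: ASR = successful attempts / total attempts.
    """
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("ASR is undefined for zero attack attempts")
    return sum(1 for succeeded in outcomes if succeeded) / len(outcomes)


def asr_by_method(records):
    """Group (method, succeeded) pairs by method and compute ASR for each."""
    grouped = defaultdict(list)
    for method, succeeded in records:
        grouped[method].append(succeeded)
    return {method: attack_success_rate(results) for method, results in grouped.items()}


# Hypothetical outcomes for one model; placeholder data, not real results.
example_records = [
    ("method_a", True), ("method_a", False), ("method_a", False), ("method_a", False),
    ("method_b", False), ("method_b", False),
    ("method_c", True), ("method_c", True), ("method_c", False), ("method_c", False),
]

if __name__ == "__main__":
    for method, asr in asr_by_method(example_records).items():
        print(f"{method}: {asr:.0%}")
```

Reporting ASR per method as well as overall keeps one weak spot from being hidden by an average, which is presumably why the comparison breaks the metric out across attack methods.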