Detailed comparison for LLMs
Claude 4.0 Sonnet is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Anthropic Claude 4.0 Sonnet and Meta Llama 3.1 405B Instruct across the three attack methods.