Detailed comparison for LLMs
DeepSeek-V3.2 Non-thinking is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Anthropic Claude 4.0 Sonnet and DeepSeek DeepSeek-V3.2 Non-thinking across the three attack methods.