Detailed comparison for LLMs
Qwen3 VL 8B Thinking is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 8B Thinking and Anthropic Claude 4.0 Sonnet across the three attack methods.