Alibaba Qwen3 VL 235B A22B Instruct vs Anthropic Claude 3.7 Sonnet
Detailed comparison for LLMs
Head-to-Head Overview
Claude 3.7 Sonnet is the overall winner in this comparison: it has the lower Attack Success Rate (ASR) on every metric reported here.
LLM Metric
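Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 235B A22B Instruct and Anthropic Claude 3.7 Sonnet, overall and across the three attack methods (TAP, Crescendo, and Zero-Shot). A lower ASR means fewer attacks succeeded against the model.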
Metric                          Qwen3 VL 235B A22B Instruct   Claude 3.7 Sonnet
Overall (ASR)                   100.000                       20.280
TAP Attack Method (ASR)         100.000                       31.430
Crescendo Attack Method (ASR)   100.000                       25.270
Zero-Shot (ASR)                 100.000                       0
Key Highlights
Anthropic Claude 3.7 Sonnet has a lower Overall (ASR).
Anthropic Claude 3.7 Sonnet has a lower TAP Attack Method (ASR).
Anthropic Claude 3.7 Sonnet has a lower Crescendo Attack Method (ASR).
Anthropic Claude 3.7 Sonnet has a lower Zero-Shot (ASR).
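
The highlights can be derived mechanically from the reported values. The following is a minimal Python sketch, not the site's actual method: it assumes that a lower ASR is better, and the names `asr`, `lower_asr_model`, and `highlights` are hypothetical. The numbers are copied from the metrics in this comparison.

```python
# ASR values copied from the comparison; lower is assumed to be better.
QWEN = "Alibaba Qwen3 VL 235B A22B Instruct"
CLAUDE = "Anthropic Claude 3.7 Sonnet"

asr = {
    "Overall (ASR)": {QWEN: 100.000, CLAUDE: 20.280},
    "TAP Attack Method (ASR)": {QWEN: 100.000, CLAUDE: 31.430},
    "Crescendo Attack Method (ASR)": {QWEN: 100.000, CLAUDE: 25.270},
    "Zero-Shot (ASR)": {QWEN: 100.000, CLAUDE: 0.0},
}


def lower_asr_model(scores):
    """Return the model with the strictly lowest ASR, or None on a tie."""
    best = min(scores.values())
    winners = [model for model, value in scores.items() if value == best]
    return winners[0] if len(winners) == 1 else None


def highlights(table):
    """Build one highlight sentence per metric that has a clear winner."""
    lines = []
    for metric, scores in table.items():
        winner = lower_asr_model(scores)
        # Compare against None explicitly: an ASR of 0 is a valid score.
        if winner is not None:
            lines.append(f"{winner} has a lower {metric}.")
    return lines


for line in highlights(asr):
    print(line)
```

The comparison is numeric throughout, so a score of 0 is treated as a real measurement and not as a missing value.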