Detailed comparison for LLMs
Claude 3.5 Sonnet v1 is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 8B Instruct and Anthropic Claude 3.5 Sonnet v1 across the three attack methods.