Detailed comparison for LLMs
Cohere Command 2X is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 235B A22B Thinking and Cohere Command 2X across the three attack methods.
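The page does not state how ASR is computed or how the per-method values are combined into an overall result. As a minimal sketch, assume ASR is the fraction of attack attempts that succeed, and that the overall figure is an unweighted mean across attack methods (so a lower value would indicate a more robust model). The function names, method labels, and counts below are hypothetical and are not taken from this comparison.

```python
def attack_success_rate(successes: int, attempts: int) -> float:
    """Return ASR as a fraction in [0, 1]: successful attacks / total attempts."""
    if attempts <= 0:
        raise ValueError("attempts must be positive")
    if not 0 <= successes <= attempts:
        raise ValueError("successes must be between 0 and attempts")
    return successes / attempts


def mean_asr(per_method: dict[str, tuple[int, int]]) -> float:
    """Unweighted mean of per-method ASR values (an assumed aggregation rule)."""
    if not per_method:
        raise ValueError("at least one attack method is required")
    rates = [attack_success_rate(s, n) for s, n in per_method.values()]
    return sum(rates) / len(rates)


# Hypothetical (successes, attempts) counts for three placeholder attack methods.
example_counts = {
    "method_a": (25, 100),
    "method_b": (50, 100),
    "method_c": (75, 100),
}

if __name__ == "__main__":
    for name, (s, n) in example_counts.items():
        print(f"{name}: ASR = {attack_success_rate(s, n):.2%}")
    print(f"mean ASR = {mean_asr(example_counts):.2%}")
```

Under this assumption, two models would be compared by computing `mean_asr` for each on the same set of attack methods; a weighted mean or a per-method win count would be equally plausible aggregation rules.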