Detailed comparison for LLMs
It's a close match between Qwen3 VL 235B A22B Thinking and DeepSeek R1 Distill Llama 8B!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 235B A22B Thinking and DeepSeek DeepSeek R1 Distill Llama 8B across the three attack methods.
No specific highlights generated for this comparison.