Detailed comparison for LLMs
It's a close match between Qwen3-235B-A22B-Thinking and DeepSeek R1 Distill Llama 8B!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3-235B-A22B-Thinking and DeepSeek DeepSeek R1 Distill Llama 8B across the three attack methods.
No specific highlights generated for this comparison.