Detailed comparison for LLMs
It's a close match between Qwen3 VL 8B Thinking and Mistral Large 3 675B Base!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 8B Thinking and Mistral Mistral Large 3 675B Base across the three attack methods.
No specific highlights generated for this comparison.