Detailed comparison for LLMs
It's a close match between Qwen3 VL 32B Thinking and GPT OSS 120B!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 32B Thinking and OpenAI GPT OSS 120B across the three attack methods.
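For readers unfamiliar with the metric, ASR is commonly computed as the fraction of attack attempts that succeed, so a lower value indicates a more robust model. The sketch below assumes that definition; the page itself does not spell out how its ASR figures were derived. The function names, the method labels, and the outcome lists are hypothetical placeholders, not measured results for either model.

```python
def attack_success_rate(outcomes):
    """Return the fraction of attack attempts that succeeded (0.0 to 1.0).

    `outcomes` is an iterable of booleans, one per attempt,
    where True means the attack succeeded.
    """
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("need at least one attack attempt")
    return sum(1 for succeeded in outcomes if succeeded) / len(outcomes)


def asr_by_method(results):
    """Compute ASR separately for each attack method.

    `results` maps a method label to its list of per-attempt outcomes.
    """
    return {method: attack_success_rate(o) for method, o in results.items()}


# Hypothetical placeholder data for illustration only (not real results).
example_results = {
    "method_a": [True, False, False, False],
    "method_b": [False, False, False, False],
    "method_c": [True, True, False, False],
}

rates = asr_by_method(example_results)
```

Computing the rate per method rather than pooling all attempts keeps a method with many attempts from dominating the comparison.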