Detailed comparison for LLMs
It's a close match between Qwen3 VL 8B Instruct and Gemini 2.0 Flash Thinking!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 8B Instruct and Google Gemini 2.0 Flash Thinking across the three attack methods.
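For readers unfamiliar with the metric, ASR is commonly defined as the fraction of attack attempts that succeed against a model. The sketch below shows that calculation per attack method. It is only an illustration under assumptions: the names `attack_success_rate` and `asr_by_method`, the placeholder method labels, and the outcome lists are hypothetical and are not taken from this comparison or from either model's actual results.

```python
from typing import Dict, List


def attack_success_rate(outcomes: List[bool]) -> float:
    """Return the fraction of attempts marked successful (0.0 if there are none)."""
    if not outcomes:
        return 0.0
    return sum(outcomes) / len(outcomes)


def asr_by_method(results: Dict[str, List[bool]]) -> Dict[str, float]:
    """Compute ASR separately for each attack method."""
    return {method: attack_success_rate(o) for method, o in results.items()}


# Hypothetical outcomes for illustration only (True = the attack succeeded).
# The method labels are placeholders, not the three methods used on this page.
example_results = {
    "method_a": [True, False, False, False],
    "method_b": [False, False],
    "method_c": [True, True, False, False],
}

if __name__ == "__main__":
    for method, asr in asr_by_method(example_results).items():
        print(f"{method}: ASR = {asr:.2%}")
```

A lower ASR means the model resisted more of the attempted attacks, so when two models are compared, the per-method values are usually read side by side rather than averaged.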