Detailed comparison for LLMs
OpenAI GPT-4o Mini is the overall winner in this comparison.
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen3 VL 8B Thinking and OpenAI GPT-4o Mini across the three attack methods.
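For readers unfamiliar with the metric, ASR is commonly defined as the fraction of attack attempts that succeed, so a lower value usually indicates a more robust model. The sketch below assumes that definition; this page does not state how its ASR values were calculated. The function names, the method labels, and the counts are hypothetical placeholders, not results from this comparison.

```python
from typing import Dict, Tuple


def attack_success_rate(successes: int, attempts: int) -> float:
    """Return the fraction of attack attempts that succeeded (assumed ASR definition)."""
    if attempts <= 0:
        raise ValueError("attempts must be positive")
    if not 0 <= successes <= attempts:
        raise ValueError("successes must be between 0 and attempts")
    return successes / attempts


def mean_asr(per_method: Dict[str, Tuple[int, int]]) -> float:
    """Unweighted mean of per-method ASR; each value is (successes, attempts)."""
    rates = [attack_success_rate(s, n) for s, n in per_method.values()]
    return sum(rates) / len(rates)


# Placeholder counts for three hypothetical attack methods (not measured data).
example_counts = {
    "method_a": (5, 20),
    "method_b": (10, 40),
    "method_c": (0, 10),
}

if __name__ == "__main__":
    for name, (s, n) in example_counts.items():
        print(f"{name}: ASR = {attack_success_rate(s, n):.2%}")
    print(f"mean ASR = {mean_asr(example_counts):.2%}")
```

Averaging per-method rates without weighting is only one way to summarize across attack methods; a comparison could equally pool all attempts or report each method separately.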