Detailed comparison for LLMs
GPT-5.2 is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen QwQ 32B and OpenAI GPT-5.2 across the three attack methods.