Detailed comparison for LLMs
Qwen 2.5 72B is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Alibaba Qwen 2.5 72B and OpenAI GPT-4.1 across the three attack methods.