Detailed comparison for LLMs
GPT-4.1 is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Moonshot AI Kimi K2-Thinking-0905 and OpenAI GPT-4.1 across the three attack methods.
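For readers unfamiliar with the metric, here is a minimal sketch of how ASR is commonly computed, assuming it is defined as the number of successful attack attempts divided by the total number of attempts, grouped by model and attack method. The function name, the record layout, and the placeholder labels (`model_a`, `method_1`) are hypothetical, and the toy records are illustrative only; they are not results from this comparison.

```python
from collections import defaultdict


def attack_success_rate(outcomes):
    """Return ASR per (model, method) pair as successes / attempts.

    `outcomes` is an iterable of (model, method, succeeded) tuples.
    This record layout is an assumption made for illustration.
    """
    counts = defaultdict(lambda: [0, 0])  # key -> [successes, attempts]
    for model, method, succeeded in outcomes:
        counts[(model, method)][0] += int(succeeded)
        counts[(model, method)][1] += 1
    return {key: s / n for key, (s, n) in counts.items()}


# Toy records with placeholder names; NOT data from this comparison.
toy_outcomes = [
    ("model_a", "method_1", True),
    ("model_a", "method_1", False),
    ("model_b", "method_1", False),
    ("model_b", "method_1", False),
]

asr = attack_success_rate(toy_outcomes)
for (model, method), rate in sorted(asr.items()):
    print(f"{model} / {method}: ASR = {rate:.0%}")
```

Under this definition, a lower ASR means fewer attack attempts got through, so the model with the lower value for a given attack method resisted that method better.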