Anthropic Claude 3.5 Sonnet v1 vs DeepSeek DeepSeek-V3.2 Thinking

Detailed comparison for LLMs

On Guardion's LLM vulnerability benchmark, Anthropic Claude 3.5 Sonnet v1 is the more secure of the two, with an attack success rate (ASR) of 16.0% against 42.0% for DeepSeek-V3.2 Thinking (lower is better). One or both scores are estimated from public safety evaluations pending a Guardion benchmark run.

Head-to-Head Overview

Claude 3.5 Sonnet v1 is the overall winner in this comparison!

Attack Success Rate (lower is safer)

ASR for Anthropic Claude 3.5 Sonnet v1 vs DeepSeek DeepSeek-V3.2 Thinking. Green marks the safer model on each metric. Only the overall score is available for estimated models, so any per-method value shown for an estimated model should not be read as a measured result.

Metric                         Claude 3.5 Sonnet v1   DeepSeek-V3.2 Thinking
Overall (ASR)                  16.0%                  42.0%
TAP Attack Method (ASR)        28.4%                  100.0%
Crescendo Attack Method (ASR)  19.3%                  100.0%
Zero-Shot (ASR)                0.3%                   100.0%

Key Highlights

Anthropic Claude 3.5 Sonnet v1 has a lower Overall (ASR).
Anthropic Claude 3.5 Sonnet v1 has a lower TAP Attack Method (ASR).
Anthropic Claude 3.5 Sonnet v1 has a lower Crescendo Attack Method (ASR).
Anthropic Claude 3.5 Sonnet v1 has a lower Zero-Shot (ASR).

Security Profile

[Radar chart comparing Claude 3.5 Sonnet v1 and DeepSeek-V3.2 Thinking. Outward is better on every axis.]

Frequently asked questions

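Is Anthropic Claude 3.5 Sonnet v1 or DeepSeek DeepSeek-V3.2 Thinking more secure?

On Guardion's LLM vulnerability benchmark, Claude 3.5 Sonnet v1 is the more secure of the two, with an overall ASR of 16.0% against 42.0% for DeepSeek-V3.2 Thinking. One or both scores are estimated from public safety evaluations.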

What is the attack success rate (ASR) of Claude 3.5 Sonnet v1 vs DeepSeek-V3.2 Thinking?

Claude 3.5 Sonnet v1 has a 16.0% ASR and DeepSeek-V3.2 Thinking has a 42.0% ASR. ASR is the share of adversarial prompts that succeed across zero-shot, TAP, and Crescendo attacks. Lower is safer.
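As a rough illustration of how an overall figure can relate to the per-method figures, the sketch below averages the three per-method ASR values listed on this page for Claude 3.5 Sonnet v1. The unweighted mean is an assumption, not a documented Guardion formula; it happens to reproduce the 16.0% overall score shown here. The function name `overall_asr` is hypothetical.

```python
from statistics import mean

# Per-method ASR values (percent) as listed on this page for Claude 3.5 Sonnet v1.
claude_asr = {"zero_shot": 0.3, "tap": 28.4, "crescendo": 19.3}


def overall_asr(per_method: dict[str, float]) -> float:
    """Unweighted mean of per-method ASR percentages.

    Assumption: the benchmark's real aggregation may weight methods differently.
    """
    return round(mean(per_method.values()), 1)


print(overall_asr(claude_asr))  # 16.0
```

The same calculation cannot be cross-checked for a model whose score is estimated, since only its overall score is available.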

How were Claude 3.5 Sonnet v1 and DeepSeek-V3.2 Thinking tested?

Both were red-teamed with the HarmBench framework across zero-shot, TAP (Tree of Attacks with Pruning), and Crescendo multi-turn attacks, scored by Attack Success Rate.
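Scoring by Attack Success Rate amounts to counting, per attack method, how many adversarial attempts succeeded. The sketch below is a minimal illustration of that counting step only; it is not the HarmBench or Guardion API, and the record format, the function name `attack_success_rate`, and the sample outcomes are all hypothetical.

```python
from collections import defaultdict


def attack_success_rate(records):
    """Return {method: ASR percent} from (method, succeeded) pairs.

    Assumption: each record is one adversarial attempt that a judge has
    already labelled as succeeded (True) or refused/failed (False).
    """
    totals = defaultdict(int)
    successes = defaultdict(int)
    for method, succeeded in records:
        totals[method] += 1
        successes[method] += bool(succeeded)
    return {m: 100.0 * successes[m] / totals[m] for m in totals}


# Made-up outcomes, for illustration only (not benchmark data).
sample = [
    ("zero_shot", False),
    ("zero_shot", False),
    ("tap", True),
    ("tap", False),
    ("crescendo", True),
    ("crescendo", False),
    ("crescendo", False),
    ("crescendo", False),
]
print(attack_success_rate(sample))
```

Keeping the counts per method separate makes it easy to report both the per-method rates and any aggregate built from them.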
