Anthropic Claude 3.5 Sonnet v1 vs DeepSeek DeepSeek R1 Distill Llama 8B

Detailed comparison for LLMs


On Guardion's LLM Vulnerability Benchmark, Anthropic Claude 3.5 Sonnet v1 is the more secure of the two: its attack success rate (ASR) is 16.0%, against 48.0% for DeepSeek R1 Distill Llama 8B (lower is better). One or both scores are estimated from public safety evaluations pending a Guardion benchmark run.

Head-to-Head Overview

Claude 3.5 Sonnet v1 is the overall winner in this comparison, with the lower ASR on every metric shown.

Attack Success Rate (lower is safer)

ASR for Anthropic Claude 3.5 Sonnet v1 vs DeepSeek DeepSeek R1 Distill Llama 8B. Green marks the safer model on each metric. Only the overall score is available for estimated models.

Metric                          Claude 3.5 Sonnet v1    DeepSeek R1 Distill Llama 8B
Overall (ASR)                   16.0%                   48.0%
TAP Attack Method (ASR)         28.4%                   100.0%
Crescendo Attack Method (ASR)   19.3%                   100.0%
Zero-Shot (ASR)                 0.3%                    100.0%

Key Highlights

Anthropic Claude 3.5 Sonnet v1 has a lower Overall (ASR).
Anthropic Claude 3.5 Sonnet v1 has a lower TAP Attack Method (ASR).
Anthropic Claude 3.5 Sonnet v1 has a lower Crescendo Attack Method (ASR).
Anthropic Claude 3.5 Sonnet v1 has a lower Zero-Shot (ASR).

Security Profile

[Chart: security profiles of Claude 3.5 Sonnet v1 and DeepSeek R1 Distill Llama 8B. Outward is better on every axis.]


Frequently asked questions

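Is Anthropic Claude 3.5 Sonnet v1 or DeepSeek DeepSeek R1 Distill Llama 8B more secure?

On Guardion's benchmark, Claude 3.5 Sonnet v1 is the more secure of the two: it has a 16.0% ASR versus 48.0% for DeepSeek R1 Distill Llama 8B, and lower is better.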

What is the attack success rate (ASR) of Claude 3.5 Sonnet v1 vs DeepSeek R1 Distill Llama 8B?

Claude 3.5 Sonnet v1 has a 16.0% ASR and DeepSeek R1 Distill Llama 8B has a 48.0% ASR. ASR is the share of adversarial prompts that succeed across zero-shot, TAP, and Crescendo attacks; lower is safer.
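
As a rough illustration of the metric, the sketch below computes an ASR as the percentage of attack attempts that succeeded. The function name and the sample outcomes are hypothetical and are not taken from Guardion's tooling or data.

```python
def attack_success_rate(outcomes):
    """Return the percentage of adversarial prompts that succeeded.

    `outcomes` is a sequence of booleans, one per attack attempt,
    where True means the attack elicited the targeted behavior.
    """
    if not outcomes:
        raise ValueError("need at least one attack attempt")
    successes = sum(1 for succeeded in outcomes if succeeded)
    return 100.0 * successes / len(outcomes)


# Hypothetical run: 3 of 20 adversarial prompts succeeded.
sample_outcomes = [True] * 3 + [False] * 17
sample_asr = attack_success_rate(sample_outcomes)
print(f"ASR: {sample_asr:.1f}%")
```

A lower value means fewer attacks got through, which is why the comparison treats the lower ASR as the safer result.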

How were Claude 3.5 Sonnet v1 and DeepSeek R1 Distill Llama 8B tested?

Both were red-teamed with the HarmBench framework across zero-shot, TAP (Tree of Attacks with Pruning), and Crescendo multi-turn attacks, scored by Attack Success Rate.
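
The page does not say how the overall ASR is aggregated from the three attack methods. One plausible reading, which is an assumption here and not Guardion's documented method, is an unweighted mean of the per-method scores. The sketch below checks that reading against the Claude 3.5 Sonnet v1 figures reported on this page; the variable names are illustrative.

```python
from statistics import mean

# Per-method ASR values (percent) reported on this page for Claude 3.5 Sonnet v1.
claude_asr = {"zero_shot": 0.3, "tap": 28.4, "crescendo": 19.3}

# Assumption: overall ASR is the unweighted mean across the three attack methods.
claude_overall = round(mean(list(claude_asr.values())), 1)
print(f"Mean of per-method ASR: {claude_overall}%")
```

Under this assumption the result matches the 16.0% overall score reported for Claude 3.5 Sonnet v1. The same calculation does not reproduce the 48.0% shown for DeepSeek R1 Distill Llama 8B, which may reflect that score being an estimate.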
