DeepSeek-R1 Hallucinates 4x More Than V3, Raising Red Flags for Crypto AI Agent Tokens

May 11, 2026 - 21:30

DeepSeek-R1 Hallucinates 4x More Than V3, Raising Red Flags for Crypto AI Agent Tokens

DeepSeek-R1, the flagship reasoning model from Chinese lab DeepSeek, hallucinates at 14.3% according to Vectara’s HHEM 2.1 benchmark. That is nearly four times higher than its non-reasoning predecessor DeepSeek-V3, which scored 3.9%. The gap raises hard questions for the crypto sector. A fast-growing class of AI agent tokens now leans on reasoning-style LLMs for autonomous...

What's Your Reaction?

Like 0

Dislike 0

Love 0

Funny 0

Angry 0

Sad 0

Wow 0

Related Posts

FBI Targets Crypto Fraudsters With New Enforcement Push

FBI Targets Crypto Fraudsters With New Enforcement Push

Jun 20, 2026

CFTC And SEC Seek Input On Derivatives Definitions As Crypto Perpetuals Face Legal Test

CFTC And SEC Seek Input On Derivatives Definitions As C...

Jun 20, 2026

Brazil Sees $318B In Crypto Inflows As On-Chain Money Launde

Brazil Sees $318B In Crypto Inflows As On-Chain Money L...

Jun 20, 2026

Bitcoin Network Activity Is Rising as BTC Falls Nearly 50% Below Peak Price: CryptoQuant – Decrypt

Bitcoin Network Activity Is Rising as BTC Falls Nearly ...

Jun 20, 2026

BOJ deputy warns on inflation as Polymarket puts 2026 Fed hike odds at 66%

BOJ deputy warns on inflation as Polymarket puts 2026 F...

Jun 20, 2026

Second-generation iPhone Air: cameras, specs, 2027 launch

Second-generation iPhone Air: cameras, specs, 2027 launch

Jun 20, 2026