StepFun’s Voice AI Topped Every Benchmark. It Also Hears Your Sighs – Decrypt

May 26, 2026 - 19:00
StepFun’s Voice AI Topped Every Benchmark. It Also Hears Your Sighs – Decrypt
In brief StepAudio 2.5 Realtime is an end-to-end real-time speech model with fully customizable personas in Chinese and English. StepFun claims first place across all five voice AI benchmarks tested in April 2026, beating GPT Realtime 1.5 and Gemini Live. The model was trained on a million-scale persona dataset and tuned with roleplay-specific RLHF to...

What's Your Reaction?

Like Like 0
Dislike Dislike 0
Love Love 0
Funny Funny 0
Angry Angry 0
Sad Sad 0
Wow Wow 0