阶跃星辰 Storms AI Playoffs, Tops New Six Tigers

⚛️Read original on 量子位

💡New Chinese LLM crashes AI playoffs, leads 'New Six Tigers' tier – benchmark threat?

⚡ 30-Second TL;DR

What Changed

阶跃星辰 (StepFun) enters the AI playoffs and joins the first tier of the "New Six Tigers".

Why It Matters

This elevates emerging Chinese LLMs in global competitions, pressuring leaders like Qwen and DeepSeek. Practitioners may need to reassess benchmarks as new challengers disrupt top tiers.

What To Do Next

Check where 阶跃星辰 (StepFun) models rank on AI model leaderboards such as LMSYS Arena.

Who should care: Researchers & Academics

🧠 Deep Insight

Web-grounded analysis with 6 cited sources.

🔑 Enhanced Key Takeaways

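  • 阶跃星辰 (StepFun) open-sourced the Step 3.5 Flash model in early February 2026. It uses an MoE architecture with 196 billion total parameters, activates about 11 billion parameters per token, and reaches inference speeds of up to 350 tokens per second[1].
  • On its first day in the OpenRouter Fastest Models ranking, Step 3.5 Flash placed as the fastest model worldwide, with a generation rate of 167 tokens/s, and it performs strongly on difficult mathematical reasoning[1].
  • The company also recently released the vision-language model Step-3 VL-10B, which reaches SOTA for its size class, and the speech model Step-Audio-R1.1, which took first place worldwide on the Artificial Analysis Speech Reasoning leaderboard[1].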
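The figures quoted for Step 3.5 Flash imply a sparse activation ratio and concrete streaming times. The following is a minimal sketch of that arithmetic, using only the numbers cited from [1]. The function names and the 2,000-token response length are illustrative assumptions, not values from the source.

```python
# Figures quoted for Step 3.5 Flash in [1].
TOTAL_PARAMS = 196e9        # total parameters
ACTIVE_PARAMS = 11e9        # approximate parameters activated per token
PEAK_TOKENS_PER_S = 350     # quoted peak inference speed
RANKED_TOKENS_PER_S = 167   # rate quoted for the OpenRouter ranking


def activation_ratio(active: float, total: float) -> float:
    """Fraction of the model's parameters used for each token."""
    return active / total


def generation_seconds(num_tokens: int, tokens_per_s: float) -> float:
    """Time to stream num_tokens at a constant rate (ignores time to first token)."""
    return num_tokens / tokens_per_s


if __name__ == "__main__":
    ratio = activation_ratio(ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"activated per token: {ratio:.1%}")
    # 2,000 tokens is a hypothetical response length chosen for illustration.
    for rate in (PEAK_TOKENS_PER_S, RANKED_TOKENS_PER_S):
        print(f"{rate} tok/s -> {generation_seconds(2000, rate):.1f} s")
```

On these numbers, roughly one parameter in eighteen is active for any given token, which is the main reason a model of this total size can stream at the quoted rates.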

🛠️ Technical Deep Dive

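  • The Step3 model uses a Mixture-of-Experts (MoE) architecture with 321B total parameters (VLM) and 38B activated parameters per token[3].
  • Architecture details: 61 layers (including 5 dense layers), hidden dimension 7168, MFA attention mechanism, low-rank query dimension 2048, 64 query heads (head dimension 256), and 48 experts with 3 selected per token plus 1 shared expert[3].
  • Supports a maximum context length of 65536 and uses the Deepseek V3 tokenizer; the LLM alone has 316B total parameters[3].
  • Step 3.5 Flash: MoE architecture, 196 billion total parameters, 11 billion activated per token, optimized for complex agent workflows, and up to 350 tokens/s on single-request coding tasks[1].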
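The "48 experts, 3 selected, plus 1 shared expert" layout can be illustrated with a toy router. This is a sketch under stated assumptions, not Step3's actual implementation: the source does not say which gating function or weight normalization Step3 uses, so the softmax gate, the renormalization over the selected experts, and the scalar toy experts below are all hypothetical.

```python
import math
import random

NUM_EXPERTS = 48   # routed experts per MoE layer, as quoted in [3]
TOP_K = 3          # routed experts selected per token, as quoted in [3]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route(router_logits, k=TOP_K):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Assumption: weights are softmax probabilities renormalized over the
    selected experts so that they sum to 1.
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_output(x, router_logits, experts, shared_expert):
    """Shared expert always runs; routed experts add a weighted contribution."""
    out = shared_expert(x)
    for i, w in route(router_logits):
        out += w * experts[i](x)
    return out


if __name__ == "__main__":
    random.seed(0)
    # Toy scalar "experts": expert i scales its input by i (purely illustrative).
    experts = [lambda x, s=i: s * x for i in range(NUM_EXPERTS)]
    logits = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
    print("selected experts:", route(logits))
    print("layer output:", moe_output(1.0, logits, experts, lambda x: x))
```

Only 4 of the 49 expert functions run for a given token in this sketch, which mirrors how the model's activated parameter count stays far below its total.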

🔮 Future Implications

AI analysis grounded in cited sources

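阶跃星辰 (StepFun) will strengthen China's leading position in open-source AI foundation models
Its successive releases of multimodal SOTA models such as Step 3.5 Flash, Step-3 VL, and Step-Audio, together with CTO Zhu Yibo (朱亦博) sharing optimization experience at GTC 2026, are expected to accelerate gains in multimodal training efficiency[1][3][4].
MoE architectures will dominate efficient large-model deployment
Step3 and Step 3.5 Flash use innovations such as MFA and AFD to run efficiently on low-end accelerators and lead global speed rankings, pushing cost optimization for open-source models[1][3].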
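To make the deployment trade-off concrete, the sketch below estimates weight storage for the two models from the parameter counts cited in [1] and [3]. The precisions are illustrative assumptions (the sources quoted here do not state a deployment precision), and the estimate ignores KV cache, activations, and runtime overhead.

```python
# Bytes per parameter for a few common weight formats (illustrative choices).
BYTES_PER_PARAM = {"bf16": 2, "fp8": 1, "int4": 0.5}

# (total parameters, activated parameters per token) as cited in [1] and [3].
MODELS = {
    "Step3": (321e9, 38e9),
    "Step 3.5 Flash": (196e9, 11e9),
}


def weight_gb(num_params: float, precision: str) -> float:
    """Storage for num_params weights at the given precision, in GB (1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    for name, (total, active) in MODELS.items():
        for precision in BYTES_PER_PARAM:
            stored = weight_gb(total, precision)
            touched = weight_gb(active, precision)
            print(f"{name:15s} {precision:5s} stored={stored:7.1f} GB "
                  f"read per token={touched:6.1f} GB")
```

All expert weights still have to be stored somewhere, but only the activated subset is read for each token, which is where an MoE model's per-token cost advantage comes from.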

Timeline

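2026-01
Released its first model, launching the StepFun model series
2026-02
Open-sourced Step 3.5 Flash, Step-3 VL-10B, and other multimodal models
2026-02
Step-Audio-R1.1 tops the global speech reasoning leaderboard
2026-02
Step 3.5 Flash enters the OpenRouter ranking of the world's fastest models
2026-02
阶跃星辰 (StepFun) joins the first tier of AI's "New Six Tigers"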

📎 Sources (6)

Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.

  1. cfbond.com — Wap 991119525
  2. xyzlabs.substack.com — Open Reasoner Zero a Breakthrough
  3. GitHub — Step3
  4. NVIDIA — Gtc26 S81883
  5. llm-stats.com — Stepfun
  6. modelscope.cn — Step 3.5 Flash
Original source: 量子位