⚛️ 量子位 (QbitAI)
Shengshu Tech Raises ~$280M Series B

💡 A roughly $280M round to fund world models that aim to unify AI productivity across the digital and physical worlds
⚡ 30-Second TL;DR
What Changed
Nearly 2B RMB (~$280M) Series B funding completed
Why It Matters
Adds substantial capital to China's world-model race, potentially accelerating adoption of embodied AI and simulation technology.
What To Do Next
Review Shengshu Tech's world model whitepapers for robotics simulation integration.
Who should care: Founders & Product Leaders
🧠 Deep Insight
AI-generated analysis for this event.
🔑 Enhanced Key Takeaways
- The funding round was led by prominent investors including Alibaba, Baidu, and Zhipu AI, signaling strong strategic backing from major players in China's AI ecosystem.
- Shengshu Technology is best known for Vidu, a video generation model that produces high-consistency, 16-second videos from text prompts.
- The capital is earmarked for scaling compute infrastructure and accelerating development of multimodal world models that go beyond video generation into interactive simulation.
📊 Competitor Analysis
| Feature | Shengshu (Vidu) | OpenAI (Sora) | Runway (Gen-3) |
|---|---|---|---|
| Core Focus | General World Models | Video Generation | Creative Video Tools |
| Consistency | High (Temporal/Spatial) | High (Simulated) | Medium-High |
| Accessibility | China-market focused | Global (Limited) | Global (Public) |
| Architecture | U-ViT (Diffusion) | DiT (Diffusion) | Latent Diffusion |
🛠️ Technical Deep Dive
- Uses U-ViT, a Vision Transformer backbone for diffusion models that adds U-Net-style long skip connections between shallow and deep layers. It treats all inputs, including the timestep, the condition, and noisy image patches, as tokens, which allows more efficient scaling than traditional U-Net-based diffusion models.
- Employs a proprietary 'Diffusion Transformer' approach that integrates spatiotemporal attention to maintain object permanence across long video sequences.
- Focuses on 'World Model' training objectives, in which the model is trained to predict future frames from physical laws and causal relationships rather than by pixel-level interpolation alone.
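The two ideas in the list above, treating video as tokens and linking shallow and deep transformer blocks with long skip connections, can be illustrated with a minimal pure-Python sketch. This is not Vidu's implementation, which is not described in the source. The function names `patchify_video` and `long_skip_pairs`, the patch sizes, and the block counts are all hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: names and sizes are hypothetical,
# not Shengshu's or Vidu's actual configuration.

def patchify_video(video, patch_t, patch_h, patch_w):
    """Split a video (nested lists: frames x rows x cols) into flat
    spatiotemporal patch tokens, one token per (t, h, w) block."""
    T, H, W = len(video), len(video[0]), len(video[0][0])
    if T % patch_t or H % patch_h or W % patch_w:
        raise ValueError("video dimensions must be divisible by the patch size")
    tokens = []
    for t0 in range(0, T, patch_t):
        for h0 in range(0, H, patch_h):
            for w0 in range(0, W, patch_w):
                # Flatten every value inside this block into a single token.
                token = [
                    video[t][h][w]
                    for t in range(t0, t0 + patch_t)
                    for h in range(h0, h0 + patch_h)
                    for w in range(w0, w0 + patch_w)
                ]
                tokens.append(token)
    return tokens


def long_skip_pairs(num_in_blocks):
    """Return (shallow_block, deep_block) index pairs for a stack made of
    num_in_blocks input blocks, one middle block, and num_in_blocks output
    blocks, mirroring the U-Net-style long skip pattern."""
    total_blocks = 2 * num_in_blocks + 1
    return [(i, total_blocks - 1 - i) for i in range(num_in_blocks)]


# A toy 4-frame, 4x4 "video" whose values encode their own position.
video = [[[t * 16 + h * 4 + w for w in range(4)] for h in range(4)] for t in range(4)]
tokens = patchify_video(video, patch_t=2, patch_h=2, patch_w=2)
print(len(tokens), len(tokens[0]))  # 8 tokens, each with 8 values
print(long_skip_pairs(2))           # [(0, 4), (1, 3)]
```

In a real model each token would be linearly projected to an embedding and processed by attention layers. The sketch shows only how a clip becomes a token sequence and which blocks a long skip connection would join.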
🔮 Future Implications
AI analysis grounded in cited sources
Shengshu will pivot from video generation to interactive 3D simulation.
The company's stated goal of bridging digital and physical worlds requires moving from passive video output to active, physics-based environment interaction.
Vidu will integrate directly into Alibaba and Baidu cloud ecosystems.
The strategic investment from these cloud giants suggests a move to provide Vidu as a foundational API service for their enterprise customers.
⏳ Timeline
2024-04
Shengshu Technology releases Vidu, China's first Sora-like video generation model.
2024-07
Vidu model undergoes major update to support 16-second video generation and improved prompt adherence.
2026-04
Shengshu Technology secures ~$280M in Series B funding to scale world model development.
Original source: 量子位 (QbitAI)
