🏠IT之家•Freshcollected in 4m
Alibaba Leads 2B RMB Vidu Funding Round

💡Huge funding fuels video AI race—Vidu challenges Sora, Seedance
⚡ 30-Second TL;DR
What Changed
Alibaba Cloud leads ~2B RMB (~$280M) round for Shengshu Tech
Why It Matters
Accelerates Shengshu's video AI edge against ByteDance and Kuaishou, signaling investor bets on multimodal models. Boosts Alibaba Cloud's AI ecosystem via portfolio startups.
What To Do Next
Test Vidu's Q3 model on Artificial Analysis for video gen benchmarks.
Who should care:Founders & Product Leaders
🧠 Deep Insight
AI-generated analysis for this event.
🔑 Enhanced Key Takeaways
- •Shengshu Technology's core team originates from Tsinghua University's Institute for AI Industry Research (AIR), founded by former Baidu President Ya-Qin Zhang, providing a strong academic foundation for their world model development.
- •The funding round values Shengshu Technology at a significant premium, reflecting investor confidence in their proprietary 'U-ViT' architecture which integrates diffusion models with transformer-based visual tokenization.
- •Beyond video generation, the company is pivoting its R&D focus toward 'General World Models' (GWMs) that aim to simulate physical laws and object interactions, moving beyond simple text-to-video synthesis.
📊 Competitor Analysis▸ Show
| Feature | Vidu (Shengshu) | Sora (OpenAI) | Kling (Kuaishou) |
|---|---|---|---|
| Architecture | U-ViT (Diffusion/Transformer) | DiT (Diffusion Transformer) | 3D VAE + Diffusion |
| Max Duration | 16s (Single-shot) | 60s (Multi-shot) | 10s (Single-shot) |
| Primary Focus | General World Model | Cinematic/Creative | Realistic/Human Motion |
| Benchmark Rank | Top 10 (Artificial Analysis) | N/A (Limited Access) | Top 5 (Artificial Analysis) |
🛠️ Technical Deep Dive
- •Architecture: Utilizes a proprietary U-ViT (Unified Vision Transformer) framework that treats video frames as visual tokens, allowing for high-fidelity temporal consistency.
- •Training Data: Employs a massive, curated dataset of high-resolution video-text pairs, specifically optimized for long-range temporal coherence and physical object permanence.
- •Inference Optimization: Implements custom CUDA kernels for accelerated diffusion sampling, enabling sub-second latency for initial frame generation.
- •World Model Integration: Incorporates physics-informed loss functions during training to better simulate gravity, collision, and material properties in generated scenes.
🔮 Future ImplicationsAI analysis grounded in cited sources
Shengshu will integrate Vidu into Alibaba Cloud's 'Model Studio' platform as a primary enterprise API.
The strategic investment by Alibaba Cloud strongly suggests a move to monetize Vidu through their existing enterprise AI infrastructure.
The company will release an open-source version of a smaller, distilled Vidu model by Q4 2026.
To compete with open-source leaders like Stable Video Diffusion, Shengshu needs to establish a developer ecosystem to maintain market relevance.
⏳ Timeline
2024-04
Shengshu Technology officially unveils Vidu at the Zhongguancun Forum.
2024-07
Vidu opens public access to global users, marking its transition from beta to commercial availability.
2025-03
Shengshu reports a 10x increase in active user base compared to the previous year.
2026-04
Alibaba Cloud leads a 2 billion RMB funding round to accelerate General World Model R&D.
📰
Weekly AI Recap
Read this week's curated digest of top AI events →
👉Related Updates
AI-curated news aggregator. All content rights belong to original publishers.
Original source: IT之家 ↗



