
NVIDIA Launches AI Space Computing


💡 NVIDIA puts data center AI in orbit: vital for space edge computing devs

⚡ 30-Second TL;DR

What Changed

NVIDIA announced its AI space computing push on the night of March 16 at its GTC conference.

Why It Matters

This launch enables scalable AI processing in space, accelerating satellite analytics and autonomous missions for the defense and commercial space sectors. AI practitioners gain new edge-deployment options beyond Earth.

What To Do Next

Test Jetson Orin developer kits for SWaP-constrained AI prototypes simulating space edge inference.
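
As a concrete starting point, below is a minimal latency-measurement sketch. It assumes a Jetson Orin devkit running JetPack with NVIDIA's PyTorch wheel and torchvision installed; the ResNet-18 workload is a hypothetical stand-in, not anything from the announcement.

```python
# Minimal latency sketch for SWaP-constrained edge inference prototyping.
# Assumptions (not from the source article): a Jetson Orin devkit running
# JetPack with NVIDIA's PyTorch wheel and torchvision installed; ResNet-18
# is a hypothetical stand-in workload, not an NVIDIA-provided benchmark.
import time
import torch
import torchvision.models as models

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
dtype = torch.float16 if device.type == "cuda" else torch.float32

model = models.resnet18(weights=None).to(device=device, dtype=dtype).eval()
frame = torch.randn(1, 3, 224, 224, device=device, dtype=dtype)

def sync():
    # GPU kernels launch asynchronously; synchronize before reading the clock.
    if device.type == "cuda":
        torch.cuda.synchronize()

with torch.no_grad():
    for _ in range(10):          # warm-up so one-time CUDA init is excluded
        model(frame)
    sync()

    start = time.perf_counter()
    for _ in range(100):         # steady-state timing window
        model(frame)
    sync()

print(f"mean latency: {(time.perf_counter() - start) / 100 * 1000:.2f} ms/frame")
```

On actual hardware, pairing a loop like this with `nvpmodel` power modes and `tegrastats` shows how latency trades against the board's power budget, which is the binding constraint for SWaP-limited space-edge payloads.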

Who should care: Developers & AI Engineers

🧠 Deep Insight

Web-grounded analysis with 8 cited sources.

🔑 Enhanced Key Takeaways

  • NVIDIA's Rubin GPU features a 2.3 kW TDP and 50 petaFLOPS of NVFP4 AI inference performance per GPU, with 288GB of HBM4 capacity.
  • The Vera CPU includes 88 custom Olympus Arm cores supporting spatial multithreading for 176 threads, paired with up to 1.5TB of LPDDR5X memory at 1.2 TB/s bandwidth.[1][2][3][4][5]
  • The Rubin platform introduces sixth-generation NVLink at 3.6 TB/s per GPU and NVLink-C2C at 1.8 TB/s for coherent CPU-GPU memory access.[1][3][4]
  • The Vera Rubin NVL72 rack delivers 3.6 exaFLOPS of NVFP4 inference and integrates components such as the NVLink 6 switch, ConnectX-9, and BlueField-4 DPU.[2][3][4][6]

๐Ÿ› ๏ธ Technical Deep Dive

  • Rubin GPU: 224 Streaming Multiprocessors (SMs) with fifth-generation Tensor Cores for NVFP4/FP8 and expanded SFUs for attention/sparse compute; 288GB HBM4 (8 stacks), 22.2 TB/s bandwidth, 2.3 kW TDP.[1][3][4]
  • Vera CPU: 88 Olympus Armv9.2 cores with spatial multithreading (176 threads total, two per core via resource partitioning); under 50W for 1.5TB of LPDDR5X at 1.2 TB/s bandwidth using SOCAMM modules.[2][3][4][5]
  • Interconnects: NVLink 6 at 3.6 TB/s GPU-to-GPU, NVLink-C2C at 1.8 TB/s coherent CPU-GPU; NVL72 rack: 72 GPUs + 36 Vera CPUs, 20.7TB HBM4, 260 TB/s aggregate NVLink (see the arithmetic check after this list).[1][2][3][4]
  • Rack-scale: NVL72 offers 3.6 EFLOPS NVFP4 inference; includes BlueField-4 DPU (64-core Grace CPU) and ConnectX-9 networking.[2][3][4][6]
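
The rack-level aggregates in the bullets above follow directly from the per-GPU figures; the quick check below is my own multiplication, not a number taken from the cited sources.

```python
# Sanity check: multiply the published per-GPU figures by the NVL72's 72 GPUs
# to confirm the rack-level aggregates quoted above (my arithmetic, not a cited figure).
gpus_per_rack = 72

pflops_per_gpu = 50         # NVFP4 inference, petaFLOPS
hbm4_gb_per_gpu = 288       # HBM4 capacity, GB
nvlink_tbps_per_gpu = 3.6   # NVLink 6 bandwidth, TB/s

print(f"inference: {gpus_per_rack * pflops_per_gpu / 1000:.1f} EFLOPS")  # 3.6 EFLOPS
print(f"HBM4:      {gpus_per_rack * hbm4_gb_per_gpu / 1000:.1f} TB")     # 20.7 TB
print(f"NVLink:    {gpus_per_rack * nvlink_tbps_per_gpu:.0f} TB/s")      # ~260 TB/s
```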

🔮 Future Implications

AI analysis grounded in cited sources.

  • Rubin NVL72 provides 5x Blackwell inference performance at 10x lower cost per token: NVIDIA claims Rubin delivers 50 petaFLOPS of NVFP4 inference per GPU versus Blackwell's 10 petaFLOPS, targeting an H2 2026 launch for optimized AI economics.[4]
  • Vera Rubin enables rack-scale model partitioning without inter-rack communication: 3.6 TB/s of NVLink per GPU and 260 TB/s of aggregate rack bandwidth remove partitioning bottlenecks for massive AI models (see the back-of-envelope sketch below).[1][2]
  • SOCAMM LPDDR5X improves AI factory uptime via modular, serviceable memory: up to 1.5TB of capacity with fault isolation addresses the soldered-memory limitations of prior Grace-based systems.[2][3][5]
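
Taking the interconnect figures above at face value, here is a back-of-envelope sketch of what that bandwidth means for partitioned models; it uses only numbers quoted in this digest and is my own estimate, not an NVIDIA benchmark.

```python
# Back-of-envelope, using only figures quoted above (not an NVIDIA benchmark):
# time to stream one GPU's full 288 GB HBM4 contents over its 3.6 TB/s NVLink 6
# port, i.e. a rough ceiling on per-GPU data exchange during model partitioning.
hbm4_gb = 288
nvlink_tb_per_s = 3.6

seconds = (hbm4_gb / 1000) / nvlink_tb_per_s
print(f"~{seconds * 1000:.0f} ms to move a full HBM4 image over one NVLink port")  # ~80 ms
```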

โณ Timeline

  • 2025-01: NVIDIA announces the Rubin platform architecture with Vera CPU and Rubin GPU previews.
  • 2026-01: Rubin AI compute platform and Vera Rubin NVL72 launched at CES with HBM4 and NVLink 6 details.
  • 2026-03: Vera Rubin specs upgraded to 2.3 kW TDP and 22.2 TB/s bandwidth ahead of late-2026 production.
  • 2026-03: Space-optimized Vera Rubin module, IGX Thor, and Jetson Orin unveiled at GTC for orbital AI.

AI-curated news aggregator. All content rights belong to original publishers.
Original source: cnBeta (Full RSS) ↗