🗾Freshcollected in 83m

Why SoftBank chooses NVIDIA for its AI supercomputer

Why SoftBank chooses NVIDIA for its AI supercomputer
PostLinkedIn
🗾Read original on ITmedia AI+ (日本)

💡Understand the real-world engineering trade-offs behind building a 122 billion yen AI supercomputer.

⚡ 30-Second TL;DR

What Changed

SoftBank invested 122 billion yen in AI infrastructure

Why It Matters

Reinforces NVIDIA's dominance in the high-performance computing market. It provides insight into the trade-offs between cost and performance for large-scale AI infrastructure.

What To Do Next

Evaluate the long-term maintenance and ecosystem support costs when selecting hardware for large-scale AI clusters.

Who should care:Developers & AI Engineers

🧠 Deep Insight

AI-generated analysis for this event.

🔑 Enhanced Key Takeaways

  • SoftBank's supercomputer project is part of a broader strategic initiative to build a sovereign AI infrastructure in Japan, reducing reliance on overseas cloud providers.
  • The infrastructure utilizes NVIDIA's Blackwell architecture, specifically leveraging the GB200 NVL72 platform to handle massive parameter counts for large language models.
  • SoftBank is integrating its proprietary AI models, such as the 'SBCM' (SoftBank Corp Model), directly into this hardware stack to optimize inference performance.
  • The investment includes a significant focus on liquid cooling technology to manage the thermal output of high-density GPU clusters, which was a primary driver for the initial hardware stability challenges.
  • This initiative aligns with the Japanese government's 'AI and Software-Defined Infrastructure' policy, which provides subsidies for domestic companies building large-scale compute clusters.
📊 Competitor Analysis▸ Show
FeatureSoftBank (NVIDIA-based)Competitor (e.g., AWS/Google)Local Japanese Cloud Providers
Primary HardwareNVIDIA GB200 NVL72Custom Silicon (Trainium/TPU)NVIDIA H100/H200 Clusters
ControlSovereign/PrivatePublic Cloud/SharedSovereign/Shared
LatencyUltra-low (Dedicated)Variable (Network dependent)Low (Regional)

🛠️ Technical Deep Dive

  • Architecture: Utilizes NVIDIA GB200 NVL72, connecting 72 Blackwell GPUs and 36 Grace CPUs in a single rack.
  • Interconnect: Employs 5th Gen NVLink for 1.8TB/s bidirectional bandwidth per GPU.
  • Cooling: Implementation of direct-to-chip liquid cooling systems to support high TDP (Thermal Design Power) per rack.
  • Software Stack: Integration of NVIDIA AI Enterprise software suite for orchestration, including NeMo for model training and TensorRT-LLM for inference optimization.

🔮 Future ImplicationsAI analysis grounded in cited sources

SoftBank will achieve a 30% reduction in inference costs for its internal AI services by 2027.
The transition to Blackwell-based hardware significantly improves energy efficiency and tokens-per-watt metrics compared to previous generation clusters.
SoftBank will launch a commercial 'AI-as-a-Service' platform for Japanese enterprises by Q4 2026.
The scale of the 122 billion yen investment exceeds internal requirements, necessitating a commercialization strategy to recoup capital expenditures.

Timeline

2023-11
SoftBank announces plans to build a large-scale generative AI computing platform in Japan.
2024-05
SoftBank confirms the expansion of its AI compute infrastructure, targeting the deployment of NVIDIA's latest GPU architectures.
2025-02
SoftBank begins initial testing of high-density liquid-cooled server racks to address thermal management issues.
2026-01
SoftBank officially integrates the 122 billion yen supercomputer cluster into its production environment.
📰

Weekly AI Recap

Read this week's curated digest of top AI events →

👉Related Updates

AI-curated news aggregator. All content rights belong to original publishers.
Original source: ITmedia AI+ (日本)