Meta Launches 4 New AI Chips: MTIA

💡Meta's custom AI chips challenge Nvidia dominance—key for infra builders.
⚡ 30-Second TL;DR
What Changed
Meta developed 4 new MTIA processors
Why It Matters
Meta's custom chips could reduce dependency on Nvidia, cutting costs and securing AI supply chains for hyperscalers. This accelerates the trend of big tech designing specialized AI silicon. Practitioners gain potential new hardware options beyond GPUs.
What To Do Next
Monitor Meta's AI blog for MTIA benchmark releases to assess inference performance gains.
🧠 Deep Insight
Web-grounded analysis with 9 cited sources.
🔑 Enhanced Key Takeaways
- •Next-gen MTIA (v2) uses TSMC 5nm process, operates at 1.35GHz, and delivers 102.4 TFLOPS INT8 GEMM performance with 90W TDP[1][2].
- •MTIA v2 features an 8x8 PE grid with 3.5x dense and 7x sparse compute gains over v1, plus tripled local PE storage and doubled on-chip SRAM[1][2].
- •MTIA v2 supports inference only, not training, and platform-level tests show 6x serving throughput improvement over v1 systems[1][2].
- •MTIA-2 enters production on TSMC 3nm with CoWoS-S packaging and Broadcom support, debuting H1 2026; MTIA-3 follows in H2 2026[5].
🛠️ Technical Deep Dive
- •Architecture: 8x8 grid of 64 processing elements (PEs) connected via mesh network, supporting thread/data/instruction/memory-level parallelism[1][4][7].
- •Memory: 384KB local SRAM per PE (3x v1), 256MB on-chip SRAM (2x v1), up to 128GB off-chip LPDDR5; bandwidths: 1TB/s local per PE, 2.7TB/s on-chip, 176GB/s off-chip[1][2].
- •Performance: 102.4 TFLOPS/s INT8 GEMM, 51.2 TFLOPS/s FP16/BF16, 708 TOPS INT8 with sparsity; vector core 3.2 TFLOPS/s INT8[1][2].
- •Connectivity: 8x PCIe Gen4 (16GB/s host), TDP 90W (2.6x v1), die size 421mm² (25.6x16.4mm), TSMC 5nm[1][2].
- •Deployment: 12 accelerators per Yosemite V3 server with PCIe switches for inter-accelerator communication[4].
🔮 Future ImplicationsAI analysis grounded in cited sources
⏳ Timeline
📎 Sources (9)
Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.
- Meta AI — Next Generation Meta Training Inference Accelerator AI Mtia
- nextplatform.com — 1654250
- dl.acm.org — 3695053
- Meta AI — Meta Training Inference Accelerator AI Mtia
- trendforce.com — News Metas Mtia 3 AI Chip Reportedly Tipped for 2h26 Debut Built on TSMC 3nm with Guc Support
- globenewswire.com — Meta Platforms Global Mtia AI Processor Deployment Analysis Report 2026 V1 Freya V2 Artemis and V3 Iris As Well As Insights Into the Future V4 V5 and V6 Asics
- mashdigi.com — Meta Is Reportedly Testing a New Self Made Chip for Accelerating Artificial Intelligence Training Which Is Expected to Be Put Into Use in 2026
- ejlwireless.com — Toc Aixpu Meta Utr Profile
- ainvest.com — Meta Abandons Advanced Chip Project Turns External Partners 2602
📰 Event Coverage
Weekly AI Recap
Read this week's curated digest of top AI events →
👉Related Updates
AI-curated news aggregator. All content rights belong to original publishers.
Original source: Wired AI ↗
