🇬🇧The Register - AI/ML•Mar 12, 2026Stalecollected in 11m

Meta Unveils Four Superior Custom AI Chips

Post LinkedIn

🇬🇧Read original on The Register - AI/ML

#ai-chips #custom-silicon #hyperscale-hardwaremeta-custom-ai-chips

💡Meta's custom AI chips beat commercial silicon at gigawatt scale—vital for AI infra builders.

⚡ 30-Second TL;DR

What Changed

Meta revealed four custom AI chips for its AI services

Why It Matters

Meta's custom chips reduce dependency on third-party hardware, potentially cutting costs for hyperscale AI training. This accelerates in-house optimization for Llama models and services. Signals broader trend of big tech building proprietary AI infra.

What To Do Next

Review Meta's engineering blog for benchmarks comparing these chips to Nvidia GPUs.

Who should care:Enterprise & Security Teams

🧠 Deep Insight

Web-grounded analysis with 7 cited sources.

🔑 Enhanced Key Takeaways

•Meta's custom chips are part of the MTIA (Meta Training and Inference Accelerator) family, with the next-generation version featuring 128 MB on-chip memory, 64 GB off-chip LPDDR5, and clock speeds up to 1.35 GHz at 90 watts per chip[2].
•Meta has a multi-year partnership with AMD to deploy up to 6 gigawatts of custom Instinct MI450 GPUs and 6th Gen EPYC CPUs (Venice/Venano), with first shipments in 2H 2026, complementing its in-house silicon efforts[3][5].
•The MTIA chips are deployed in a rack-scale system holding up to 72 accelerators across three chassis, designed for co-optimized hardware-software stacks handling low-to-high complexity ranking and recommendation models[2].
•Meta continues heavy reliance on Nvidia for training with over 1.3 million GPUs planned by end-2025, while shifting some inference workloads to AMD MI300X and future MI455X for cost-efficiency[6].

🛠️ Technical Deep Dive

•Next-gen MTIA: TSMC 5nm process, clocked at 1.35 GHz (up from 800 MHz), power at 90W (up from 25W), with 128 MB on-chip memory and 64 GB off-chip LPDDR5[2].
•Memory bandwidth: Local 400 GB/s per PE, on-chip 800 GB/s, off-chip LPDDR5 176 GB/s; compute at 2.76 TFLOPS/s (FP32)[2].
•Rack system: Up to 72 accelerators (3 chassis x 12 boards x 2 chips), optimized for 10x-100x varying model sizes in ranking/recommendation workloads[2].
•Earlier MTIA variant: 256 MB on-chip memory, 128 GB off-chip LPDDR5, local memory 384 KB per PE with 1 TB/s bandwidth per PE[2].

🔮 Future ImplicationsAI analysis grounded in cited sources

Meta will deploy first 1 GW of AMD chips by end-2026

AMD-Meta agreement schedules shipments of custom MI450 GPUs and EPYC Venice CPUs starting 2H 2026 to support gigawatt-scale AI infrastructure[3][5].

MTIA chips expand to training workloads by 2027

Meta CFO stated plans to extend custom silicon from inference/ranking to AI model training, with new MTIA versions deploying in 2026-2027[4][7].

Meta's hybrid strategy reduces Nvidia dependency

Meta adopts AMD for inference while retaining Nvidia for training, scaling back some in-house ASIC plans in favor of AMD MI455X[6].

⏳ Timeline

2024-01

First-gen MTIA deployed for ranking/recommendation workloads

2025-12

Meta plans 1.3M+ Nvidia GPUs operational

2026-02

AMD-Meta announce 6 GW multi-year GPU/CPU partnership

2026-03

Meta reveals four MTIA chips for expanded AI workloads

📎 Sources (7)

Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.

🇬🇧Read original article on The Register - AI/ML

📰

Weekly AI Recap

Read this week's curated digest of top AI events →

👉Related Updates

Same topic

Explore #ai-chips

Same product

Anthropic in talks with Samsung for custom AI chips

cnBeta (Full RSS)•Jul 4

AI-curated news aggregator. All content rights belong to original publishers.
Original source: The Register - AI/ML ↗