๐Ÿ’ผFreshcollected in 27m

Databricks launches LTAP to eliminate AI data pipeline latency

Databricks launches LTAP to eliminate AI data pipeline latency
PostLinkedIn
๐Ÿ’ผRead original on VentureBeat

๐Ÿ’กEliminate ETL bottlenecks for AI agents with Databricks' new LTAP architecture for real-time data access.

โšก 30-Second TL;DR

What Changed

LTAP (Lake Transactional/Analytical Processing) unifies storage for Postgres transactional data and analytical lakehouse tables.

Why It Matters

This shift significantly reduces the complexity of building real-time AI applications by removing the 'ETL tax' on data. It allows developers to feed live operational data directly into analytical engines, enabling more responsive and intelligent AI agents.

What To Do Next

Evaluate your current ETL architecture and test if migrating to LTAP can reduce latency for your real-time AI agent workflows.

Who should care:Developers & AI Engineers

๐Ÿง  Deep Insight

Web-grounded analysis with 22 cited sources.

๐Ÿ”‘ Enhanced Key Takeaways

  • โ€ขLakehouse//RT is powered by a new real-time execution engine named Reyden, designed to handle the concurrency and latency demands of modern agentic enterprises.
  • โ€ขLTAP leverages Databricks' existing Lakebase platform, which functions as a fully-managed, serverless PostgreSQL database built on open object storage, specifically tailored for the agent era.
  • โ€ขIn preview, Lakehouse//RT demonstrated up to 16x better performance compared to traditional real-time serving layers, achieving response times as low as 10ms on smaller datasets and sub-100ms on larger ones, while sustaining sub-100ms latency at 12,000 queries per second on analytical benchmarks.
  • โ€ขThe LTAP architecture supports instant copy-on-write database branching, allowing developers to quickly spin up full-fidelity branches of live production databases for safe debugging and testing of AI agents with tools like GitHub Copilot.
  • โ€ขDatabricks positions LTAP as a distinct approach to Hybrid Transactional/Analytical Processing (HTAP), unifying data at the storage layer using open formats (Delta and Iceberg) rather than through engine convergence, aiming to avoid performance compromises and proprietary lock-in associated with traditional HTAP solutions.
๐Ÿ“Š Competitor Analysisโ–ธ Show

While Databricks' LTAP and Lakehouse//RT represent a novel approach to unifying transactional and analytical processing for AI agents, several other platforms offer solutions in the broader HTAP and real-time analytics space. Traditional HTAP databases have historically attempted to merge OLTP and OLAP workloads, and major cloud data platforms are also evolving their offerings to support real-time AI.

Feature/PlatformDatabricks LTAP / Lakehouse//RTTraditional HTAP (e.g., SingleStore, MySQL HeatWave, SAP HANA)Snowflake (with Unistore)Microsoft Fabric
ApproachStorage-layer unification (Lakebase + Lakehouse on open formats)Engine convergence (single database engine for both workloads)Multi-cluster, shared data architecture with transactional tablesUnified SaaS platform with OneLake and Copilot AI
Transactional DataServerless Postgres (Lakebase) writes directly to open columnar formats (Delta/Iceberg)Optimized for high-volume, low-latency OLTP transactionsHybrid tables for transactional workloads alongside analyticalIntegrates various data sources, including transactional
Analytical DataMillisecond query latency on Delta/Iceberg via Reyden engineReal-time analytics on operational dataStrong SQL analytics and BI capabilitiesComprehensive analytics, BI, and data science
Data MovementZero ETL, zero data copies, single source of truthAims to eliminate ETL between OLTP/OLAPReduces data movement between transactional and analytical storesOneLake centralizes data, reducing copies
AI Agent FocusDesigned specifically to accelerate AI agents with real-time, unified data accessSupports real-time feature stores and autonomous decision-making for AISnowpark ML for machine learning workflowsCopilot AI embedded across workloads
GovernanceUnified governance via Unity CatalogVaries by vendor, typically database-levelUnified governance and securityCentralized governance with OneLake
OpennessOpen formats (Delta, Iceberg), open source componentsOften proprietary, though some use open source databasesSupports Iceberg tables, but core platform is proprietaryOneLake uses Delta Parquet format
Performance (Analytical)Sub-100ms latency at 12,000 QPS (benchmarks)Low-latency for mixed workloadsOptimized for analytical queriesVaries by workload
PricingExisting Lakehouse customers can adopt as drop-in replacement, promotional pricing plannedVaries by vendor, often proprietaryUsage-based pricing, separates storage/computeCapacity-based pricing

๐Ÿ› ๏ธ Technical Deep Dive

  • Lakehouse//RT Engine (Reyden): This new compute engine is specifically designed to deliver millisecond query latency at massive scale, supporting tens of thousands of concurrent users and AI agents directly on governed Delta Lake and Apache Iceberg tables.
  • Direct Querying: Lakehouse//RT queries Delta and Iceberg tables directly, eliminating the need for proprietary formats, data copies, or separate ingestion pipelines.
  • Intelligent Autoscaling: The system features intelligent autoscaling capabilities, allowing it to rapidly scale up to meet demanding workloads and scale down to zero during idle periods to optimize costs.
  • LTAP Foundation (Lakebase): LTAP is built on Lakebase, a serverless PostgreSQL database that stores data directly in Unity Catalog using the same open formats (Delta and Iceberg) as the Lakehouse. This ensures a single source of truth for operational, analytical, and streaming data.
  • PostgreSQL Compatibility: LTAP maintains PostgreSQL compatibility for applications, allowing transactional applications to operate with native PostgreSQL performance while data is instantly written to columnar formats for analytics.
  • Copy-on-Write Database Branching: Lakebase supports instant copy-on-write database branching, enabling developers to create full-fidelity, temporary branches of live production databases in seconds for safe debugging and development of AI agents.
  • Unified Governance (Unity Catalog): Both Lakehouse//RT and LTAP operate under the Unity Catalog's governance framework, providing a single identity, permissions, and audit model across all operational, analytical, and streaming data.

๐Ÿ”ฎ Future ImplicationsAI analysis grounded in cited sources

The development and deployment of production-grade AI agents will significantly accelerate.
By eliminating data pipeline latency and simplifying the data stack, developers can iterate faster and deploy AI agents that rely on real-time, unified data more efficiently, moving AI from experimental pilots to automated workflows.
The Lakehouse architecture will become the dominant paradigm for both operational and analytical workloads.
LTAP's ability to natively handle transactional data alongside analytical data on a single, open-format storage layer removes a major barrier, making the lakehouse a more comprehensive and unified platform for all enterprise data needs.
The competitive landscape for Hybrid Transactional/Analytical Processing (HTAP) solutions will undergo a significant shift.
Databricks' novel approach to HTAP, unifying at the storage layer with open formats, could challenge traditional HTAP vendors who focus on engine convergence and proprietary systems, potentially leading to broader industry adoption of storage-centric HTAP.

โณ Timeline

2019
Databricks publicly introduced Delta Lake, an open-source storage layer bringing reliability to data lakes.
2020
Databricks coined the term 'Lakehouse' to describe an architecture combining data lake flexibility with data warehouse capabilities.
2021-05
Delta Lake, Hudi, and Iceberg projects enabled building warehousing-like capabilities directly on the data lake, bringing structure, reliability, and performance.
2025-05
Databricks acquired Neon, a PostgreSQL-based OLTP system, as a strategic move to natively run transactional workloads within its Lakehouse.
2025
Databricks introduced synchronization capabilities to move data between operational and analytical environments, a precursor to the more radical LTAP approach.
2026-06-16
Databricks launched LTAP (Lake Transactional/Analytical Processing) and Lakehouse//RT, unifying transactional and analytical data storage for real-time AI agents.
๐Ÿ“ฐ

Weekly AI Recap

Read this week's curated digest of top AI events โ†’

๐Ÿ‘‰Related Updates

AI-curated news aggregator. All content rights belong to original publishers.
Original source: VentureBeat โ†—