Skip to main content
Registration is now open! Early-bird pricing available through May 5, 2026. Register now

All Accepted Demos

L.A.K.E.: Logic Agent for Knowledge Extraction in Data Planning

Jean-Flavien Bussotti (Megagon Labs), Naoki Otani (Megagon Labs), Eser Kandogan (Megagon Labs)

Architectural Patterns & Composition Engineering & Operations

Summary

An agentic data planning framework that maps natural language questions to executable workflows over diverse data lake sources, with interactive DAG-based provenance visualization.

Description

Data lakes in modern enterprises are massive, heterogeneous, and noisy, often preventing non-experts from effectively extracting value. Bridging the semantic gap between ambiguous user intent and explicit data requires orchestrating multiple tools under an open-world assumption. However, reliably executing these compound AI workflows to solve complex extraction tasks, while also providing the transparency needed for evaluation and debugging, remains a significant bottleneck. We propose L.A.K.E. (Logic Agent for Knowledge Extraction), an agentic data planning framework designed to map natural language questions to executable workflows over diverse data sources. Rather than relying on a brittle "one-size-fits-all" approach, L.A.K.E. dynamically generates a declarative plan comprised of modular operators — spanning relational (SQL), text, operators, and semantic functions. Within this framework, we introduce and benchmark three distinct planning regimes: Cascade Planning, Single-Shot Tree Planning, and Iterative Planning. We present an interactive demonstration platform that enables users to visually compare the latency and robustness trade-offs of these planners. By rendering execution paths as interactive Directed Acyclic Graphs (DAGs) with step-level provenance, L.A.K.E. provides the critical observability needed to establish trust, debug failures, and optimize data planning for enterprise-scale lakes.

ACM CAIS 2026 Sponsors