Funding
DataBahn Raises $17M to Redefine Enterprise Data Pipelines

In a major vote of confidence for AI-native infrastructure, DataBahn has raised $17 million in a Series A round led by Forgepoint Capital, with support from S3 Ventures and GTM Capital. The funding brings the company’s total to $19 million and positions it as a next-generation powerhouse in enterprise data pipeline management.
At the heart of DataBahn’s mission is a seismic shift in how enterprises handle one of their most critical assets: telemetry. Traditionally scattered, noisy, and costly, telemetry data—from logs and events to application traces—has overwhelmed IT and security teams. But DataBahn’s platform reimagines the data layer entirely, bringing intelligence, control, and automation to every step of the pipeline.
From Data Plumbing to Data Intelligence
While legacy data tools focus on shuffling logs from point A to B, DataBahn is engineering what it calls a security-native data fabric—a foundational architecture that doesn’t just move data, but understands, enriches, and makes it AI-ready in real time.
This is enabled by their modular platform components, such as:
- Smart Edge for agentless data collection with edge analytics.
- Highway, an orchestration layer using AI to manage schema drift, reduce costs, and route data efficiently.
- Cruz, an AI agent that automates pipeline engineering tasks like parsing, normalizing, and monitoring.
- Reef, a contextual intelligence hub that turns raw data into actionable insights via graph-based correlation and AI-powered search.
These capabilities allow security, observability, and business teams to derive value from data instantly—whether by slashing SIEM log volume by 50%, onboarding new apps 10x faster, or proactively identifying IT anomalies before they escalate.
Introducing Agentic AI: Cruz
One of the standout innovations powering DataBahn’s rise is Cruz, the platform’s agentic AI. Unlike static scripts or brittle workflows, Cruz is a machine learning-powered “data engineer in a box” that adapts to changes in data sources, formats, and schemas automatically.
This agentic AI operates using reinforcement learning and semantic parsing to intelligently enrich or suppress incoming data based on context. For example, Cruz can identify when a specific log field is generating noise and suppress it across similar streams—eliminating the manual grunt work traditionally shouldered by data engineers.
This shift from procedural automation to intelligent autonomy—where agents make real-time decisions based on environmental feedback—is a hallmark of agentic AI. It marks a departure from reactive systems to proactive, self-optimizing infrastructure.
A Strategic Infrastructure Layer for the Modern Enterprise
The rise of DataBahn is timed with a global surge in data complexity. With enterprise telemetry sprawled across cloud, on-prem, IoT, and OT systems, and global data creation expected to top 394 zettabytes by 2028, the traditional data pipeline model simply cannot scale.
DataBahn’s federated architecture and mesh-based ingestion model offer a solution. By providing lossless, failover-friendly routing, the platform prevents pipeline breakage and data loss during surges—a common challenge in legacy setups. Furthermore, with 400+ prebuilt connectors and 900+ log volume reduction rules, DataBahn shortens deployment times dramatically and reduces reliance on expensive custom engineering.
Customers like CSL Behring, AXIS Capital, and Saviynt have praised the platform’s ability to transform data from a burden into a strategic advantage. Greg Stewart, senior director of cybersecurity at CSL Behring, noted that DataBahn “changed what data means to us,” turning it from a cost center into an operational weapon.
From Security to Observability and Beyond
Though initially focused on security telemetry, DataBahn’s reach is quickly expanding into observability, application performance, and IoT/OT. Its platform now acts as a unified control plane for enterprise data—an AI-powered foundation that offers transparency, governance, and flexibility across the entire data lifecycle.
And with persona-based federated search, DataBahn tailors insights to the end-user: CISOs get real-time threat analytics, SREs gain predictive outage visibility, and business analysts access enriched app data—all from the same data fabric.
Backed by Cybersecurity Veterans and AI Visionaries
Founded by alumni of top security vendors, Big Four consulting firms, and global banks, DataBahn was born out of frustration with the complexity of legacy tools. CEO Nanda Santhana and President Nithya Nareshkumar lead a team that deeply understands the problem—and has architected a solution from the ground up.
“Enterprises aren’t just overwhelmed by data volume; they’re being outpaced by its complexity,” said Santhana. “Our mission is to transform telemetry from a liability into a strategic asset.”
Forgepoint Capital’s managing director Ernie Bio, who joins the board as part of the funding round, highlighted DataBahn’s unique edge: “What’s truly rare is the customer enthusiasm. We heard consistent praise for the platform’s rapid ROI, forward-looking innovation, and responsiveness—qualities that separate great companies from the rest.”
Certainly. Here’s a rewritten final section that replaces the sales-driven language with a grounded look at the evolving state of the industry, where it’s heading, and what role platforms like DataBahn are likely to play:
The Expanding Role of Intelligent Data Infrastructure
The enterprise data landscape is undergoing a fundamental shift. As organizations adopt cloud-native architectures, hybrid work environments, and increasingly complex AI systems, the limitations of legacy data infrastructure have become more apparent. Data is no longer confined to centralized systems—it now spans edge devices, SaaS apps, cloud environments, and on-prem legacy systems.
This fragmentation is colliding with the exponential growth of machine data. By 2028, global data creation is expected to surpass 390 zettabytes, and enterprise telemetry—logs, traces, metrics, and events—will represent an outsized share of that growth. But while data volumes have exploded, the tooling to manage, contextualize, and act on that data has lagged behind.
In response, a new category of platforms is emerging: intelligent, AI-native data fabrics designed not just to transport data, but to understand it. These platforms aim to address challenges like schema drift, redundant ingestion, and real-time decision-making—issues that are especially critical for security, compliance, and AI readiness.
The market is also being reshaped by the growing adoption of agentic AI—autonomous agents that can parse and enrich data flows in real time without human intervention. This shift signals a move away from brittle, hard-coded data engineering workflows toward adaptive, learning-driven infrastructure.
Looking ahead, we can expect to see tighter convergence between data management, AI operations (AIOps), and cybersecurity. Observability will increasingly require context-aware data. Security teams will demand smarter pipelines that filter irrelevant noise before it even hits the SIEM. And AI initiatives will depend on architectures that can supply high-quality, low-latency data in real time.
Platforms like DataBahn are well-positioned to meet this moment—not as standalone tools, but as foundational layers in the modern data stack. As the complexity of telemetry increases, and as AI capabilities become table stakes, the need for composable, intelligent data infrastructure will only accelerate.












