
ScaleOps Raises $130M Series C to Push Autonomous Infrastructure Into the Core of AI Operations

ScaleOps Team (Photo Credit: Menash Cohen)

ScaleOps has raised $130 million in Series C funding at a valuation exceeding $800 million, as demand accelerates for systems that can manage increasingly complex cloud and AI workloads without constant human intervention.

The round was led by Insight Partners, with continued backing from Lightspeed Venture Partners, NFX, Glilot Capital Partners, and Picture Capital. The company has now raised more than $210 million in total.

From Kubernetes Optimization to Autonomous Infrastructure

ScaleOps emerged in 2022 with a focus on automating one of the most persistent challenges in modern infrastructure: resource allocation in Kubernetes environments. Instead of relying on static configurations, the platform continuously adjusts compute resources such as CPU and memory based on real-time workload behavior.

At its core, the system operates as a closed-loop optimization engine. It observes workload demand, evaluates performance signals, and executes changes automatically. This includes dynamically resizing workloads, adjusting replica counts, and optimizing node usage across clusters.
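The observe-evaluate-act loop described above can be sketched in a few lines. This is a hedged illustration only: the function and class names (`Resources`, `recommend`, `reconcile`), the headroom margin, and the drift tolerance are all hypothetical, not ScaleOps' actual implementation or API.

```python
# Minimal sketch of one iteration of a closed-loop rightsizing
# controller. All names and parameters here are illustrative
# assumptions, not ScaleOps' real code.

from dataclasses import dataclass


@dataclass
class Resources:
    cpu_millicores: int
    memory_mib: int


def recommend(observed: Resources, headroom: float = 0.2) -> Resources:
    """Evaluate: size the workload to observed demand plus a safety margin."""
    return Resources(
        cpu_millicores=int(observed.cpu_millicores * (1 + headroom)),
        memory_mib=int(observed.memory_mib * (1 + headroom)),
    )


def reconcile(observed: Resources, current: Resources,
              tolerance: float = 0.1) -> Resources:
    """One pass of the loop: observe demand, compute a target,
    and act only when the target drifts beyond the tolerance,
    so the controller does not thrash on small fluctuations."""
    target = recommend(observed)
    drift = (abs(target.cpu_millicores - current.cpu_millicores)
             / current.cpu_millicores)
    if drift > tolerance:
        return target   # act: resize the workload
    return current      # within tolerance: leave the allocation alone


# Example: observed demand is 900m CPU while the pod is sized at 500m,
# so the controller resizes; a later pass with stable demand is a no-op.
resized = reconcile(Resources(900, 1024), Resources(500, 2048))
```

In a real cluster this loop would run continuously against live metrics, with the "act" step issuing resize or replica-count changes through the orchestrator rather than returning a value.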

Over time, this has expanded beyond traditional cloud optimization into GPU-heavy environments tied to AI systems. The platform now applies similar principles to GPU allocation, where inefficient usage can quickly become a major bottleneck.

Why This Layer Is Becoming Critical

Modern AI systems introduce a level of variability that traditional infrastructure tools were not designed to handle. Workloads spike unpredictably, models compete for limited GPU capacity, and performance requirements shift in real time.

Historically, teams have managed this through manual tuning or rule-based automation. But these approaches struggle at scale. Kubernetes itself, while designed for orchestration, still relies heavily on static configurations that engineers must constantly adjust to avoid under- or over-provisioning.
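The static configuration at issue looks like this in an ordinary Kubernetes pod spec: fixed CPU and memory requests and limits, set at deploy time and unchanged until an engineer edits and re-applies them. The values below are purely illustrative.

```yaml
# Illustrative pod spec fragment. These numbers are guesses made at
# deploy time; they stay fixed no matter how demand actually behaves.
resources:
  requests:
    cpu: "500m"      # too low -> CPU throttling under load
    memory: "1Gi"    # too low -> OOM kills; too high -> wasted capacity
  limits:
    cpu: "1"
    memory: "2Gi"
```

Keeping figures like these accurate across hundreds of workloads is exactly the manual tuning burden the paragraph describes.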

This creates a structural inefficiency. Engineering teams spend significant time reacting to performance issues, tuning resource allocation, and resolving SLO violations rather than building new systems.

ScaleOps’ approach replaces this reactive model with continuous, context-aware automation. Instead of defining thresholds in advance, the system adapts in real time based on observed behavior across the entire cluster.

Moving Toward Self-Optimizing Systems

What makes this category notable is not just cost optimization, but the shift in how infrastructure behaves.

In traditional environments, infrastructure is configured. In newer systems, it is expected to adapt.

ScaleOps reflects this transition by treating infrastructure as something that can continuously re-balance itself. The platform applies policies, monitors outcomes, and adjusts allocations without requiring engineers to intervene. In production environments, this can extend to nearly full automation of resource decisions.

This model has already shown measurable impact in real-world deployments. In one case, automated rightsizing allowed workloads to adjust dynamically to traffic fluctuations while maintaining performance targets and reducing waste across clusters.

Expanding Beyond Cost Optimization

Early cloud optimization tools focused primarily on reducing spend. ScaleOps still emphasizes cost efficiency, with claims of significant savings driven by eliminating over-provisioning and idle resources.

But the broader direction is shifting toward performance, reliability, and system resilience.

As AI systems become more tightly integrated into production environments, infrastructure decisions directly impact model performance, latency, and availability. Misallocation is no longer just a cost issue; it becomes a functional limitation.

This is particularly relevant for GPU workloads, where demand often exceeds supply. Efficient allocation at the infrastructure layer can determine whether models run efficiently or sit idle waiting for resources.

A New Control Plane for AI Infrastructure

The longer-term implication is the emergence of a new control layer for computing resources.

Instead of engineers manually configuring infrastructure or relying on fragmented tooling, platforms like ScaleOps act as an autonomous control plane that continuously aligns supply with demand.

This becomes increasingly important as organizations deploy multiple AI systems simultaneously, each with different performance requirements and usage patterns.

In that context, infrastructure management begins to resemble traffic control more than static provisioning. Resources are routed, rebalanced, and optimized continuously rather than assigned once and left unchanged.

With this latest funding, ScaleOps is expected to expand its platform beyond compute and GPUs into broader areas of infrastructure management, including additional layers of cloud and AI orchestration.

The trajectory suggests a move toward environments where infrastructure decisions are no longer handled directly by engineers. Instead, they are governed by systems that interpret workload intent and adjust resources automatically.

As AI workloads continue to scale, this shift may become less of an optimization and more of a requirement.

Antoine is a visionary leader and founding partner of Unite.AI, driven by an unwavering passion for shaping and promoting the future of AI and robotics. A serial entrepreneur, he believes that AI will be as disruptive to society as electricity, and is often caught raving about the potential of disruptive technologies and AGI.

As a futurist, he is dedicated to exploring how these innovations will shape our world. In addition, he is the founder of Securities.io, a platform focused on investing in cutting-edge technologies that are redefining the future and reshaping entire sectors.