Reports
Flux’s AI Code Generation Reality Check Finds Enterprise Code Velocity Is Outrunning Visibility

Flux’s new AI Code Generation Reality Check report, based on independent research conducted by Dimensional Research, shows that AI-generated code has moved well beyond experimentation and into mainstream enterprise software development. The survey of 309 engineering leaders and practitioners across five continents found that 44.7% of organizations already have AI-generated code running in production, while another 35.0% are using AI to write code but have not yet shipped it.
AI-Generated Code Has Become Standard, But Trust Has Not Caught Up
The report makes clear that the debate is no longer whether engineering teams will use AI to write code. That shift has already happened. The more important question is whether organizations can understand, review, secure, and govern the growing volume of code AI is helping produce.
Only a small share of respondents are still on the sidelines. While 44.7% have AI-generated code in production and 35.0% are using it outside production, another 16.2% plan to use AI-generated code within the next 12 months, and 4.2% plan to do so later. In practical terms, the report suggests near-universal adoption is approaching, but deployment confidence remains uneven.
That hesitation is not rooted in a lack of productivity. It is rooted in visibility. Flux frames the problem as an “AI visibility gap”: teams can now generate code faster than they can confidently inspect, contextualize, and control it.
AI Is Being Trusted First With Low-Risk, Repetitive Work
The strongest adoption is happening in areas where patterns are predictable and failures are easier to contain. According to the report, engineering teams are using AI-generated code most often for documentation at 68.7%, unit testing at 65.9%, and simple functions at 57.7%. Code review also appears at 57.7%, while 50.4% of respondents say AI is being used to create new features.
That distribution is revealing. Organizations are not blindly handing over core architecture or mission-critical workflows. They are starting with repetitive, structured tasks where AI can reduce tedious work and improve speed without immediately introducing large-scale business risk.
The productivity gains are real. Among current users of AI-generated code, 67.1% report increased productivity, 61.8% report faster prototyping, 58.5% report better documentation, and 48.4% report reduced development costs. But the report also shows a gap between expectation and reality on quality. While 47.6% of non-users expect AI-generated code to reduce errors, only 34.6% of current users say they are actually seeing fewer errors.
The Bottleneck Has Moved From Writing Code to Reviewing Code
AI has made code creation easier, but that has pushed more pressure onto review, testing, and risk management. Nearly 80% of respondents spend at least 10% of their time on code review, and about one in ten spend 41% or more of their time reviewing code.
That matters because AI-generated code changes the rhythm of software development. More code can be produced, pull requests can grow in volume, and reviewers may have less context about how or why something was created. The report finds that the biggest challenges in understanding codebase changes include complex code at 53.7%, different development teams using different approaches at 46.3%, poor documentation at 43.0%, and large volumes of changes at 37.9%.
This is where the risk becomes more than theoretical. When asked which week-to-week changes are hardest to detect, respondents pointed to security issues at 49.2%, dependency changes at 47.7%, and performance impacts at 44.1%. These are not minor concerns; they are precisely the types of changes that can create production incidents, compliance exposure, or long-term technical debt.
AI-Generated Code Is Not Clearly Better or Worse, But It Is Different
One of the more interesting findings is that respondents are split on whether AI-generated code creates more problems than human-written code. 32.9% say AI-generated code creates somewhat or substantially more issues, while 33.4% say it creates somewhat or substantially fewer issues, and 29.7% say it creates about the same number of issues.
That split suggests the impact of AI-generated code depends heavily on the environment around it. AI code may perform well when paired with strong review practices, test coverage, security tooling, and governance. It may create more problems when organizations adopt it for speed without upgrading the systems used to evaluate the output.
The negative impacts reported by organizations show where the friction is appearing. 41.1% cite reduced learning opportunities for junior developers, 32.6% say AI-generated code does not deliver the requested functionality, 31.6% report unintended dependencies, 31.6% report security vulnerabilities, and 29.5% say AI-generated code can be hard to debug.
AI Code Risk Has Moved Beyond Engineering
The report also shows that AI-generated code is no longer just an engineering management issue. It has become an enterprise risk topic.
Security teams are the most concerned stakeholder group, cited by 62.5% of respondents. Compliance follows at 51.5%, while 46.9% cite CTO or CIO leadership, and 40.8% point to legal teams. The concern also extends to operations, QA, product management, CEOs, customer success, and even marketing.
That widening stakeholder map reflects a larger shift. Once AI-generated code reaches production, its consequences can affect data protection, customer experience, security posture, auditability, and contractual obligations. The code may be written inside the engineering organization, but the risk is distributed across the business.
Safeguards Are Becoming Core Production Infrastructure
Enterprises are already responding by spending on new safeguards. The report found that 45.6% have invested in code quality analysis tools, 39.0% are using automated code review tools, 38.5% have added static application security testing, 35.9% have adopted software composition analysis, 32.3% are using interactive application security testing, and 31.3% have implemented training classes for specific coding assistants.
Process changes are also widespread. 57.4% have introduced policies outlining the use of AI-generated code, 49.2% require training on using AI-generated code, 45.1% have made code reviews more robust, and 40.5% have assigned more developers to focus on code review.
These investments are not happening in a vacuum. Only 3.6% of respondents say AI-introduced issues never reach production. By contrast, 4.6% say they reach production often, 30.3% say sometimes, 31.8% say occasionally, and 23.6% say rarely. In other words, for most organizations using AI-generated code, production impact is already a recurring reality.
The Next Phase of AI Coding Will Be About Control
The report also captures a paradox: many engineering leaders believe AI may help solve the review burden that AI itself is increasing. 64.9% of respondents believe AI could outperform humans in at least some aspects of code review, while 21.1% disagree and 14.0% have no opinion.
Respondents see AI as potentially stronger at applying uniform standards and providing more thorough analysis, both at 57.4%. They also point to identifying patterns across codebases at 53.3%, quicker feedback at 51.8%, detecting more issues at 49.7%, and 24/7 availability at 48.2%.
That points to the next stage of enterprise AI adoption in software development. AI will not simply write more code. It will increasingly be used to inspect, classify, prioritize, and govern code changes. The winners may not be the teams that generate the most code, but the teams that build the clearest picture of what changed, where risk is accumulating, and which human decisions still matter.
The AI Code Generation Reality Check Shows the Real Enterprise Challenge
Flux’s report ultimately points to a more mature phase of AI adoption in engineering. The code itself is no longer the hard part. The harder problem is visibility, review capacity, governance, and trust.
AI-generated code is now in production at nearly half of surveyed organizations, but the supporting systems around it are still catching up. For engineering leaders, the implication is clear: AI coding can accelerate development, but only if organizations can also scale the safeguards, context, and accountability needed to keep production systems reliable. As the AI Code Generation Reality Check makes clear, the future of AI-assisted software development will be defined less by how much code AI can produce, and more by how confidently companies can understand and ship it.












